IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v18y2021i12p6591-d577793.html
   My bibliography  Save this article

Enhancing Influenza Epidemics Forecasting Accuracy in China with Both Official and Unofficial Online News Articles, 2019–2020

Author

Listed:
  • Jingwei Li

    (School of Management, Xi’an Jiaotong University, Xi’an 710049, China
    Department of Information Systems, City University of Hong Kong, Hong Kong 999077, China)

  • Choon-Ling Sia

    (Department of Information Systems, City University of Hong Kong, Hong Kong 999077, China)

  • Zhuo Chen

    (College of Public Health, University of Georgia, Athens, GA 30602, USA
    School of Economics, University of Nottingham Ningbo China, Ningbo 315000, China)

  • Wei Huang

    (College of Business, Southern University of Science and Technology, Shenzhen 518000, China)

Abstract

Real-time online data sources have contributed to timely and accurate forecasting of influenza activities while also suffered from instability and linguistic noise. Few previous studies have focused on unofficial online news articles, which are abundant in their numbers, rich in information, and relatively low in noise. This study examined whether monitoring both official and unofficial online news articles can improve influenza activity forecasting accuracy during influenza outbreaks. Data were retrieved from a Chinese commercial online platform and the website of the Chinese National Influenza Center. We modeled weekly fractions of influenza-related online news articles and compared them against weekly influenza-like illness (ILI) rates using autoregression analyses. We retrieved 153,958,695 and 149,822,871 online news articles focusing on the south and north of mainland China separately from 6 October 2019 to 17 May 2020. Our model based on online news articles could significantly improve the forecasting accuracy, compared to other influenza surveillance models based on historical ILI rates ( p = 0.002 in the south; p = 0.000 in the north) or adding microblog data as an exogenous input ( p = 0.029 in the south; p = 0.000 in the north). Our finding also showed that influenza forecasting based on online news articles could be 1–2 weeks ahead of official ILI surveillance reports. The results revealed that monitoring online news articles could supplement traditional influenza surveillance systems, improve resource allocation, and offer models for surveillance of other emerging diseases.

Suggested Citation

  • Jingwei Li & Choon-Ling Sia & Zhuo Chen & Wei Huang, 2021. "Enhancing Influenza Epidemics Forecasting Accuracy in China with Both Official and Unofficial Online News Articles, 2019–2020," IJERPH, MDPI, vol. 18(12), pages 1-13, June.
  • Handle: RePEc:gam:jijerp:v:18:y:2021:i:12:p:6591-:d:577793
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/18/12/6591/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/18/12/6591/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Zou, Hui & Yang, Yuhong, 2004. "Combining time series models for forecasting," International Journal of Forecasting, Elsevier, vol. 20(1), pages 69-84.
    2. Smolinski, M.S. & Crawley, A.W. & Baltrusaitis, K. & Chunara, R. & Olsen, J.M. & Wójcik, O. & Santillana, M. & Nguyen, A. & Brownstein, J.S., 2015. "Flu near you: Crowdsourced symptom reporting spanning 2 influenza seasons," American Journal of Public Health, American Public Health Association, vol. 105(10), pages 2124-2130.
    3. Kuhn, Max, 2008. "Building Predictive Models in R Using the caret Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 28(i05).
    4. Hswen, Y. & Brownstein, J.S. & Liu, J. & Hawkins, J.B., 2017. "Use of a digital health application for influenza surveillance in China," American Journal of Public Health, American Public Health Association, vol. 107(7), pages 1130-1136.
    5. Jeremy Ginsberg & Matthew H. Mohebbi & Rajan S. Patel & Lynnette Brammer & Mark S. Smolinski & Larry Brilliant, 2009. "Detecting influenza epidemics using search engine query data," Nature, Nature, vol. 457(7232), pages 1012-1014, February.
    6. Jan Kreft, 2019. "Instant Articles (Facebook): The Impact of Trust and Relations Among the Partners Pursuing the Strategy of Coopetition," Eurasian Studies in Business and Economics, in: Mehmet Huseyin Bilgin & Hakan Danis & Ender Demir & Ugur Can (ed.), Eurasian Business Perspectives, pages 243-253, Springer.
    7. Yufang Wang & Kuai Xu & Yun Kang & Haiyan Wang & Feng Wang & Adrian Avram, 2020. "Regional Influenza Prediction with Sampling Twitter Data and PDE Model," IJERPH, MDPI, vol. 17(3), pages 1-12, January.
    8. David A Broniatowski & Michael J Paul & Mark Dredze, 2013. "National and Local Influenza Surveillance through Twitter: An Analysis of the 2012-2013 Influenza Epidemic," PLOS ONE, Public Library of Science, vol. 8(12), pages 1-1, December.
    9. repec:aph:ajpbhl:10.2105/ajph.2017.303767_1 is not listed on IDEAS
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kuchler, Theresa & Russel, Dominic & Stroebel, Johannes, 2022. "JUE Insight: The geographic spread of COVID-19 correlates with the structure of social networks as measured by Facebook," Journal of Urban Economics, Elsevier, vol. 127(C).
    2. Sasikiran Kandula & Jeffrey Shaman, 2019. "Reappraising the utility of Google Flu Trends," PLOS Computational Biology, Public Library of Science, vol. 15(8), pages 1-16, August.
    3. Fantazzini, Dean, 2020. "Short-term forecasting of the COVID-19 pandemic using Google Trends data: Evidence from 158 countries," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 59, pages 33-54.
    4. Samuel V Scarpino & James G Scott & Rosalind M Eggo & Bruce Clements & Nedialko B Dimitrov & Lauren Ancel Meyers, 2020. "Socioeconomic bias in influenza surveillance," PLOS Computational Biology, Public Library of Science, vol. 16(7), pages 1-19, July.
    5. Hongying Dai & Brian R. Lee & Jianqiang Hao, 2017. "Predicting Asthma Prevalence by Linking Social Media Data and Traditional Surveys," The ANNALS of the American Academy of Political and Social Science, , vol. 669(1), pages 75-92, January.
    6. Zeynep Ertem & Dorrie Raymond & Lauren Ancel Meyers, 2018. "Optimal multi-source forecasting of seasonal influenza," PLOS Computational Biology, Public Library of Science, vol. 14(9), pages 1-16, September.
    7. Jose L Herrera & Ravi Srinivasan & John S Brownstein & Alison P Galvani & Lauren Ancel Meyers, 2016. "Disease Surveillance on Complex Social Networks," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-16, July.
    8. Ibrahim Musa & Hyun Woo Park & Lkhagvadorj Munkhdalai & Keun Ho Ryu, 2018. "Global Research on Syndromic Surveillance from 1993 to 2017: Bibliometric Analysis and Visualization," Sustainability, MDPI, vol. 10(10), pages 1-20, September.
    9. Valentina Lorenzoni & Gianni Andreozzi & Andrea Bazzani & Virginia Casigliani & Salvatore Pirri & Lara Tavoschi & Giuseppe Turchetti, 2022. "How Italy Tweeted about COVID-19: Detecting Reactions to the Pandemic from Social Media," IJERPH, MDPI, vol. 19(13), pages 1-14, June.
    10. Cordova, Amado & Stanley, Karlyn D., 2021. "Public-private partnership for building a resilient broadband infrastructure in Puerto Rico," Telecommunications Policy, Elsevier, vol. 45(4).
    11. Amir Hassan Zadeh & Hamed M. Zolbanin & Ramesh Sharda & Dursun Delen, 2019. "Social Media for Nowcasting Flu Activity: Spatio-Temporal Big Data Analysis," Information Systems Frontiers, Springer, vol. 21(4), pages 743-760, August.
    12. Qiong Jia & Yue Guo & Guanlin Wang & Stuart J. Barnes, 2020. "Big Data Analytics in the Fight against Major Public Health Incidents (Including COVID-19): A Conceptual Framework," IJERPH, MDPI, vol. 17(17), pages 1-21, August.
    13. Sequoia I Leuba & Reza Yaesoubi & Marina Antillon & Ted Cohen & Christoph Zimmer, 2020. "Tracking and predicting U.S. influenza activity with a real-time surveillance network," PLOS Computational Biology, Public Library of Science, vol. 16(11), pages 1-14, November.
    14. Tomeny, Theodore S. & Vargo, Christopher J. & El-Toukhy, Sherine, 2017. "Geographic and demographic correlates of autism-related anti-vaccine beliefs on Twitter, 2009-15," Social Science & Medicine, Elsevier, vol. 191(C), pages 168-175.
    15. Logan C Brooks & David C Farrow & Sangwon Hyun & Ryan J Tibshirani & Roni Rosenfeld, 2018. "Nonmechanistic forecasts of seasonal influenza with iterative one-week-ahead distributions," PLOS Computational Biology, Public Library of Science, vol. 14(6), pages 1-29, June.
    16. David H Chae & Sean Clouston & Mark L Hatzenbuehler & Michael R Kramer & Hannah L F Cooper & Sacoby M Wilson & Seth I Stephens-Davidowitz & Robert S Gold & Bruce G Link, 2015. "Association between an Internet-Based Measure of Area Racism and Black Mortality," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-12, April.
    17. Prabal Das & D. A. Sachindra & Kironmala Chanda, 2022. "Machine Learning-Based Rainfall Forecasting with Multiple Non-Linear Feature Selection Algorithms," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(15), pages 6043-6071, December.
    18. Jie Zhao & Ji Chen & Damien Beillouin & Hans Lambers & Yadong Yang & Pete Smith & Zhaohai Zeng & Jørgen E. Olesen & Huadong Zang, 2022. "Global systematic review with meta-analysis reveals yield advantage of legume-based rotations and its drivers," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    19. Piaopiao Chen & Agnès H. Michel & Jianzhi Zhang, 2022. "Transposon insertional mutagenesis of diverse yeast strains suggests coordinated gene essentiality polymorphisms," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    20. Paulo Infante & Gonçalo Jacinto & Anabela Afonso & Leonor Rego & Pedro Nogueira & Marcelo Silva & Vitor Nogueira & José Saias & Paulo Quaresma & Daniel Santos & Patrícia Góis & Paulo Rebelo Manuel, 2023. "Factors That Influence the Type of Road Traffic Accidents: A Case Study in a District of Portugal," Sustainability, MDPI, vol. 15(3), pages 1-16, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:18:y:2021:i:12:p:6591-:d:577793. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.