IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0188941.html
   My bibliography  Save this article

Forecasting influenza-like illness dynamics for military populations using neural networks and social media

Author

Listed:
  • Svitlana Volkova
  • Ellyn Ayton
  • Katherine Porterfield
  • Courtney D Corley

Abstract

This work is the first to take advantage of recurrent neural networks to predict influenza-like illness (ILI) dynamics from various linguistic signals extracted from social media data. Unlike other approaches that rely on timeseries analysis of historical ILI data and the state-of-the-art machine learning models, we build and evaluate the predictive power of neural network architectures based on Long Short Term Memory (LSTMs) units capable of nowcasting (predicting in “real-time”) and forecasting (predicting the future) ILI dynamics in the 2011 – 2014 influenza seasons. To build our models we integrate information people post in social media e.g., topics, embeddings, word ngrams, stylistic patterns, and communication behavior using hashtags and mentions. We then quantitatively evaluate the predictive power of different social media signals and contrast the performance of the-state-of-the-art regression models with neural networks using a diverse set of evaluation metrics. Finally, we combine ILI and social media signals to build a joint neural network model for ILI dynamics prediction. Unlike the majority of the existing work, we specifically focus on developing models for local rather than national ILI surveillance, specifically for military rather than general populations in 26 U.S. and six international locations., and analyze how model performance depends on the amount of social media data available per location. Our approach demonstrates several advantages: (a) Neural network architectures that rely on LSTM units trained on social media data yield the best performance compared to previously used regression models. (b) Previously under-explored language and communication behavior features are more predictive of ILI dynamics than stylistic and topic signals expressed in social media. (c) Neural network models learned exclusively from social media signals yield comparable or better performance to the models learned from ILI historical data, thus, signals from social media can be potentially used to accurately forecast ILI dynamics for the regions where ILI historical data is not available. (d) Neural network models learned from combined ILI and social media signals significantly outperform models that rely solely on ILI historical data, which adds to a great potential of alternative public sources for ILI dynamics prediction. (e) Location-specific models outperform previously used location-independent models e.g., U.S. only. (f) Prediction results significantly vary across geolocations depending on the amount of social media data available and ILI activity patterns. (g) Model performance improves with more tweets available per geo-location e.g., the error gets lower and the Pearson score gets higher for locations with more tweets.

Suggested Citation

  • Svitlana Volkova & Ellyn Ayton & Katherine Porterfield & Courtney D Corley, 2017. "Forecasting influenza-like illness dynamics for military populations using neural networks and social media," PLOS ONE, Public Library of Science, vol. 12(12), pages 1-22, December.
  • Handle: RePEc:plo:pone00:0188941
    DOI: 10.1371/journal.pone.0188941
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0188941
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0188941&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0188941?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Nicholas Generous & Geoffrey Fairchild & Alina Deshpande & Sara Y Del Valle & Reid Priedhorsky, 2014. "Global Disease Monitoring and Forecasting with Wikipedia," PLOS Computational Biology, Public Library of Science, vol. 10(11), pages 1-16, November.
    2. David A Broniatowski & Michael J Paul & Mark Dredze, 2013. "National and Local Influenza Surveillance through Twitter: An Analysis of the 2012-2013 Influenza Epidemic," PLOS ONE, Public Library of Science, vol. 8(12), pages 1-1, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sangwon Chae & Sungjun Kwon & Donghyun Lee, 2018. "Predicting Infectious Disease Using Deep Learning and Big Data," IJERPH, MDPI, vol. 15(8), pages 1-20, July.
    2. Victor Olsavszky & Mihnea Dosius & Cristian Vladescu & Johannes Benecke, 2020. "Time Series Analysis and Forecasting with Automated Machine Learning on a National ICD-10 Database," IJERPH, MDPI, vol. 17(14), pages 1-17, July.
    3. Daniel Alejandro Gónzalez-Bandala & Juan Carlos Cuevas-Tello & Daniel E. Noyola & Andreu Comas-García & Christian A García-Sepúlveda, 2020. "Computational Forecasting Methodology for Acute Respiratory Infectious Disease Dynamics," IJERPH, MDPI, vol. 17(12), pages 1-20, June.
    4. Taichi Murayama & Nobuyuki Shimizu & Sumio Fujita & Shoko Wakamiya & Eiji Aramaki, 2020. "Robust two-stage influenza prediction model considering regular and irregular trends," PLOS ONE, Public Library of Science, vol. 15(5), pages 1-14, May.
    5. Mohammed A. A. Al-qaness & Ahmed A. Ewees & Hong Fan & Mohamed Abd Elaziz, 2020. "Optimized Forecasting Method for Weekly Influenza Confirmed Cases," IJERPH, MDPI, vol. 17(10), pages 1-12, May.
    6. Maria Glenski & Tim Weninger & Svitlana Volkova, 2019. "Improved Forecasting of Cryptocurrency Price using Social Signals," Papers 1907.00558, arXiv.org.
    7. Dave Osthus & Ashlynn R Daughton & Reid Priedhorsky, 2019. "Even a good influenza forecasting model can benefit from internet-based nowcasts, but those benefits are limited," PLOS Computational Biology, Public Library of Science, vol. 15(2), pages 1-19, February.
    8. Nurlan Temirbekov & Marzhan Temirbekova & Dinara Tamabay & Syrym Kasenov & Seilkhan Askarov & Zulfiya Tukenova, 2023. "Assessment of the Negative Impact of Urban Air Pollution on Population Health Using Machine Learning Method," IJERPH, MDPI, vol. 20(18), pages 1-15, September.
    9. Yu-Chih Wei & Yan-Ling Ou & Jianqiang Li & Wei-Chen Wu, 2022. "Forecasting the Potential Number of Influenza-like Illness Cases by Fusing Internet Public Opinion," Sustainability, MDPI, vol. 14(5), pages 1-24, February.
    10. Tian-Shyug Lee & I-Fei Chen & Ting-Jen Chang & Chi-Jie Lu, 2020. "Forecasting Weekly Influenza Outpatient Visits Using a Two-Dimensional Hierarchical Decision Tree Scheme," IJERPH, MDPI, vol. 17(13), pages 1-15, July.
    11. Kookjin Lee & Jaideep Ray & Cosmin Safta, 2021. "The predictive skill of convolutional neural networks models for disease forecasting," PLOS ONE, Public Library of Science, vol. 16(7), pages 1-26, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Samuel V Scarpino & James G Scott & Rosalind M Eggo & Bruce Clements & Nedialko B Dimitrov & Lauren Ancel Meyers, 2020. "Socioeconomic bias in influenza surveillance," PLOS Computational Biology, Public Library of Science, vol. 16(7), pages 1-19, July.
    2. Zeynep Ertem & Dorrie Raymond & Lauren Ancel Meyers, 2018. "Optimal multi-source forecasting of seasonal influenza," PLOS Computational Biology, Public Library of Science, vol. 14(9), pages 1-16, September.
    3. Ibrahim Musa & Hyun Woo Park & Lkhagvadorj Munkhdalai & Keun Ho Ryu, 2018. "Global Research on Syndromic Surveillance from 1993 to 2017: Bibliometric Analysis and Visualization," Sustainability, MDPI, vol. 10(10), pages 1-20, September.
    4. Logan C Brooks & David C Farrow & Sangwon Hyun & Ryan J Tibshirani & Roni Rosenfeld, 2018. "Nonmechanistic forecasts of seasonal influenza with iterative one-week-ahead distributions," PLOS Computational Biology, Public Library of Science, vol. 14(6), pages 1-29, June.
    5. Kuchler, Theresa & Russel, Dominic & Stroebel, Johannes, 2022. "JUE Insight: The geographic spread of COVID-19 correlates with the structure of social networks as measured by Facebook," Journal of Urban Economics, Elsevier, vol. 127(C).
    6. Hyekyung Woo & Youngtae Cho & Eunyoung Shim & Kihwang Lee & Gilyoung Song, 2015. "Public Trauma after the Sewol Ferry Disaster: The Role of Social Media in Understanding the Public Mood," IJERPH, MDPI, vol. 12(9), pages 1-10, September.
    7. HeeChel Kim & Hong-Woo Chun & Seonho Kim & Byoung-Youl Coh & Oh-Jin Kwon & Yeong-Ho Moon, 2017. "Longitudinal Study-Based Dementia Prediction for Public Health," IJERPH, MDPI, vol. 14(9), pages 1-16, August.
    8. Paolo BRUNORI & Giuliano RESCE, 2020. "Searching for the peak Google Trends and the Covid-19 outbreak in Italy," Working Papers - Economics wp2020_05.rdf, Universita' degli Studi di Firenze, Dipartimento di Scienze per l'Economia e l'Impresa.
    9. Fernando Arias & Ariel Guerra-Adames & Maytee Zambrano & Efraín Quintero-Guerra & Nathalia Tejedor-Flores, 2022. "Analyzing Spanish-Language Public Sentiment in the Context of a Pandemic and Social Unrest: The Panama Case," IJERPH, MDPI, vol. 19(16), pages 1-19, August.
    10. Fantazzini, Dean, 2020. "Short-term forecasting of the COVID-19 pandemic using Google Trends data: Evidence from 158 countries," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 59, pages 33-54.
    11. Ira Puspitasari & Alia Firdauzy, 2019. "Characterizing Consumer Behavior in Leveraging Social Media for E-Patient and Health-Related Activities," IJERPH, MDPI, vol. 16(18), pages 1-17, September.
    12. David A. Broniatowski, 2018. "Building the tower without climbing it: Progress in engineering systems," Systems Engineering, John Wiley & Sons, vol. 21(3), pages 259-281, May.
    13. Hongying Dai & Brian R. Lee & Jianqiang Hao, 2017. "Predicting Asthma Prevalence by Linking Social Media Data and Traditional Surveys," The ANNALS of the American Academy of Political and Social Science, , vol. 669(1), pages 75-92, January.
    14. Jose L Herrera & Ravi Srinivasan & John S Brownstein & Alison P Galvani & Lauren Ancel Meyers, 2016. "Disease Surveillance on Complex Social Networks," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-16, July.
    15. Muhammad Imran & Umair Qazi & Ferda Ofli, 2022. "TBCOV: Two Billion Multilingual COVID-19 Tweets with Sentiment, Entity, Geo, and Gender Labels," Data, MDPI, vol. 7(1), pages 1-27, January.
    16. David A. Broniatowski & Conrad Tucker, 2017. "Assessing causal claims about complex engineered systems with quantitative data: internal, external, and construct validity," Systems Engineering, John Wiley & Sons, vol. 20(6), pages 483-496, November.
    17. Valentina Lorenzoni & Gianni Andreozzi & Andrea Bazzani & Virginia Casigliani & Salvatore Pirri & Lara Tavoschi & Giuseppe Turchetti, 2022. "How Italy Tweeted about COVID-19: Detecting Reactions to the Pandemic from Social Media," IJERPH, MDPI, vol. 19(13), pages 1-14, June.
    18. Yufang Wang & Kuai Xu & Yun Kang & Haiyan Wang & Feng Wang & Adrian Avram, 2020. "Regional Influenza Prediction with Sampling Twitter Data and PDE Model," IJERPH, MDPI, vol. 17(3), pages 1-12, January.
    19. Xiaodong Cao & Piers MacNaughton & Zhengyi Deng & Jie Yin & Xi Zhang & Joseph G. Allen, 2018. "Using Twitter to Better Understand the Spatiotemporal Patterns of Public Sentiment: A Case Study in Massachusetts, USA," IJERPH, MDPI, vol. 15(2), pages 1-15, February.
    20. Julissa Alexandra Galarza-Villamar & Mariette McCampbell & Andres Galarza-Villamar & Cees Leeuwis & Francesco Cecchi & John Galarza-Rodrigo, 2021. "A Public Bad Game Method to Study Dynamics in Socio-Ecological Systems (Part II): Results of Testing Musa-Game in Rwanda and Adding Emergence and Spatiality to the Analysis," Sustainability, MDPI, vol. 13(16), pages 1-27, August.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0188941. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.