IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0215600.html
   My bibliography  Save this article

Regional level influenza study based on Twitter and machine learning method

Author

Listed:
  • Hongxin Xue
  • Yanping Bai
  • Hongping Hu
  • Haijian Liang

Abstract

The significance of flu prediction is that the appropriate preventive and control measures can be taken by relevant departments after assessing predicted data; thus, morbidity and mortality can be reduced. In this paper, three flu prediction models, based on twitter and US Centers for Disease Control’s (CDC’s) Influenza-Like Illness (ILI) data, are proposed (models 1-3) to verify the factors that affect the spread of the flu. In this work, an Improved Particle Swarm Optimization algorithm to optimize the parameters of Support Vector Regression (IPSO-SVR) was proposed. The IPSO-SVR was trained by the independent and dependent variables of the three models (models 1-3) as input and output. The trained IPSO-SVR method was used to predict the regional unweighted percentage ILI (%ILI) events in the US. The prediction results of each model are analyzed and compared. The results show that the IPSO-SVR method (model 3) demonstrates excellent performance in real-time prediction of ILIs, and further highlights the benefits of using real-time twitter data, thus providing an effective means for the prevention and control of flu.

Suggested Citation

  • Hongxin Xue & Yanping Bai & Hongping Hu & Haijian Liang, 2019. "Regional level influenza study based on Twitter and machine learning method," PLOS ONE, Public Library of Science, vol. 14(4), pages 1-23, April.
  • Handle: RePEc:plo:pone00:0215600
    DOI: 10.1371/journal.pone.0215600
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0215600
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0215600&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0215600?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Tatiana Petukhova & Davor Ojkic & Beverly McEwen & Rob Deardon & Zvonimir Poljak, 2018. "Assessment of autoregressive integrated moving average (ARIMA), generalized linear autoregressive moving average (GLARMA), and random forest (RF) time series regression models for predicting influenza," PLOS ONE, Public Library of Science, vol. 13(6), pages 1-17, June.
    2. Declan Butler, 2013. "When Google got flu wrong," Nature, Nature, vol. 494(7436), pages 155-156, February.
    3. Eui-Ki Kim & Jong Hyeon Seok & Jang Seok Oh & Hyong Woo Lee & Kyung Hyun Kim, 2013. "Use of Hangeul Twitter to Track and Predict Human Influenza Infection," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-11, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Soo Beom Choi & Insung Ahn, 2020. "Forecasting seasonal influenza-like illness in South Korea after 2 and 30 weeks using Google Trends and influenza data from Argentina," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-14, July.
    2. Shu-Heng Chen & Ragupathy Venkatachalam, 2017. "Information aggregation and computational intelligence," Evolutionary and Institutional Economics Review, Springer, vol. 14(1), pages 231-252, June.
    3. Krzysztof Bartosz Klimiuk & Dawid Krefta & Karol Kołkowski & Karol Flisikowski & Małgorzata Sokołowska-Wojdyło & Łukasz Balwicki, 2022. "Seasonal Patterns and Trends in Dermatoses in Poland," IJERPH, MDPI, vol. 19(15), pages 1-14, July.
    4. Steven Heston & Nitish R. Sinha, 2016. "News versus Sentiment : Predicting Stock Returns from News Stories," Finance and Economics Discussion Series 2016-048, Board of Governors of the Federal Reserve System (U.S.).
    5. Khatri, Vijay, 2016. "Managerial work in the realm of the digital universe: The role of the data triad," Business Horizons, Elsevier, vol. 59(6), pages 673-688.
    6. Mark Huberty, 2015. "Awaiting the Second Big Data Revolution: From Digital Noise to Value Creation," Journal of Industry, Competition and Trade, Springer, vol. 15(1), pages 35-47, March.
    7. Baki Cakici & Pedro Sanches, 2014. "Detecting the Visible: The Discursive Construction of Health Threats in a Syndromic Surveillance System Design," Societies, MDPI, vol. 4(3), pages 1-15, July.
    8. Zeynep Ertem & Dorrie Raymond & Lauren Ancel Meyers, 2018. "Optimal multi-source forecasting of seasonal influenza," PLOS Computational Biology, Public Library of Science, vol. 14(9), pages 1-16, September.
    9. Ibrahim Musa & Hyun Woo Park & Lkhagvadorj Munkhdalai & Keun Ho Ryu, 2018. "Global Research on Syndromic Surveillance from 1993 to 2017: Bibliometric Analysis and Visualization," Sustainability, MDPI, vol. 10(10), pages 1-20, September.
    10. Huberty, Mark, 2015. "Can we vote with our tweet? On the perennial difficulty of election forecasting with social media," International Journal of Forecasting, Elsevier, vol. 31(3), pages 992-1007.
    11. Rivera, Roberto, 2016. "A dynamic linear model to forecast hotel registrations in Puerto Rico using Google Trends data," Tourism Management, Elsevier, vol. 57(C), pages 12-20.
    12. F. Antolini & L. Grassini, 2019. "Foreign arrivals nowcasting in Italy with Google Trends data," Quality & Quantity: International Journal of Methodology, Springer, vol. 53(5), pages 2385-2401, September.
    13. Nataliya Shakhovska & Ivan Izonin & Nataliia Melnykova, 2021. "The Hierarchical Classifier for COVID-19 Resistance Evaluation," Data, MDPI, vol. 6(1), pages 1-17, January.
    14. Victor Olsavszky & Mihnea Dosius & Cristian Vladescu & Johannes Benecke, 2020. "Time Series Analysis and Forecasting with Automated Machine Learning on a National ICD-10 Database," IJERPH, MDPI, vol. 17(14), pages 1-17, July.
    15. Daniel Bjorkegren & Darrell Grissen, 2017. "Behavior Revealed in Mobile Phone Usage Predicts Loan Repayment," Papers 1712.05840, arXiv.org, revised Dec 2019.
    16. Arora, Vishal S. & McKee, Martin & Stuckler, David, 2019. "Google Trends: Opportunities and limitations in health and health policy research," Health Policy, Elsevier, vol. 123(3), pages 338-341.
    17. Jiachen Sun & Peter A. Gloor, 2021. "Assessing the Predictive Power of Online Social Media to Analyze COVID-19 Outbreaks in the 50 U.S. States," Future Internet, MDPI, vol. 13(7), pages 1-13, July.
    18. Daniel E. O'Leary & Veda C. Storey, 2020. "A Google–Wikipedia–Twitter Model as a Leading Indicator of the Numbers of Coronavirus Deaths," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 27(3), pages 151-158, July.
    19. Jun, Seung-Pyo & Sung, Tae-Eung & Park, Hyun-Woo, 2017. "Forecasting by analogy using the web search traffic," Technological Forecasting and Social Change, Elsevier, vol. 115(C), pages 37-51.
    20. Jun, Seung-Pyo & Park, Do-Hyung, 2016. "Consumer information search behavior and purchasing decisions: Empirical evidence from Korea," Technological Forecasting and Social Change, Elsevier, vol. 107(C), pages 97-111.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0215600. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.