IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v15y2018i8p1596-d160427.html
   My bibliography  Save this article

Predicting Infectious Disease Using Deep Learning and Big Data

Author

Listed:
  • Sangwon Chae

    (Department of Business Administration, Korea Polytechnic University, 237 Sangidaehak-ro, Siheung-si, Gyeonggi-do 15073, Korea)

  • Sungjun Kwon

    (Department of Business Administration, Korea Polytechnic University, 237 Sangidaehak-ro, Siheung-si, Gyeonggi-do 15073, Korea)

  • Donghyun Lee

    (Department of Business Administration, Korea Polytechnic University, 237 Sangidaehak-ro, Siheung-si, Gyeonggi-do 15073, Korea)

Abstract

Infectious disease occurs when a person is infected by a pathogen from another person or an animal. It is a problem that causes harm at both individual and macro scales. The Korea Center for Disease Control (KCDC) operates a surveillance system to minimize infectious disease contagions. However, in this system, it is difficult to immediately act against infectious disease because of missing and delayed reports. Moreover, infectious disease trends are not known, which means prediction is not easy. This study predicts infectious diseases by optimizing the parameters of deep learning algorithms while considering big data including social media data. The performance of the deep neural network (DNN) and long-short term memory (LSTM) learning models were compared with the autoregressive integrated moving average (ARIMA) when predicting three infectious diseases one week into the future. The results show that the DNN and LSTM models perform better than ARIMA. When predicting chickenpox, the top-10 DNN and LSTM models improved average performance by 24% and 19%, respectively. The DNN model performed stably and the LSTM model was more accurate when infectious disease was spreading. We believe that this study’s models can help eliminate reporting delays in existing surveillance systems and, therefore, minimize costs to society.

Suggested Citation

  • Sangwon Chae & Sungjun Kwon & Donghyun Lee, 2018. "Predicting Infectious Disease Using Deep Learning and Big Data," IJERPH, MDPI, vol. 15(8), pages 1-20, July.
  • Handle: RePEc:gam:jijerp:v:15:y:2018:i:8:p:1596-:d:160427
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/15/8/1596/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/15/8/1596/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Andre Esteva & Brett Kuprel & Roberto A. Novoa & Justin Ko & Susan M. Swetter & Helen M. Blau & Sebastian Thrun, 2017. "Correction: Corrigendum: Dermatologist-level classification of skin cancer with deep neural networks," Nature, Nature, vol. 546(7660), pages 686-686, June.
    2. Andrea Freyer Dugas & Mehdi Jalalpour & Yulia Gel & Scott Levin & Fred Torcaso & Takeru Igusa & Richard E Rothman, 2013. "Influenza Forecasting with Google Flu Trends," PLOS ONE, Public Library of Science, vol. 8(2), pages 1-7, February.
    3. Wei Bao & Jun Yue & Yulei Rao, 2017. "A deep learning framework for financial time series using stacked autoencoders and long-short term memory," PLOS ONE, Public Library of Science, vol. 12(7), pages 1-24, July.
    4. Yan-Ling Zheng & Li-Ping Zhang & Xue-Liang Zhang & Kai Wang & Yu-Jian Zheng, 2015. "Forecast Model Analysis for the Morbidity of Tuberculosis in Xinjiang, China," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-13, March.
    5. Nicholas Thapen & Donal Simmie & Chris Hankin & Joseph Gillard, 2016. "DEFENDER: Detecting and Forecasting Epidemics Using Novel Data-Analytics for Enhanced Response," PLOS ONE, Public Library of Science, vol. 11(5), pages 1-19, May.
    6. Andre Esteva & Brett Kuprel & Roberto A. Novoa & Justin Ko & Susan M. Swetter & Helen M. Blau & Sebastian Thrun, 2017. "Dermatologist-level classification of skin cancer with deep neural networks," Nature, Nature, vol. 542(7639), pages 115-118, February.
    7. Neil M. Ferguson & Derek A. T. Cummings & Christophe Fraser & James C. Cajka & Philip C. Cooley & Donald S. Burke, 2006. "Strategies for mitigating an influenza pandemic," Nature, Nature, vol. 442(7101), pages 448-452, July.
    8. Svitlana Volkova & Ellyn Ayton & Katherine Porterfield & Courtney D Corley, 2017. "Forecasting influenza-like illness dynamics for military populations using neural networks and social media," PLOS ONE, Public Library of Science, vol. 12(12), pages 1-22, December.
    9. Chris Allen & Ming-Hsiang Tsou & Anoshe Aslam & Anna Nagel & Jean-Mark Gawron, 2016. "Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza," PLOS ONE, Public Library of Science, vol. 11(7), pages 1-10, July.
    10. Ali Tafti & Ryan Zotti & Wolfgang Jank, 2016. "Real-Time Diffusion of Information on Twitter and the Financial Markets," PLOS ONE, Public Library of Science, vol. 11(8), pages 1-16, August.
    11. Yuanbin Song & Fan Wang & Bin Wang & Shaohua Tao & Huiping Zhang & Sai Liu & Oscar Ramirez & Qiyi Zeng, 2015. "Time Series Analyses of Hand, Foot and Mouth Disease Integrating Weather Variables," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-18, March.
    12. Qinneng Xu & Yulia R Gel & L Leticia Ramirez Ramirez & Kusha Nezafati & Qingpeng Zhang & Kwok-Leung Tsui, 2017. "Forecasting influenza in Hong Kong with Google search queries and statistical model fusion," PLOS ONE, Public Library of Science, vol. 12(5), pages 1-17, May.
    13. Vittoria Colizza & Alain Barrat & Marc Barthelemy & Alain-Jacques Valleron & Alessandro Vespignani, 2007. "Modeling the Worldwide Spread of Pandemic Influenza: Baseline Case and Containment Interventions," PLOS Medicine, Public Library of Science, vol. 4(1), pages 1-16, January.
    14. Stephen Eubank & Hasan Guclu & V. S. Anil Kumar & Madhav V. Marathe & Aravind Srinivasan & Zoltán Toroczkai & Nan Wang, 2004. "Modelling disease outbreaks in realistic urban social networks," Nature, Nature, vol. 429(6988), pages 180-184, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Shruti Sharma & Yogesh Kumar Gupta & Abhinava K. Mishra, 2023. "Analysis and Prediction of COVID-19 Multivariate Data Using Deep Ensemble Learning Methods," IJERPH, MDPI, vol. 20(11), pages 1-23, May.
    2. Jihye Lim & Jungyoon Kim & Songhee Cheon, 2019. "A Deep Neural Network-Based Method for Early Detection of Osteoarthritis Using Statistical Data," IJERPH, MDPI, vol. 16(7), pages 1-11, April.
    3. Jiucheng Xu & Keqiang Xu & Zhichao Li & Fengxia Meng & Taotian Tu & Lei Xu & Qiyong Liu, 2020. "Forecast of Dengue Cases in 20 Chinese Cities Based on the Deep Learning Method," IJERPH, MDPI, vol. 17(2), pages 1-14, January.
    4. Alekh Gour & Shikha Aggarwal & Subodha Kumar, 2022. "Lending ears to unheard voices: An empirical analysis of user‐generated content on social media," Production and Operations Management, Production and Operations Management Society, vol. 31(6), pages 2457-2476, June.
    5. Srinka Basu & Sugata Sen, 2023. "COVID 19 Pandemic, Socio-Economic Behaviour and Infection Characteristics: An Inter-Country Predictive Study Using Deep Learning," Computational Economics, Springer;Society for Computational Economics, vol. 61(2), pages 645-676, February.
    6. Victor Olsavszky & Mihnea Dosius & Cristian Vladescu & Johannes Benecke, 2020. "Time Series Analysis and Forecasting with Automated Machine Learning on a National ICD-10 Database," IJERPH, MDPI, vol. 17(14), pages 1-17, July.
    7. Vanessa Alcalá-Rmz & Laura A. Zanella-Calzada & Carlos E. Galván-Tejada & Alejandra García-Hernández & Miguel Cruz & Adan Valladares-Salgado & Jorge I. Galván-Tejada & Hamurabi Gamboa-Rosales, 2019. "Identification of Diabetic Patients through Clinical and Para-Clinical Features in Mexico: An Approach Using Deep Neural Networks," IJERPH, MDPI, vol. 16(3), pages 1-12, January.
    8. Israel Edem Agbehadji & Bankole Osita Awuzie & Alfred Beati Ngowi & Richard C. Millham, 2020. "Review of Big Data Analytics, Artificial Intelligence and Nature-Inspired Computing Models towards Accurate Detection of COVID-19 Pandemic Cases and Contact Tracing," IJERPH, MDPI, vol. 17(15), pages 1-16, July.
    9. Monday Osayande & Osagie Osifo, 2024. "Application Of Covid-19 Data: Investigating The Impact On Weekly Stock Market Returns In Nigeria," Journal of Academic Research in Economics, Spiru Haret University, Faculty of Accounting and Financial Management Constanta, vol. 16(2 (July)), pages 403-416.
    10. Jinhai Li & Yunlei Ma & Xinglong Xu & Jiaming Pei & Youshi He, 2022. "A Study on Epidemic Information Screening, Prevention and Control of Public Opinion Based on Health and Medical Big Data: A Case Study of COVID-19," IJERPH, MDPI, vol. 19(16), pages 1-21, August.
    11. Rui Zhang & Zhen Guo & Yujie Meng & Songwang Wang & Shaoqiong Li & Ran Niu & Yu Wang & Qing Guo & Yonghong Li, 2021. "Comparison of ARIMA and LSTM in Forecasting the Incidence of HFMD Combined and Uncombined with Exogenous Meteorological Variables in Ningbo, China," IJERPH, MDPI, vol. 18(11), pages 1-14, June.
    12. Laura Zoboroski & Torrey Wagner & Brent Langhals, 2021. "Classical and Neural Network Machine Learning to Determine the Risk of Marijuana Use," IJERPH, MDPI, vol. 18(14), pages 1-15, July.
    13. Paulina Phoobane & Muthoni Masinde & Tafadzwanashe Mabhaudhi, 2022. "Predicting Infectious Diseases: A Bibliometric Review on Africa," IJERPH, MDPI, vol. 19(3), pages 1-20, February.
    14. Corrado Lanera & Ileana Baldi & Andrea Francavilla & Elisa Barbieri & Lara Tramontan & Antonio Scamarcia & Luigi Cantarutti & Carlo Giaquinto & Dario Gregori, 2022. "A Deep Learning Approach to Estimate the Incidence of Infectious Disease Cases for Routinely Collected Ambulatory Records: The Example of Varicella-Zoster," IJERPH, MDPI, vol. 19(10), pages 1-13, May.
    15. Bowen Long & Fangya Tan & Mark Newman, 2023. "Forecasting the Monkeypox Outbreak Using ARIMA, Prophet, NeuralProphet, and LSTM Models in the United States," Forecasting, MDPI, vol. 5(1), pages 1-11, January.
    16. Lee, Donghyun & Kim, Mingyu & Lee, Beomhui & Chae, Sangwon & Kwon, Sungjun & Kang, Sungwon, 2022. "Integrated explainable deep learning prediction of harmful algal blooms," Technological Forecasting and Social Change, Elsevier, vol. 185(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Victor Olsavszky & Mihnea Dosius & Cristian Vladescu & Johannes Benecke, 2020. "Time Series Analysis and Forecasting with Automated Machine Learning on a National ICD-10 Database," IJERPH, MDPI, vol. 17(14), pages 1-17, July.
    2. Jungyoon Kim & Jihye Lim, 2021. "A Deep Neural Network-Based Method for Prediction of Dementia Using Big Data," IJERPH, MDPI, vol. 18(10), pages 1-13, May.
    3. Songhee Cheon & Jungyoon Kim & Jihye Lim, 2019. "The Use of Deep Learning to Predict Stroke Patient Mortality," IJERPH, MDPI, vol. 16(11), pages 1-12, May.
    4. Daniel Alejandro Gónzalez-Bandala & Juan Carlos Cuevas-Tello & Daniel E. Noyola & Andreu Comas-García & Christian A García-Sepúlveda, 2020. "Computational Forecasting Methodology for Acute Respiratory Infectious Disease Dynamics," IJERPH, MDPI, vol. 17(12), pages 1-20, June.
    5. Logan C Brooks & David C Farrow & Sangwon Hyun & Ryan J Tibshirani & Roni Rosenfeld, 2015. "Flexible Modeling of Epidemics with an Empirical Bayes Framework," PLOS Computational Biology, Public Library of Science, vol. 11(8), pages 1-18, August.
    6. Floriana Gargiulo & Sônia Ternes & Sylvie Huet & Guillaume Deffuant, 2010. "An Iterative Approach for Generating Statistically Realistic Populations of Households," PLOS ONE, Public Library of Science, vol. 5(1), pages 1-9, January.
    7. Lin Lu & Laurent Dercle & Binsheng Zhao & Lawrence H. Schwartz, 2021. "Deep learning for the prediction of early on-treatment response in metastatic colorectal cancer from serial medical imaging," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    8. Zheng Yan & Wenqian Robertson & Yaosheng Lou & Tom W. Robertson & Sung Yong Park, 2021. "Finding leading scholars in mobile phone behavior: a mixed-method analysis of an emerging interdisciplinary field," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(12), pages 9499-9517, December.
    9. Freddy Gabbay & Rotem Lev Aharoni & Ori Schweitzer, 2022. "Deep Neural Network Memory Performance and Throughput Modeling and Simulation Framework," Mathematics, MDPI, vol. 10(21), pages 1-20, November.
    10. Wiriya Mahikul & Somkid Kripattanapong & Piya Hanvoravongchai & Aronrag Meeyai & Sopon Iamsirithaworn & Prasert Auewarakul & Wirichada Pan-ngum, 2020. "Contact Mixing Patterns and Population Movement among Migrant Workers in an Urban Setting in Thailand," IJERPH, MDPI, vol. 17(7), pages 1-11, March.
    11. Gang Yu & Kai Sun & Chao Xu & Xing-Hua Shi & Chong Wu & Ting Xie & Run-Qi Meng & Xiang-He Meng & Kuan-Song Wang & Hong-Mei Xiao & Hong-Wen Deng, 2021. "Accurate recognition of colorectal cancer with semi-supervised deep learning on pathological images," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    12. DonHee Lee & Seong No Yoon, 2021. "Application of Artificial Intelligence-Based Technologies in the Healthcare Industry: Opportunities and Challenges," IJERPH, MDPI, vol. 18(1), pages 1-18, January.
    13. Xiaoyan Mu & Anthony Gar-On Yeh & Xiaohu Zhang, 2021. "The interplay of spatial spread of COVID-19 and human mobility in the urban system of China during the Chinese New Year," Environment and Planning B, , vol. 48(7), pages 1955-1971, September.
    14. Claus Zippel & Sabine Bohnet-Joschko, 2021. "Rise of Clinical Studies in the Field of Machine Learning: A Review of Data Registered in ClinicalTrials.gov," IJERPH, MDPI, vol. 18(10), pages 1-14, May.
    15. Dario Sipari & Betsy D. M. Chaparro-Rico & Daniele Cafolla, 2022. "SANE (Easy Gait Analysis System): Towards an AI-Assisted Automatic Gait-Analysis," IJERPH, MDPI, vol. 19(16), pages 1-27, August.
    16. Jamil Ahmad & Abdul Khader Jilani Saudagar & Khalid Mahmood Malik & Waseem Ahmad & Muhammad Badruddin Khan & Mozaherul Hoque Abul Hasanat & Abdullah AlTameem & Mohammed AlKhathami & Muhammad Sajjad, 2022. "Disease Progression Detection via Deep Sequence Learning of Successive Radiographic Scans," IJERPH, MDPI, vol. 19(1), pages 1-16, January.
    17. Xiao, Yao & Yang, Mofeng & Zhu, Zheng & Yang, Hai & Zhang, Lei & Ghader, Sepehr, 2021. "Modeling indoor-level non-pharmaceutical interventions during the COVID-19 pandemic: A pedestrian dynamics-based microscopic simulation approach," Transport Policy, Elsevier, vol. 109(C), pages 12-23.
    18. Rasheed Omobolaji Alabi & Alhadi Almangush & Mohammed Elmusrati & Ilmo Leivo & Antti Mäkitie, 2022. "Measuring the Usability and Quality of Explanations of a Machine Learning Web-Based Tool for Oral Tongue Cancer Prognostication," IJERPH, MDPI, vol. 19(14), pages 1-13, July.
    19. Andreas Fügener & Jörn Grahl & Alok Gupta & Wolfgang Ketter, 2022. "Cognitive Challenges in Human–Artificial Intelligence Collaboration: Investigating the Path Toward Productive Delegation," Information Systems Research, INFORMS, vol. 33(2), pages 678-696, June.
    20. Vidhya V. & Anjan Gudigar & U. Raghavendra & Ajay Hegde & Girish R. Menon & Filippo Molinari & Edward J. Ciaccio & U. Rajendra Acharya, 2021. "Automated Detection and Screening of Traumatic Brain Injury (TBI) Using Computed Tomography Images: A Comprehensive Review and Future Perspectives," IJERPH, MDPI, vol. 18(12), pages 1-29, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:15:y:2018:i:8:p:1596-:d:160427. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.