IDEAS home Printed from https://ideas.repec.org/a/plo/pntd00/0005973.html
   My bibliography  Save this article

Developing a dengue forecast model using machine learning: A case study in China

Author

Listed:
  • Pi Guo
  • Tao Liu
  • Qin Zhang
  • Li Wang
  • Jianpeng Xiao
  • Qingying Zhang
  • Ganfeng Luo
  • Zhihao Li
  • Jianfeng He
  • Yonghui Zhang
  • Wenjun Ma

Abstract

Background: In China, dengue remains an important public health issue with expanded areas and increased incidence recently. Accurate and timely forecasts of dengue incidence in China are still lacking. We aimed to use the state-of-the-art machine learning algorithms to develop an accurate predictive model of dengue. Methodology/Principal findings: Weekly dengue cases, Baidu search queries and climate factors (mean temperature, relative humidity and rainfall) during 2011–2014 in Guangdong were gathered. A dengue search index was constructed for developing the predictive models in combination with climate factors. The observed year and week were also included in the models to control for the long-term trend and seasonality. Several machine learning algorithms, including the support vector regression (SVR) algorithm, step-down linear regression model, gradient boosted regression tree algorithm (GBM), negative binomial regression model (NBM), least absolute shrinkage and selection operator (LASSO) linear regression model and generalized additive model (GAM), were used as candidate models to predict dengue incidence. Performance and goodness of fit of the models were assessed using the root-mean-square error (RMSE) and R-squared measures. The residuals of the models were examined using the autocorrelation and partial autocorrelation function analyses to check the validity of the models. The models were further validated using dengue surveillance data from five other provinces. The epidemics during the last 12 weeks and the peak of the 2014 large outbreak were accurately forecasted by the SVR model selected by a cross-validation technique. Moreover, the SVR model had the consistently smallest prediction error rates for tracking the dynamics of dengue and forecasting the outbreaks in other areas in China. Conclusion and significance: The proposed SVR model achieved a superior performance in comparison with other forecasting techniques assessed in this study. The findings can help the government and community respond early to dengue epidemics. Author summary: Dengue epidemics have posed a great burden expanding of disease, with areas expanding and incidence increasing in China recently. It has remained challenging to develop a robust and accurate forecast model and enhance predictability of dengue incidence. Several state-of-the-art machine learning algorithms, including the support vector regression algorithm, step-down linear regression model, gradient boosted regression tree algorithm, negative binomial regression model, least absolute shrinkage and selection operator linear regression model and generalized additive model, were compared and evaluated to forecast dengue incidence in this study. The SVR model, based on selection by a cross-validation technique, was superior to other models assessed using weekly dengue surveillance data, Baidu search query data and meteorological data during 2011–2014 in Guangdong province. The high accuracy and robustness of the proposed SVR model to predict the occurrence of an outbreak was also validated using data from other provinces, including Yunnan, Guangxi, Hunan, Fujian and Zhejiang, spanning southern China. To the best of our knowledge, this is the first attempt to thoroughly evaluate different algorithms for dengue incidence prediction. Our identification of the optimal model will help to precisely track dengue dynamics in the country.

Suggested Citation

  • Pi Guo & Tao Liu & Qin Zhang & Li Wang & Jianpeng Xiao & Qingying Zhang & Ganfeng Luo & Zhihao Li & Jianfeng He & Yonghui Zhang & Wenjun Ma, 2017. "Developing a dengue forecast model using machine learning: A case study in China," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 11(10), pages 1-22, October.
  • Handle: RePEc:plo:pntd00:0005973
    DOI: 10.1371/journal.pntd.0005973
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosntds/article?id=10.1371/journal.pntd.0005973
    Download Restriction: no

    File URL: https://journals.plos.org/plosntds/article/file?id=10.1371/journal.pntd.0005973&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pntd.0005973?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Samir Bhatt & Peter W. Gething & Oliver J. Brady & Jane P. Messina & Andrew W. Farlow & Catherine L. Moyes & John M. Drake & John S. Brownstein & Anne G. Hoen & Osman Sankoh & Monica F. Myers & Dylan , 2013. "The global distribution and burden of dengue," Nature, Nature, vol. 496(7446), pages 504-507, April.
    2. Donald S Shepard & Eduardo A Undurraga & Yara A Halasa, 2013. "Economic and Disease Burden of Dengue in Southeast Asia," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 7(2), pages 1-12, February.
    3. Kraisak Kesorn & Phatsavee Ongruk & Jakkrawarn Chompoosri & Atchara Phumee & Usavadee Thavara & Apiwat Tawatsin & Padet Siriyasatien, 2015. "Morbidity Rate Prediction of Dengue Hemorrhagic Fever (DHF) Using the Support Vector Machine and the Aedes aegypti Infection Rate in Similar Climates and Geographical Areas," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-16, May.
    4. Zhihao Li & Tao Liu & Guanghu Zhu & Hualiang Lin & Yonghui Zhang & Jianfeng He & Aiping Deng & Zhiqiang Peng & Jianpeng Xiao & Shannon Rutherford & Runsheng Xie & Weilin Zeng & Xing Li & Wenjun Ma, 2017. "Dengue Baidu Search Index data can improve the prediction of local dengue epidemic: A case study in Guangzhou, China," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 11(3), pages 1-13, March.
    5. Hyndman, Rob J. & Koehler, Anne B., 2006. "Another look at measures of forecast accuracy," International Journal of Forecasting, Elsevier, vol. 22(4), pages 679-688.
    6. Jeremy Ginsberg & Matthew H. Mohebbi & Rajan S. Patel & Lynnette Brammer & Mark S. Smolinski & Larry Brilliant, 2009. "Detecting influenza epidemics using search engine query data," Nature, Nature, vol. 457(7232), pages 1012-1014, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhang, Zhenzhen & Ma, Xia & Zhang, Yongxin & Sun, Guiquan & Zhang, Zi-Ke, 2023. "Identifying critical driving factors for human brucellosis in Inner Mongolia, China," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 626(C).
    2. Panja, Madhurima & Chakraborty, Tanujit & Nadim, Sk Shahid & Ghosh, Indrajit & Kumar, Uttam & Liu, Nan, 2023. "An ensemble neural network approach to forecast Dengue outbreak based on climatic condition," Chaos, Solitons & Fractals, Elsevier, vol. 167(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Laith Hussain-Alkhateeb & Tatiana Rivera Ramírez & Axel Kroeger & Ernesto Gozzer & Silvia Runge-Ranzinger, 2021. "Early warning systems (EWSs) for chikungunya, dengue, malaria, yellow fever, and Zika outbreaks: What is the evidence? A scoping review," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 15(9), pages 1-25, September.
    2. Gerhart Knerer & Christine S M Currie & Sally C Brailsford, 2020. "The economic impact and cost-effectiveness of combined vector-control and dengue vaccination strategies in Thailand: results from a dynamic transmission model," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 14(10), pages 1-32, October.
    3. Zhichao Li, 2022. "Forecasting Weekly Dengue Cases by Integrating Google Earth Engine-Based Risk Predictor Generation and Google Colab-Based Deep Learning Modeling in Fortaleza and the Federal District, Brazil," IJERPH, MDPI, vol. 19(20), pages 1-16, October.
    4. Zeynep Ertem & Dorrie Raymond & Lauren Ancel Meyers, 2018. "Optimal multi-source forecasting of seasonal influenza," PLOS Computational Biology, Public Library of Science, vol. 14(9), pages 1-16, September.
    5. Zhang, Chuan & Tian, Yu-Xin & Fan, Zhi-Ping, 2022. "Forecasting sales using online review and search engine data: A method based on PCA–DSFOA–BPNN," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1005-1024.
    6. Ibrahim Musa & Hyun Woo Park & Lkhagvadorj Munkhdalai & Keun Ho Ryu, 2018. "Global Research on Syndromic Surveillance from 1993 to 2017: Bibliometric Analysis and Visualization," Sustainability, MDPI, vol. 10(10), pages 1-20, September.
    7. Haocheng Wu & Chen Wu & Qinbao Lu & Zheyuan Ding & Ming Xue & Junfen Lin, 2019. "Evaluating the effects of control interventions and estimating the inapparent infections for dengue outbreak in Hangzhou, China," PLOS ONE, Public Library of Science, vol. 14(8), pages 1-16, August.
    8. Adriana Zubieta-Zavala & Malaquias López-Cervantes & Guillermo Salinas-Escudero & Adrian Ramírez-Chávez & José Ramos Castañeda & Sendy Isarel Hernández-Gaytán & Juan Guillermo López Yescas & Luis Durá, 2018. "Economic impact of dengue in Mexico considering reported cases for 2012 to 2016," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 12(12), pages 1-18, December.
    9. Leigh R Bowman & Sarah Donegan & Philip J McCall, 2016. "Is Dengue Vector Control Deficient in Effectiveness or Evidence?: Systematic Review and Meta-analysis," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 10(3), pages 1-24, March.
    10. Christopher Fitzpatrick & Alexander Haines & Mathieu Bangert & Andrew Farlow & Janet Hemingway & Raman Velayudhan, 2017. "An economic evaluation of vector control in the age of a dengue vaccine," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 11(8), pages 1-27, August.
    11. Tiago França Melo De Lima & Raquel Martins Lana & Tiago Garcia De Senna Carneiro & Cláudia Torres Codeço & Gabriel Souza Machado & Lucas Saraiva Ferreira & Líliam César De Castro Medeiros & Clodoveu A, 2016. "DengueME: A Tool for the Modeling and Simulation of Dengue Spatiotemporal Dynamics," IJERPH, MDPI, vol. 13(9), pages 1-21, September.
    12. Katsikopoulos, Konstantinos V. & Şimşek, Özgür & Buckmann, Marcus & Gigerenzer, Gerd, 2022. "Transparent modeling of influenza incidence: Big data or a single data point from psychological theory?," International Journal of Forecasting, Elsevier, vol. 38(2), pages 613-619.
    13. Mohd ‘Ammar Ihsan Ahmad Zamzuri & Farah Nabila Abd Majid & Rahmat Dapari & Mohd Rohaizat Hassan & Abd Majid Mohd Isa, 2022. "Perceived Risk for Dengue Infection Mediates the Relationship between Attitude and Practice for Dengue Prevention: A Study in Seremban, Malaysia," IJERPH, MDPI, vol. 19(20), pages 1-17, October.
    14. Dan Liu & Songjing Guo & Mingjun Zou & Cong Chen & Fei Deng & Zhong Xie & Sheng Hu & Liang Wu, 2019. "A dengue fever predicting model based on Baidu search index data and climate data in South China," PLOS ONE, Public Library of Science, vol. 14(12), pages 1-16, December.
    15. Luana Nice da Silva Oliveira & Alexander Itria & Erika Coutinho Lima, 2019. "Cost of illness and program of dengue: A systematic review," PLOS ONE, Public Library of Science, vol. 14(2), pages 1-15, February.
    16. Nuri Hacıevliyagil & Krzysztof Drachal & Ibrahim Halil Eksi, 2022. "Predicting House Prices Using DMA Method: Evidence from Turkey," Economies, MDPI, vol. 10(3), pages 1-27, March.
    17. Jiucheng Xu & Keqiang Xu & Zhichao Li & Fengxia Meng & Taotian Tu & Lei Xu & Qiyong Liu, 2020. "Forecast of Dengue Cases in 20 Chinese Cities Based on the Deep Learning Method," IJERPH, MDPI, vol. 17(2), pages 1-14, January.
    18. Emmanuelle Kumaran & Dyna Doum & Vanney Keo & Ly Sokha & BunLeng Sam & Vibol Chan & Neal Alexander & John Bradley & Marco Liverani & Didot Budi Prasetyo & Agus Rachmat & Sergio Lopes & Jeffrey Hii & L, 2018. "Dengue knowledge, attitudes and practices and their impact on community-based vector control in rural Cambodia," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 12(2), pages 1-16, February.
    19. repec:prg:jnlcfu:v:2022:y:2022:i:1:id:572 is not listed on IDEAS
    20. David H Chae & Sean Clouston & Mark L Hatzenbuehler & Michael R Kramer & Hannah L F Cooper & Sacoby M Wilson & Seth I Stephens-Davidowitz & Robert S Gold & Bruce G Link, 2015. "Association between an Internet-Based Measure of Area Racism and Black Mortality," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-12, April.
    21. Chang, Andrew C. & Hanson, Tyler J., 2016. "The accuracy of forecasts prepared for the Federal Open Market Committee," Journal of Economics and Business, Elsevier, vol. 83(C), pages 23-43.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pntd00:0005973. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosntds (email available below). General contact details of provider: https://journals.plos.org/plosntds/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.