IDEAS home Printed from https://ideas.repec.org/a/plo/pntd00/0008710.html
   My bibliography  Save this article

Weekly dengue forecasts in Iquitos, Peru; San Juan, Puerto Rico; and Singapore

Author

Listed:
  • Corey M Benedum
  • Kimberly M Shea
  • Helen E Jenkins
  • Louis Y Kim
  • Natasha Markuzon

Abstract

Background: Predictive models can serve as early warning systems and can be used to forecast future risk of various infectious diseases. Conventionally, regression and time series models are used to forecast dengue incidence, using dengue surveillance (e.g., case counts) and weather data. However, these models may be limited in terms of model assumptions and the number of predictors that can be included. Machine learning (ML) methods are designed to work with a large number of predictors and thus offer an appealing alternative. Here, we compared the performance of ML algorithms with that of regression models in predicting dengue cases and outbreaks from 4 to up to 12 weeks in advance. Many countries lack sufficient health surveillance infrastructure, as such we evaluated the contribution of dengue surveillance and weather data on the predictive power of these models. Methods: We developed ML, regression, and time series models to forecast weekly dengue case counts and outbreaks in Iquitos, Peru; San Juan, Puerto Rico; and Singapore from 1990–2016. Forecasts were generated using available weekly dengue surveillance, and weather data. We evaluated the agreement between model forecasts and actual dengue observations using Mean Absolute Error and Matthew’s Correlation Coefficient (MCC). Results: For near term predictions of weekly case counts and when using surveillance data, ML models had 21% and 33% less error than regression and time series models respectively. However, using weather data only, ML models did not demonstrate a practical advantage. When forecasting weekly dengue outbreaks 12 weeks in advance, ML models achieved a maximum MCC of 0.61. Conclusions: Our results identified 2 scenarios when ML models are advantageous over regression model: 1) predicting dengue weekly case counts 4 weeks ahead when dengue surveillance data are available and 2) predicting weekly dengue outbreaks 12 weeks ahead when dengue surveillance data are unavailable. Given the advantages of ML models, dengue early warning systems may be improved by the inclusion of these models. Author summary: Accurate and timely forecasts of dengue fever can help mitigate the impact of the disease. Currently, regression and time series models are frequently used to predict dengue cases and outbreaks. However, these models may be limited in terms of model assumptions and the number of predictors that can be included. Machine learning (ML) models offer an appealing alternative as they have a nonlinear framework and can be applied to high dimensional data. In this study, we compared the performance of ML algorithms with that of regression and time series models in predicting dengue cases and outbreaks from 4 to up to 12 weeks in advance in 3 dengue-endemic regions. Model predictions were based upon local dengue surveillance (e.g., case counts), population, temporal, and weather data. Many countries lack sufficient health surveillance infrastructure, as such we evaluated the contribution of dengue surveillance and weather data on the predictive power of the models. Our results identified 2 scenarios when ML models performed better than conventional models: 1) predicting dengue weekly case counts 4 weeks ahead when dengue surveillance data are available and 2) predicting weekly dengue outbreaks 12 weeks ahead when dengue surveillance data are unavailable. This research suggests that ML models can be a beneficial tool for dengue early warning systems.

Suggested Citation

  • Corey M Benedum & Kimberly M Shea & Helen E Jenkins & Louis Y Kim & Natasha Markuzon, 2020. "Weekly dengue forecasts in Iquitos, Peru; San Juan, Puerto Rico; and Singapore," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 14(10), pages 1-26, October.
  • Handle: RePEc:plo:pntd00:0008710
    DOI: 10.1371/journal.pntd.0008710
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosntds/article?id=10.1371/journal.pntd.0008710
    Download Restriction: no

    File URL: https://journals.plos.org/plosntds/article/file?id=10.1371/journal.pntd.0008710&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pntd.0008710?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pntd00:0008710. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosntds (email available below). General contact details of provider: https://journals.plos.org/plosntds/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.