IDEAS home Printed from https://ideas.repec.org/a/eee/chsofr/v156y2022ics0960077921011334.html
   My bibliography  Save this article

A random forest model for forecasting regional COVID-19 cases utilizing reproduction number estimates and demographic data

Author

Listed:
  • Galasso, Joseph
  • Cao, Duy M.
  • Hochberg, Robert

Abstract

During the COVID-19 pandemic, predicting case spikes at the local level is important for a precise, targeted public health response and is generally done with compartmental models. The performance of compartmental models is highly dependent on the accuracy of their assumptions about disease dynamics within a population; thus, such models are susceptible to human error, unexpected events, or unknown characteristics of a novel infectious agent like COVID-19. We present a relatively non-parametric random forest model that forecasts the number of COVID-19 cases at the U.S. county level. Its most prioritized training features are derived from easily accessible, standard epidemiological data (i.e., regional test positivity rate) and the effective reproduction number (Rt) from compartmental models. A novel input training feature is case projections generated by aligning estimated effective reproduction number (pre-computed by COVIDActNow.org) with real time testing data until maximally correlated, helping our model fit better to the epidemic’s trajectory as ascertained by traditional models. Poor reliability of Rt is partially mitigated with dynamic population mobility and prevalence and mortality of non-COVID-19 diseases to gauge population disease susceptibility. The model was used to generate forecasts for 1, 2, 3, and 4 weeks into the future for each reference week within 11/01/2020 - 01/10/2021 for 3068 counties. Over this time period, it maintained a mean absolute error (MAE) of less than 300 weekly cases/100,000 and consistently outperformed or performed comparably with gold-standard compartmental models. Furthermore, it holds great potential in ensemble modeling due to its potential for a more expansive training feature set while maintaining good performance and limited resource utilization.

Suggested Citation

  • Galasso, Joseph & Cao, Duy M. & Hochberg, Robert, 2022. "A random forest model for forecasting regional COVID-19 cases utilizing reproduction number estimates and demographic data," Chaos, Solitons & Fractals, Elsevier, vol. 156(C).
  • Handle: RePEc:eee:chsofr:v:156:y:2022:i:c:s0960077921011334
    DOI: 10.1016/j.chaos.2021.111779
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0960077921011334
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.chaos.2021.111779?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zeroual, Abdelhafid & Harrou, Fouzi & Dairi, Abdelkader & Sun, Ying, 2020. "Deep learning methods for forecasting COVID-19 time-Series data: A Comparative study," Chaos, Solitons & Fractals, Elsevier, vol. 140(C).
    2. Shelby R. Buckman & Reuven Glick & Kevin J. Lansing & Nicolas Petrosky-Nadeau & Lily Seitelman, 2020. "Replicating and Projecting the Path of COVID-19 with a Model-Implied Reproduction Number," Working Paper Series 2020-24, Federal Reserve Bank of San Francisco.
    3. Ribeiro, Matheus Henrique Dal Molin & da Silva, Ramon Gomes & Mariani, Viviana Cocco & Coelho, Leandro dos Santos, 2020. "Short-term forecasting COVID-19 cumulative confirmed cases: Perspectives for Brazil," Chaos, Solitons & Fractals, Elsevier, vol. 135(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Masum, Mohammad & Masud, M.A. & Adnan, Muhaiminul Islam & Shahriar, Hossain & Kim, Sangil, 2022. "Comparative study of a mathematical epidemic model, statistical modeling, and deep learning for COVID-19 forecasting and management," Socio-Economic Planning Sciences, Elsevier, vol. 80(C).
    2. Iloanusi, Ogechukwu & Ross, Arun, 2021. "Leveraging weather data for forecasting cases-to-mortality rates due to COVID-19," Chaos, Solitons & Fractals, Elsevier, vol. 152(C).
    3. Medeiros, Marcelo C. & Street, Alexandre & Valladão, Davi & Vasconcelos, Gabriel & Zilberman, Eduardo, 2022. "Short-term Covid-19 forecast for latecomers," International Journal of Forecasting, Elsevier, vol. 38(2), pages 467-488.
    4. Ahed Abugabah & Farah Shahid, 2023. "Intelligent Health Care and Diseases Management System: Multi-Day-Ahead Predictions of COVID-19," Mathematics, MDPI, vol. 11(4), pages 1-19, February.
    5. Tayarani N., Mohammad-H., 2021. "Applications of artificial intelligence in battling against covid-19: A literature review," Chaos, Solitons & Fractals, Elsevier, vol. 142(C).
    6. Xiaojin Xie & Kangyang Luo & Zhixiang Yin & Guoqiang Wang, 2021. "Nonlinear Combinational Dynamic Transmission Rate Model and Its Application in Global COVID-19 Epidemic Prediction and Analysis," Mathematics, MDPI, vol. 9(18), pages 1-17, September.
    7. Faizeh Hatami & Shi Chen & Rajib Paul & Jean-Claude Thill, 2022. "Simulating and Forecasting the COVID-19 Spread in a U.S. Metropolitan Region with a Spatial SEIR Model," IJERPH, MDPI, vol. 19(23), pages 1-16, November.
    8. Faik Bilgili & Emrah Koçak & Sevda Kuşkaya, 2023. "Dynamics and Co-movements Between the COVID-19 Outbreak and the Stock Market in Latin American Countries: An Evaluation Based on the Wavelet-Partial Wavelet Coherence Model," Evaluation Review, , vol. 47(4), pages 630-652, August.
    9. Khan, Junaid Iqbal & Ullah, Farman & Lee, Sungchang, 2022. "Attention based parameter estimation and states forecasting of COVID-19 pandemic using modified SIQRD Model," Chaos, Solitons & Fractals, Elsevier, vol. 165(P2).
    10. Dalton Garcia Borges de Souza & Erivelton Antonio dos Santos & Francisco Tarcísio Alves Júnior & Mariá Cristina Vasconcelos Nascimento, 2021. "On Comparing Cross-Validated Forecasting Models with a Novel Fuzzy-TOPSIS Metric: A COVID-19 Case Study," Sustainability, MDPI, vol. 13(24), pages 1-25, December.
    11. Zhao, Xinxing & Li, Kainan & Ang, Candice Ke En & Ho, Andrew Fu Wah & Liu, Nan & Ong, Marcus Eng Hock & Cheong, Kang Hao, 2022. "A deep learning architecture for forecasting daily emergency department visits with acuity levels," Chaos, Solitons & Fractals, Elsevier, vol. 165(P1).
    12. Rafael Pérez Abreu C. & Samantha Estrada & Héctor de-la-Torre-Gutiérrez, 2021. "A Two-Step Polynomial and Nonlinear Growth Approach for Modeling COVID-19 Cases in Mexico," Mathematics, MDPI, vol. 9(18), pages 1-18, September.
    13. Perone, G., 2020. "Comparison of ARIMA, ETS, NNAR and hybrid models to forecast the second wave of COVID-19 hospitalizations in Italy," Health, Econometrics and Data Group (HEDG) Working Papers 20/18, HEDG, c/o Department of Economics, University of York.
    14. da Silva, Ramon Gomes & Ribeiro, Matheus Henrique Dal Molin & Mariani, Viviana Cocco & Coelho, Leandro dos Santos, 2020. "Forecasting Brazilian and American COVID-19 cases based on artificial intelligence coupled with climatic exogenous variables," Chaos, Solitons & Fractals, Elsevier, vol. 139(C).
    15. Yongchao Jin & Renfang Wang & Xiaodie Zhuang & Kenan Wang & Honglian Wang & Chenxi Wang & Xiyin Wang, 2022. "Prediction of COVID-19 Data Using an ARIMA-LSTM Hybrid Forecast Model," Mathematics, MDPI, vol. 10(21), pages 1-13, October.
    16. Mandal, Manotosh & Jana, Soovoojeet & Nandi, Swapan Kumar & Khatua, Anupam & Adak, Sayani & Kar, T.K., 2020. "A model based study on the dynamics of COVID-19: Prediction and control," Chaos, Solitons & Fractals, Elsevier, vol. 136(C).
    17. Christian Acal & Manuel Escabias & Ana M. Aguilera & Mariano J. Valderrama, 2021. "COVID-19 Data Imputation by Multiple Function-on-Function Principal Component Regression," Mathematics, MDPI, vol. 9(11), pages 1-23, May.
    18. Gilgur, Alexander & Ramirez-Marquez, Jose Emmanuel, 2022. "Modeling mobility, risk, and pandemic severity during the first year of COVID," Socio-Economic Planning Sciences, Elsevier, vol. 84(C).
    19. Barraza, Néstor Ruben & Pena, Gabriel & Moreno, Verónica, 2020. "A non-homogeneous Markov early epidemic growth dynamics model. Application to the SARS-CoV-2 pandemic," Chaos, Solitons & Fractals, Elsevier, vol. 139(C).
    20. Jiří Mazurek, 2021. "The evaluation of COVID-19 prediction precision with a Lyapunov-like exponent," PLOS ONE, Public Library of Science, vol. 16(5), pages 1-9, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:chsofr:v:156:y:2022:i:c:s0960077921011334. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Thayer, Thomas R. (email available below). General contact details of provider: https://www.journals.elsevier.com/chaos-solitons-and-fractals .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.