IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1008837.html
   My bibliography  Save this article

Pandemic velocity: Forecasting COVID-19 in the US with a machine learning & Bayesian time series compartmental model

Author

Listed:
  • Gregory L Watson
  • Di Xiong
  • Lu Zhang
  • Joseph A Zoller
  • John Shamshoian
  • Phillip Sundin
  • Teresa Bufford
  • Anne W Rimoin
  • Marc A Suchard
  • Christina M Ramirez

Abstract

Predictions of COVID-19 case growth and mortality are critical to the decisions of political leaders, businesses, and individuals grappling with the pandemic. This predictive task is challenging due to the novelty of the virus, limited data, and dynamic political and societal responses. We embed a Bayesian time series model and a random forest algorithm within an epidemiological compartmental model for empirically grounded COVID-19 predictions. The Bayesian case model fits a location-specific curve to the velocity (first derivative) of the log transformed cumulative case count, borrowing strength across geographic locations and incorporating prior information to obtain a posterior distribution for case trajectories. The compartmental model uses this distribution and predicts deaths using a random forest algorithm trained on COVID-19 data and population-level characteristics, yielding daily projections and interval estimates for cases and deaths in U.S. states. We evaluated the model by training it on progressively longer periods of the pandemic and computing its predictive accuracy over 21-day forecasts. The substantial variation in predicted trajectories and associated uncertainty between states is illustrated by comparing three unique locations: New York, Colorado, and West Virginia. The sophistication and accuracy of this COVID-19 model offer reliable predictions and uncertainty estimates for the current trajectory of the pandemic in the U.S. and provide a platform for future predictions as shifting political and societal responses alter its course.Author summary: COVID-19 models can be roughly classified as mathematical models that simulate disease within a population, including epidemiological compartmental models, or statistical curve-fitting models that fit a function to observed data and extrapolate forward into the future. Bridging this divide, we combine the strengths of curve-fitting statistical models and the structure of epidemiological models, by embedding a Bayesian velocity model and a machine learning algorithm (random forest) into the framework of a compartmental model. Fusing these models together exploits the particular strengths of each to glean as much information as possible from the currently available data. We identify the velocity of log cumulative cases as an excellent target for modeling and extrapolating COVID-19 case trajectories. We empirically evaluate the predictive performance of the model and provide predicted trajectories with credible intervals for cumulative confirmed case count, active confirmed infections and COVID-19 deaths for each of the 50 U.S. states. Combining sophisticated data analytic methods with proven epidemiological models offers an empirically grounded strategy for making realistic predictions and quantifying their uncertainty. These predictions indicate substantial variation in the COVID-19 trajectories of U.S. states.

Suggested Citation

  • Gregory L Watson & Di Xiong & Lu Zhang & Joseph A Zoller & John Shamshoian & Phillip Sundin & Teresa Bufford & Anne W Rimoin & Marc A Suchard & Christina M Ramirez, 2021. "Pandemic velocity: Forecasting COVID-19 in the US with a machine learning & Bayesian time series compartmental model," PLOS Computational Biology, Public Library of Science, vol. 17(3), pages 1-20, March.
  • Handle: RePEc:plo:pcbi00:1008837
    DOI: 10.1371/journal.pcbi.1008837
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1008837
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1008837&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1008837?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. David Berger & Kyle Herkenhoff & Chengdai Huang & Simon Mongey, 2022. "Testing and Reopening in an SEIR Model," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 43, pages 1-21, January.
    2. Zhang, Xiaolei & Ma, Renjun & Wang, Lin, 2020. "Predicting turning point, duration and attack rate of COVID-19 outbreaks in major Western countries," Chaos, Solitons & Fractals, Elsevier, vol. 135(C).
    3. Roman Wölfel & Victor M. Corman & Wolfgang Guggemos & Michael Seilmaier & Sabine Zange & Marcel A. Müller & Daniela Niemeyer & Terry C. Jones & Patrick Vollmar & Camilla Rothe & Michael Hoelscher & To, 2020. "Virological assessment of hospitalized patients with COVID-2019," Nature, Nature, vol. 581(7809), pages 465-469, May.
    4. Ndaïrou, Faïçal & Area, Iván & Nieto, Juan J. & Torres, Delfim F.M., 2020. "Mathematical modeling of COVID-19 transmission dynamics with a case study of Wuhan," Chaos, Solitons & Fractals, Elsevier, vol. 135(C).
    5. Soetaert, Karline & Petzoldt, Thomas & Setzer, R. Woodrow, 2010. "Solving Differential Equations in R: Package deSolve," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i09).
    6. Cleo Anastassopoulou & Lucia Russo & Athanasios Tsakris & Constantinos Siettos, 2020. "Data-based analysis, modelling and forecasting of the COVID-19 outbreak," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-21, March.
    7. Fotios Petropoulos & Spyros Makridakis, 2020. "Forecasting the novel coronavirus COVID-19," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-8, March.
    8. Sheryl L. Chang & Nathan Harding & Cameron Zachreson & Oliver M. Cliff & Mikhail Prokopenko, 2020. "Modelling transmission and control of the COVID-19 pandemic in Australia," Nature Communications, Nature, vol. 11(1), pages 1-13, December.
    9. Hyndman, Rob J. & Koehler, Anne B., 2006. "Another look at measures of forecast accuracy," International Journal of Forecasting, Elsevier, vol. 22(4), pages 679-688.
    10. Sepehr Rafieenasab & Amir-Pouyan Zahiri & Ehsan Roohi, 2020. "Prediction of peak and termination of novel coronavirus COVID-19 epidemic in Iran," International Journal of Modern Physics C (IJMPC), World Scientific Publishing Co. Pte. Ltd., vol. 31(11), pages 1-17, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yang Ye & Abhishek Pandey & Carolyn Bawden & Dewan Md. Sumsuzzman & Rimpi Rajput & Affan Shoukat & Burton H. Singer & Seyed M. Moghadas & Alison P. Galvani, 2025. "Integrating artificial intelligence with mechanistic epidemiological modeling: a scoping review of opportunities and challenges," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    2. Roberto Vega & Leonardo Flores & Russell Greiner, 2022. "SIMLR: Machine Learning inside the SIR Model for COVID-19 Forecasting," Forecasting, MDPI, vol. 4(1), pages 1-23, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nathan H. Schumaker & Sydney M. Watkins, 2021. "Adding Space to Disease Models: A Case Study with COVID-19 in Oregon, USA," Land, MDPI, vol. 10(4), pages 1-13, April.
    2. Waychal, Nachiketas & Laha, Arnab Kumar & Sinha, Ankur, 2022. "Customized forecasting with Adaptive Ensemble Generator," IIMA Working Papers WP 2022-06-04, Indian Institute of Management Ahmedabad, Research and Publication Department.
    3. Masum, Mohammad & Masud, M.A. & Adnan, Muhaiminul Islam & Shahriar, Hossain & Kim, Sangil, 2022. "Comparative study of a mathematical epidemic model, statistical modeling, and deep learning for COVID-19 forecasting and management," Socio-Economic Planning Sciences, Elsevier, vol. 80(C).
    4. Cooper, Ian & Mondal, Argha & Antonopoulos, Chris G., 2020. "Dynamic tracking with model-based forecasting for the spread of the COVID-19 pandemic," Chaos, Solitons & Fractals, Elsevier, vol. 139(C).
    5. Cooper, Ian & Mondal, Argha & Antonopoulos, Chris G., 2020. "A SIR model assumption for the spread of COVID-19 in different communities," Chaos, Solitons & Fractals, Elsevier, vol. 139(C).
    6. Memon, Zaibunnisa & Qureshi, Sania & Memon, Bisharat Rasool, 2021. "Assessing the role of quarantine and isolation as control strategies for COVID-19 outbreak: A case study," Chaos, Solitons & Fractals, Elsevier, vol. 144(C).
    7. Amaral, Marco A. & Oliveira, Marcelo M. de & Javarone, Marco A., 2021. "An epidemiological model with voluntary quarantine strategies governed by evolutionary game dynamics," Chaos, Solitons & Fractals, Elsevier, vol. 143(C).
    8. Swapnarekha, H. & Behera, Himansu Sekhar & Nayak, Janmenjoy & Naik, Bighnaraj, 2020. "Role of intelligent computing in COVID-19 prognosis: A state-of-the-art review," Chaos, Solitons & Fractals, Elsevier, vol. 138(C).
    9. Huang, Chiou-Jye & Shen, Yamin & Kuo, Ping-Huan & Chen, Yung-Hsiang, 2022. "Novel spatiotemporal feature extraction parallel deep neural network for forecasting confirmed cases of coronavirus disease 2019," Socio-Economic Planning Sciences, Elsevier, vol. 80(C).
    10. Nagel, Kai & Rakow, Christian & Müller, Sebastian A., 2021. "Realistic agent-based simulation of infection dynamics and percolation," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 584(C).
    11. Pongou, Roland & Tchuente, Guy & Tondji, Jean-Baptiste, 2021. "Optimally Targeting Interventions in Networks during a Pandemic: Theory and Evidence from the Networks of Nursing Homes in the United States," GLO Discussion Paper Series 957, Global Labor Organization (GLO).
    12. Daniel K Sewell & Aaron Miller & for the CDC MInD-Healthcare Program, 2020. "Simulation-free estimation of an individual-based SEIR model for evaluating nonpharmaceutical interventions with an application to COVID-19 in the District of Columbia," PLOS ONE, Public Library of Science, vol. 15(11), pages 1-18, November.
    13. Rabih Ghostine & Mohamad Gharamti & Sally Hassrouny & Ibrahim Hoteit, 2021. "Mathematical Modeling of Immune Responses against SARS-CoV-2 Using an Ensemble Kalman Filter," Mathematics, MDPI, vol. 9(19), pages 1-13, September.
    14. Koutsellis, Themistoklis & Nikas, Alexandros, 2020. "A predictive model and country risk assessment for COVID-19: An application of the Limited Failure Population concept," Chaos, Solitons & Fractals, Elsevier, vol. 140(C).
    15. Noureddine Ouerfelli & Narcisa Vrinceanu & Diana Coman & Adriana Lavinia Cioca, 2022. "Empirical Modeling of COVID-19 Evolution with High/Direct Impact on Public Health and Risk Assessment," IJERPH, MDPI, vol. 19(6), pages 1-13, March.
    16. Berta, P. & Lovaglio, P.G. & Paruolo, P. & Verzillo, S., 2020. "Real Time Forecasting of Covid-19 Intensive Care Units demand," Health, Econometrics and Data Group (HEDG) Working Papers 20/16, HEDG, c/o Department of Economics, University of York.
    17. Semenoglou, Artemios-Anargyros & Spiliotis, Evangelos & Makridakis, Spyros & Assimakopoulos, Vassilios, 2021. "Investigating the accuracy of cross-learning time series forecasting methods," International Journal of Forecasting, Elsevier, vol. 37(3), pages 1072-1084.
    18. Dalton Garcia Borges de Souza & Erivelton Antonio dos Santos & Francisco Tarcísio Alves Júnior & Mariá Cristina Vasconcelos Nascimento, 2021. "On Comparing Cross-Validated Forecasting Models with a Novel Fuzzy-TOPSIS Metric: A COVID-19 Case Study," Sustainability, MDPI, vol. 13(24), pages 1-25, December.
    19. Konstantinos Demertzis & Dimitrios Tsiotas & Lykourgos Magafas, 2020. "Modeling and Forecasting the COVID-19 Temporal Spread in Greece: An Exploratory Approach Based on Complex Network Defined Splines," IJERPH, MDPI, vol. 17(13), pages 1-17, June.
    20. Sinitsyn, E. V. & Tolmachev, A. V. & Ovchinnikov, A. S., 2020. "Socio-economic factors in the spread of SARS-COV-2 across Russian regions," R-Economy, Ural Federal University, Graduate School of Economics and Management, vol. 6(3), pages 129-145.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1008837. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.