IDEAS home Printed from https://ideas.repec.org/a/sae/medema/v43y2023i6p719-736.html
   My bibliography  Save this article

Calibration and Validation of the Colorectal Cancer and Adenoma Incidence and Mortality (CRC-AIM) Microsimulation Model Using Deep Neural Networks

Author

Listed:
  • Vahab Vahdat

    (Health Economics and Outcome Research, Exact Sciences Corporation, Madison, WI, USA)

  • Oguzhan Alagoz

    (Departments of Industrial & Systems Engineering and Population Health Sciences, University of Wisconsin–Madison, Madison, WI, USA)

  • Jing Voon Chen

    (Health Economics and Outcome Research, Exact Sciences Corporation, Madison, WI, USA)

  • Leila Saoud

    (Health Economics and Outcome Research, Exact Sciences Corporation, Madison, WI, USA)

  • Bijan J. Borah

    (Division of Health Care Delivery Research, Mayo Clinic, Rochester, MN, USA)

  • Paul J. Limburg

    (Health Economics and Outcome Research, Exact Sciences Corporation, Madison, WI, USA)

Abstract

Objectives Machine learning (ML)–based emulators improve the calibration of decision-analytical models, but their performance in complex microsimulation models is yet to be determined. Methods We demonstrated the use of an ML-based emulator with the Colorectal Cancer (CRC)-Adenoma Incidence and Mortality (CRC-AIM) model, which includes 23 unknown natural history input parameters to replicate the CRC epidemiology in the United States. We first generated 15,000 input combinations and ran the CRC-AIM model to evaluate CRC incidence, adenoma size distribution, and the percentage of small adenoma detected by colonoscopy. We then used this data set to train several ML algorithms, including deep neural network (DNN), random forest, and several gradient boosting variants (i.e., XGBoost, LightGBM, CatBoost) and compared their performance. We evaluated 10 million potential input combinations using the selected emulator and examined input combinations that best estimated observed calibration targets. Furthermore, we cross-validated outcomes generated by the CRC-AIM model with those made by CISNET models. The calibrated CRC-AIM model was externally validated using the United Kingdom Flexible Sigmoidoscopy Screening Trial (UKFSST). Results The DNN with proper preprocessing outperformed other tested ML algorithms and successfully predicted all 8 outcomes for different input combinations. It took 473 s for the trained DNN to predict outcomes for 10 million inputs, which would have required 190 CPU-years without our DNN. The overall calibration process took 104 CPU-days, which included building the data set, training, selecting, and hyperparameter tuning of the ML algorithms. While 7 input combinations had acceptable fit to the targets, a combination that best fits all outcomes was selected as the best vector. Almost all of the predictions made by the best vector laid within those from the CISNET models, demonstrating CRC-AIM’s cross-model validity. Similarly, CRC-AIM accurately predicted the hazard ratios of CRC incidence and mortality as reported by UKFSST, demonstrating its external validity. Examination of the impact of calibration targets suggested that the selection of the calibration target had a substantial impact on model outcomes in terms of life-year gains with screening. Conclusions Emulators such as a DNN that is meticulously selected and trained can substantially reduce the computational burden of calibrating complex microsimulation models. Highlights Calibrating a microsimulation model, a process to find unobservable parameters so that the model fits observed data, is computationally complex. We used a deep neural network model, a popular machine learning algorithm, to calibrate the Colorectal Cancer Adenoma Incidence and Mortality (CRC-AIM) model. We demonstrated that our approach provides an efficient and accurate method to significantly speed up calibration in microsimulation models. The calibration process successfully provided cross-model validation of CRC-AIM against 3 established CISNET models and also externally validated against a randomized controlled trial.

Suggested Citation

  • Vahab Vahdat & Oguzhan Alagoz & Jing Voon Chen & Leila Saoud & Bijan J. Borah & Paul J. Limburg, 2023. "Calibration and Validation of the Colorectal Cancer and Adenoma Incidence and Mortality (CRC-AIM) Microsimulation Model Using Deep Neural Networks," Medical Decision Making, , vol. 43(6), pages 719-736, August.
  • Handle: RePEc:sae:medema:v:43:y:2023:i:6:p:719-736
    DOI: 10.1177/0272989X231184175
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0272989X231184175
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0272989X231184175?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Mucahit Cevik & Mehmet Ali Ergun & Natasha K. Stout & Amy Trentham-Dietz & Mark Craven & Oguzhan Alagoz, 2016. "Using Active Learning for Speeding up Calibration in Simulation Models," Medical Decision Making, , vol. 36(5), pages 581-593, July.
    2. Theresa Reiker & Monica Golumbeanu & Andrew Shattock & Lydia Burgert & Thomas A. Smith & Sarah Filippi & Ewan Cameron & Melissa A. Penny, 2021. "Emulator-based Bayesian optimization for efficient multi-objective calibration of an individual-based model of malaria," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    3. Aditya Sai & Carolina Vivas-Valencia & Thomas F. Imperiale & Nan Kong, 2019. "Multiobjective Calibration of Disease Simulation Models Using Gaussian Processes," Medical Decision Making, , vol. 39(5), pages 540-552, July.
    4. Fernando Alarid-Escudero & Richard F. MacLehose & Yadira Peralta & Karen M. Kuntz & Eva A. Enns, 2018. "Nonidentifiability in Model Calibration and Implications for Medical Decision Making," Medical Decision Making, , vol. 38(7), pages 810-821, October.
    5. Theresa Ryckman & Stephen Luby & Douglas K. Owens & Eran Bendavid & Jeremy D. Goldhaber-Fiebert, 2020. "Methods for Model Calibration under High Uncertainty: Modeling Cholera in Bangladesh," Medical Decision Making, , vol. 40(5), pages 693-709, July.
    6. Nicolas A. Menzies & Djøra I. Soeteman & Ankur Pandya & Jane J. Kim, 2017. "Bayesian Methods for Calibrating Health Policy Models: A Tutorial," PharmacoEconomics, Springer, vol. 35(6), pages 613-624, June.
    7. Rutter, Carolyn M. & Miglioretti, Diana L. & Savarino, James E., 2009. "Bayesian Calibration of Microsimulation Models," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1338-1350.
    8. Jie-Bin Lew & Marjolein J. E. Greuter & Michael Caruana & Emily He & Joachim Worthington & D. James St John & Finlay A. Macrae & Eleonora Feletto & Veerle M. H. Coupé & Karen Canfell, 2020. "Validation of Microsimulation Models against Alternative Model Predictions and Long-Term Colorectal Cancer Incidence and Mortality Outcomes of Randomized Controlled Trials," Medical Decision Making, , vol. 40(6), pages 815-829, August.
    9. Marjolein J. E. Greuter & Xiang‐Ming Xu & Jie‐Bin Lew & Evelien Dekker & Ernst J. Kuipers & Karen Canfell & Gerrit A. Meijer & Veerle M. H. Coupé, 2014. "Modeling the Adenoma and Serrated Pathway to Colorectal CAncer (ASCCA)," Risk Analysis, John Wiley & Sons, vol. 34(5), pages 889-910, May.
    10. Carolina Barbosa & William N. Dowd & Arnie P. Aldridge & Christine Timko & Gary A. Zarkin, 2019. "Estimating Long-Term Drinking Patterns for People with Lifetime Alcohol Use Disorder," Medical Decision Making, , vol. 39(7), pages 765-780, October.
    11. Peter Shewmaker & Stavroula A. Chrysanthopoulou & Rowan Iskandar & Derek Lake & Earic Jutkowitz, 2022. "Microsimulation Model Calibration with Approximate Bayesian Computation in R: A Tutorial," Medical Decision Making, , vol. 42(5), pages 557-570, July.
    12. C Marijn Hazelbag & Jonathan Dushoff & Emanuel M Dominic & Zinhle E Mthombothi & Wim Delva, 2020. "Calibration of individual-based models to epidemiological data: A systematic review," PLOS Computational Biology, Public Library of Science, vol. 16(5), pages 1-17, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Penny R. Breeze & Hazel Squires & Kate Ennis & Petra Meier & Kate Hayes & Nik Lomax & Alan Shiell & Frank Kee & Frank de Vocht & Martin O’Flaherty & Nigel Gilbert & Robin Purshouse & Stewart Robinson , 2023. "Guidance on the use of complex systems models for economic evaluations of public health interventions," Health Economics, John Wiley & Sons, Ltd., vol. 32(7), pages 1603-1625, July.
    2. Jing Voon Chen & Julia L. Higle & Michael Hintlian, 2018. "A systematic approach for examining the impact of calibration uncertainty in disease modeling," Computational Management Science, Springer, vol. 15(3), pages 541-561, October.
    3. Douglas Taylor & Vivek Pawar & Denise Kruzikas & Kristen Gilmore & Myrlene Sanon & Milton Weinstein, 2012. "Incorporating Calibrated Model Parameters into Sensitivity Analyses," PharmacoEconomics, Springer, vol. 30(2), pages 119-126, February.
    4. Chaitanya Kaligotla & Jonathan Ozik & Nicholson Collier & Charles M. Macal & Kelly Boyd & Jennifer Makelarski & Elbert S. Huang & Stacy T. Lindau, 2020. "Model Exploration of an Information-Based Healthcare Intervention Using Parallelization and Active Learning," Journal of Artificial Societies and Social Simulation, Journal of Artificial Societies and Social Simulation, vol. 23(4), pages 1-1.
    5. Pablo M. Salazar & Alice Kamau & Aurelien Cavelan & Samuel Akech & Arthur Mpimbaza & Robert W. Snow & Melissa A. Penny, 2024. "Severe outcomes of malaria in children under time-varying exposure," Nature Communications, Nature, vol. 15(1), pages 1-9, December.
    6. Sophie Whyte & Cathal Walsh & Jim Chilcott, 2011. "Bayesian Calibration of a Natural History Model with Application to a Population Model for Colorectal Cancer," Medical Decision Making, , vol. 31(4), pages 625-641, July.
    7. Alex van der Steen & Joost van Rosmalen & Sonja Kroep & Frank van Hees & Ewout W. Steyerberg & Harry J. de Koning & Marjolein van Ballegooijen & Iris Lansdorp-Vogelaar, 2016. "Calibrating Parameters for Microsimulation Disease Models," Medical Decision Making, , vol. 36(5), pages 652-665, July.
    8. Theresa Reiker & Monica Golumbeanu & Andrew Shattock & Lydia Burgert & Thomas A. Smith & Sarah Filippi & Ewan Cameron & Melissa A. Penny, 2021. "Emulator-based Bayesian optimization for efficient multi-objective calibration of an individual-based model of malaria," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    9. Maria DeYoreo & Iris Lansdorp-Vogelaar & Amy B. Knudsen & Karen M. Kuntz & Ann G. Zauber & Carolyn M. Rutter, 2020. "Validation of Colorectal Cancer Models on Long-term Outcomes from a Randomized Controlled Trial," Medical Decision Making, , vol. 40(8), pages 1034-1040, November.
    10. Beate Jahn & Jovan Todorovic & Marvin Bundo & Gaby Sroczynski & Annette Conrads-Frank & Ursula Rochau & Gottfried Endel & Ingrid Wilbacher & Nikoletta Malbaski & Niki Popper & Jagpreet Chhatwal & Dan , 2019. "Budget Impact Analysis of Cancer Screening: A Methodological Review," Applied Health Economics and Health Policy, Springer, vol. 17(4), pages 493-511, August.
    11. David Niyukuri & Trust Chibawara & Peter Suwirakwenda Nyasulu & Wim Delva, 2021. "Inferring HIV Transmission Network Determinants Using Agent-Based Models Calibrated to Multi-Data Sources," Mathematics, MDPI, vol. 9(21), pages 1-33, October.
    12. C Marijn Hazelbag & Jonathan Dushoff & Emanuel M Dominic & Zinhle E Mthombothi & Wim Delva, 2020. "Calibration of individual-based models to epidemiological data: A systematic review," PLOS Computational Biology, Public Library of Science, vol. 16(5), pages 1-17, May.
    13. Eleanor J. Murray & James M. Robins & George R. Seage III & Sara Lodi & Emily P. Hyle & Krishna P. Reddy & Kenneth A. Freedberg & Miguel A. Hernán, 2018. "Using Observational Data to Calibrate Simulation Models," Medical Decision Making, , vol. 38(2), pages 212-224, February.
    14. Alaa Tharwat & Wolfram Schenck, 2023. "A Survey on Active Learning: State-of-the-Art, Practical Challenges and Research Directions," Mathematics, MDPI, vol. 11(4), pages 1-38, February.
    15. Arias Chao, Manuel & Kulkarni, Chetan & Goebel, Kai & Fink, Olga, 2022. "Fusing physics-based and deep learning models for prognostics," Reliability Engineering and System Safety, Elsevier, vol. 217(C).
    16. Stavroula A Chrysanthopoulou, 2017. "MILC: A Microsimulation Model of the Natural History of Lung Cancer," International Journal of Microsimulation, International Microsimulation Association, vol. 10(3), pages 5-26.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:medema:v:43:y:2023:i:6:p:719-736. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.