IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2405.13926.html
   My bibliography  Save this paper

Some models are useful, but for how long?: A decision theoretic approach to choosing when to refit large-scale prediction models

Author

Listed:
  • Kentaro Hoffman
  • Stephen Salerno
  • Jeff Leek
  • Tyler McCormick

Abstract

Large-scale prediction models (typically using tools from artificial intelligence, AI, or machine learning, ML) are increasingly ubiquitous across a variety of industries and scientific domains. Such methods are often paired with detailed data from sources such as electronic health records, wearable sensors, and omics data (high-throughput technology used to understand biology). Despite their utility, implementing AI and ML tools at the scale necessary to work with this data introduces two major challenges. First, it can cost tens of thousands of dollars to train a modern AI/ML model at scale. Second, once the model is trained, its predictions may become less relevant as patient and provider behavior change, and predictions made for one geographical area may be less accurate for another. These two challenges raise a fundamental question: how often should you refit the AI/ML model to optimally trade-off between cost and relevance? Our work provides a framework for making decisions about when to {\it refit} AI/ML models when the goal is to maintain valid statistical inference (e.g. estimating a treatment effect in a clinical trial). Drawing on portfolio optimization theory, we treat the decision of {\it recalibrating} versus {\it refitting} the model as a choice between ''investing'' in one of two ''assets.'' One asset, recalibrating the model based on another model, is quick and relatively inexpensive but bears uncertainty from sampling and the possibility that the other model is not relevant to current circumstances. The other asset, {\it refitting} the model, is costly but removes the irrelevance concern (though not the risk of sampling error). We explore the balancing act between these two potential investments in this paper.

Suggested Citation

  • Kentaro Hoffman & Stephen Salerno & Jeff Leek & Tyler McCormick, 2024. "Some models are useful, but for how long?: A decision theoretic approach to choosing when to refit large-scale prediction models," Papers 2405.13926, arXiv.org.
  • Handle: RePEc:arx:papers:2405.13926
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2405.13926
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Fabio Maccheroni & Massimo Marinacci & Doriana Ruffino, 2013. "Alpha as Ambiguity: Robust Mean‐Variance Portfolio Analysis," Econometrica, Econometric Society, vol. 81(3), pages 1075-1113, May.
    2. Stéphane Bonhomme & Martin Weidner, 2022. "Minimizing sensitivity to model misspecification," Quantitative Economics, Econometric Society, vol. 13(3), pages 907-954, July.
    3. Sergey I. Nikolenko, 2021. "Deep Learning and Optimization," Springer Optimization and Its Applications, in: Synthetic Data for Deep Learning, chapter 0, pages 19-58, Springer.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jewitt, Ian & Mukerji, Sujoy, 2017. "Ordering ambiguous acts," Journal of Economic Theory, Elsevier, vol. 171(C), pages 213-267.
    2. José Valentim Machado Vicente & Jaqueline Terra Moura Marins, 2019. "A Volatility Smile-Based Uncertainty Index," Working Papers Series 502, Central Bank of Brazil, Research Department.
    3. L. A. Franzoni, 2016. "Optimal liability design under risk and ambiguity," Working Papers wp1048, Dipartimento Scienze Economiche, Universita' di Bologna.
    4. Albrecht, E & Baum, Günter & Birsa, R & Bradamante, F & Bressan, A & Chapiro, A & Cicuttin, A & Ciliberti, P & Colavita, A & Costa, S & Crespo, M & Cristaudo, P & Dalla Torre, S & Diaz, V & Duic, V &, 2010. "Results from COMPASS RICH-1," Center for Mathematical Economics Working Papers 535, Center for Mathematical Economics, Bielefeld University.
    5. Sujoy Mukerji & Han N. Ozsoylev & Jean‐Marc Tallon, 2023. "Trading Ambiguity: A Tale Of Two Heterogeneities," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 64(3), pages 1127-1164, August.
    6. Hill, Brian & Michalski, Tomasz, 2018. "Risk versus ambiguity and international security design," Journal of International Economics, Elsevier, vol. 113(C), pages 74-105.
    7. Oliver Walker & Simon Dietz, 2012. "Ambiguity and insurance: robust capital requirements and premiums," GRI Working Papers 97, Grantham Research Institute on Climate Change and the Environment.
    8. He, Ying & Dyer, James S. & Butler, John C. & Jia, Jianmin, 2019. "An additive model of decision making under risk and ambiguity," Journal of Mathematical Economics, Elsevier, vol. 85(C), pages 78-92.
    9. Fabrice Collard & Sujoy Mukerji & Kevin Sheppard & Jean‐Marc Tallon, 2018. "Ambiguity and the historical equity premium," Quantitative Economics, Econometric Society, vol. 9(2), pages 945-993, July.
    10. Loïc Berger, 2014. "The Impact of Ambiguity Prudence on Insurance and Prevention," Working Papers ECARES ECARES 2014-08, ULB -- Universite Libre de Bruxelles.
    11. Marco Nieddu & Lorenzo Pandolfi, 2018. "Cutting Through the Fog: Financial Literacy and the Subjective Value of Financial Assets," CSEF Working Papers 497, Centre for Studies in Economics and Finance (CSEF), University of Naples, Italy.
    12. Andrei Zeleneev & Kirill Evdokimov, 2023. "Simple estimation of semiparametric models with measurement errors," CeMMAP working papers 10/23, Institute for Fiscal Studies.
    13. Gábor-Tóth, Enikő & Georgarakos, Dimitris, 2018. "Economic policy uncertainty and stock market participation," CFS Working Paper Series 590, Center for Financial Studies (CFS).
    14. Kirill S. Evdokimov & Andrei Zeleneev, 2023. "Simple Estimation of Semiparametric Models with Measurement Errors," Papers 2306.14311, arXiv.org, revised Mar 2024.
    15. Wakai, Katsutoshi, 2014. "Observational equivalence and nonequivalence of subjective and robust mean–variance preferences," Economics Letters, Elsevier, vol. 124(2), pages 219-221.
    16. Cherbonnier, Frédéric & Gollier, Christian, 2015. "Decreasing aversion under ambiguity," Journal of Economic Theory, Elsevier, vol. 157(C), pages 606-623.
    17. Ilke Aydogan & Loic Berger & Valentina Bosetti & Ning Liu, 2018. "Three layers of uncertainty: an experiment," Working Papers 623, IGIER (Innocenzo Gasparini Institute for Economic Research), Bocconi University.
    18. Gertsman, Gleb & Frehen, Rik & Werker, Bas J.M., 2019. "Would Ambiguity Averse Investors Hedge Risk in Equity Markets?," Other publications TiSEM bd3eb3e5-517e-40d4-aab9-e, Tilburg University, School of Economics and Management.
    19. Paolo Caro, 2015. "Risk, ambiguity, and sovereign rating," International Economics and Economic Policy, Springer, vol. 12(1), pages 41-57, March.
    20. Jens Klooster & Mikhail Zhelonkin, 2024. "Outlier robust inference in the instrumental variable model with applications to causal effects," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(1), pages 86-106, January.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2405.13926. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.