IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2405.13926.html
   My bibliography  Save this paper

Some models are useful, but for how long?: A decision theoretic approach to choosing when to refit large-scale prediction models

Author

Listed:
  • Kentaro Hoffman
  • Stephen Salerno
  • Jeff Leek
  • Tyler McCormick

Abstract

Large-scale prediction models using tools from artificial intelligence (AI) or machine learning (ML) are increasingly common across a variety of industries and scientific domains. Despite their effectiveness, training AI and ML tools at scale can cost tens or hundreds of thousands of dollars (or more); and even after a model is trained, substantial resources must be invested to keep models up-to-date. This paper presents a decision-theoretic framework for deciding when to refit an AI/ML model when the goal is to perform unbiased statistical inference using partially AI/ML-generated data. Drawing on portfolio optimization theory, we treat the decision of {\it recalibrating} a model or statistical inference versus {\it refitting} the model as a choice between ``investing'' in one of two ``assets.'' One asset, recalibrating the model based on another model, is quick and relatively inexpensive but bears uncertainty from sampling and may not be robust to model drift. The other asset, {\it refitting} the model, is costly but removes the drift concern (though not statistical uncertainty from sampling). We present a framework for balancing these two potential investments while preserving statistical validity. We evaluate the framework using simulation and data on electricity usage and predicting flu trends.

Suggested Citation

  • Kentaro Hoffman & Stephen Salerno & Jeff Leek & Tyler McCormick, 2024. "Some models are useful, but for how long?: A decision theoretic approach to choosing when to refit large-scale prediction models," Papers 2405.13926, arXiv.org, revised Jan 2025.
  • Handle: RePEc:arx:papers:2405.13926
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2405.13926
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Fabio Maccheroni & Massimo Marinacci & Doriana Ruffino, 2013. "Alpha as Ambiguity: Robust Mean‐Variance Portfolio Analysis," Econometrica, Econometric Society, vol. 81(3), pages 1075-1113, May.
    2. Stéphane Bonhomme & Martin Weidner, 2022. "Minimizing sensitivity to model misspecification," Quantitative Economics, Econometric Society, vol. 13(3), pages 907-954, July.
    3. Sergey I. Nikolenko, 2021. "Deep Learning and Optimization," Springer Optimization and Its Applications, in: Synthetic Data for Deep Learning, chapter 0, pages 19-58, Springer.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jewitt, Ian & Mukerji, Sujoy, 2017. "Ordering ambiguous acts," Journal of Economic Theory, Elsevier, vol. 171(C), pages 213-267.
    2. José Valentim Machado Vicente & Jaqueline Terra Moura Marins, 2019. "A Volatility Smile-Based Uncertainty Index," Working Papers Series 502, Central Bank of Brazil, Research Department.
    3. L. A. Franzoni, 2016. "Optimal liability design under risk and ambiguity," Working Papers wp1048, Dipartimento Scienze Economiche, Universita' di Bologna.
    4. Albrecht, E & Baum, Günter & Birsa, R & Bradamante, F & Bressan, A & Chapiro, A & Cicuttin, A & Ciliberti, P & Colavita, A & Costa, S & Crespo, M & Cristaudo, P & Dalla Torre, S & Diaz, V & Duic, V &, 2010. "Results from COMPASS RICH-1," Center for Mathematical Economics Working Papers 535, Center for Mathematical Economics, Bielefeld University.
    5. Sujoy Mukerji & Han N. Ozsoylev & Jean‐Marc Tallon, 2023. "Trading Ambiguity: A Tale Of Two Heterogeneities," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 64(3), pages 1127-1164, August.
    6. Hill, Brian & Michalski, Tomasz, 2018. "Risk versus ambiguity and international security design," Journal of International Economics, Elsevier, vol. 113(C), pages 74-105.
    7. Oliver Walker & Simon Dietz, 2012. "Ambiguity and insurance: robust capital requirements and premiums," GRI Working Papers 97, Grantham Research Institute on Climate Change and the Environment.
    8. He, Ying & Dyer, James S. & Butler, John C. & Jia, Jianmin, 2019. "An additive model of decision making under risk and ambiguity," Journal of Mathematical Economics, Elsevier, vol. 85(C), pages 78-92.
    9. Fabrice Collard & Sujoy Mukerji & Kevin Sheppard & Jean‐Marc Tallon, 2018. "Ambiguity and the historical equity premium," Quantitative Economics, Econometric Society, vol. 9(2), pages 945-993, July.
    10. Loïc Berger, 2014. "The Impact of Ambiguity Prudence on Insurance and Prevention," Working Papers ECARES ECARES 2014-08, ULB -- Universite Libre de Bruxelles.
    11. Marco Nieddu & Lorenzo Pandolfi, 2018. "Cutting Through the Fog: Financial Literacy and the Subjective Value of Financial Assets," CSEF Working Papers 497, Centre for Studies in Economics and Finance (CSEF), University of Naples, Italy.
    12. Andrei Zeleneev & Kirill Evdokimov, 2023. "Simple estimation of semiparametric models with measurement errors," CeMMAP working papers 10/23, Institute for Fiscal Studies.
    13. Gábor-Tóth, Enikő & Georgarakos, Dimitris, 2018. "Economic policy uncertainty and stock market participation," CFS Working Paper Series 590, Center for Financial Studies (CFS).
    14. Kirill S. Evdokimov & Andrei Zeleneev, 2023. "Simple Estimation of Semiparametric Models with Measurement Errors," Papers 2306.14311, arXiv.org, revised Jan 2025.
    15. Wakai, Katsutoshi, 2014. "Observational equivalence and nonequivalence of subjective and robust mean–variance preferences," Economics Letters, Elsevier, vol. 124(2), pages 219-221.
    16. Cherbonnier, Frédéric & Gollier, Christian, 2015. "Decreasing aversion under ambiguity," Journal of Economic Theory, Elsevier, vol. 157(C), pages 606-623.
    17. Ilke Aydogan & Loic Berger & Valentina Bosetti & Ning Liu, 2018. "Three layers of uncertainty: an experiment," Working Papers 623, IGIER (Innocenzo Gasparini Institute for Economic Research), Bocconi University.
    18. Gertsman, Gleb & Frehen, Rik & Werker, Bas J.M., 2019. "Would Ambiguity Averse Investors Hedge Risk in Equity Markets?," Other publications TiSEM bd3eb3e5-517e-40d4-aab9-e, Tilburg University, School of Economics and Management.
    19. Paolo Caro, 2015. "Risk, ambiguity, and sovereign rating," International Economics and Economic Policy, Springer, vol. 12(1), pages 41-57, March.
    20. Jens Klooster & Mikhail Zhelonkin, 2024. "Outlier robust inference in the instrumental variable model with applications to causal effects," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(1), pages 86-106, January.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2405.13926. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.