IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2002.05193.html
   My bibliography  Save this paper

A Hierarchy of Limitations in Machine Learning

Author

Listed:
  • Momin M. Malik

Abstract

"All models are wrong, but some are useful", wrote George E. P. Box (1979). Machine learning has focused on the usefulness of probability models for prediction in social systems, but is only now coming to grips with the ways in which these models are wrong---and the consequences of those shortcomings. This paper attempts a comprehensive, structured overview of the specific conceptual, procedural, and statistical limitations of models in machine learning when applied to society. Machine learning modelers themselves can use the described hierarchy to identify possible failure points and think through how to address them, and consumers of machine learning models can know what to question when confronted with the decision about if, where, and how to apply machine learning. The limitations go from commitments inherent in quantification itself, through to showing how unmodeled dependencies can lead to cross-validation being overly optimistic as a way of assessing model performance.

Suggested Citation

  • Momin M. Malik, 2020. "A Hierarchy of Limitations in Machine Learning," Papers 2002.05193, arXiv.org, revised Feb 2020.
  • Handle: RePEc:arx:papers:2002.05193
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2002.05193
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Spiegler,Peter, 2015. "Behind the Model," Cambridge Books, Cambridge University Press, number 9781107677807, September.
    2. Thomas Herndon & Michael Ash & Robert Pollin, 2014. "Does high public debt consistently stifle economic growth? A critique of Reinhart and Rogoff," Cambridge Journal of Economics, Cambridge Political Economy Society, vol. 38(2), pages 257-279.
    3. Stefan Wager & Susan Athey, 2018. "Estimation and Inference of Heterogeneous Treatment Effects using Random Forests," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1228-1242, July.
    4. Donald MacKenzie, 2006. "An Engine, Not a Camera: How Financial Models Shape Markets," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262134608, April.
    5. Jon Kleinberg & Jens Ludwig & Sendhil Mullainathan & Ziad Obermeyer, 2015. "Prediction Policy Problems," American Economic Review, American Economic Association, vol. 105(5), pages 491-495, May.
    6. Bergmeir, Christoph & Hyndman, Rob J. & Koo, Bonsoo, 2018. "A note on the validity of cross-validation for evaluating autoregressive time series prediction," Computational Statistics & Data Analysis, Elsevier, vol. 120(C), pages 70-83.
    7. Ryan J. Tibshirani & Saharon Rosset, 2019. "Excess Optimism: How Biased is the Apparent Error of an Estimator Tuned by SURE?," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(526), pages 697-712, April.
    8. Julian Reiss, 2013. "The explanation paradox redux," Journal of Economic Methodology, Taylor & Francis Journals, vol. 20(3), pages 280-292, September.
    9. Kehui Chen & Jing Lei, 2018. "Network Cross-Validation for Determining the Number of Communities in Network Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(521), pages 241-251, January.
    10. Michael Lavine, 2019. "Frequentist, Bayes, or Other?," The American Statistician, Taylor & Francis Journals, vol. 73(S1), pages 312-318, March.
    11. Zeeya Merali, 2010. "Computational science: ...Error," Nature, Nature, vol. 467(7317), pages 775-777, October.
    12. Xiaoyan Qiu & Diego F. M. Oliveira & Alireza Sahami Shirazi & Alessandro Flammini & Filippo Menczer, 2019. "Retraction Note: Limited individual attention and online virality of low-quality information," Nature Human Behaviour, Nature, vol. 3(1), pages 102-102, January.
    13. D.R. Cox, 2015. "Big data and precision," Biometrika, Biometrika Trust, vol. 102(3), pages 712-716.
    14. Quinn McNemar, 1947. "Note on the sampling error of the difference between correlated proportions or percentages," Psychometrika, Springer;The Psychometric Society, vol. 12(2), pages 153-157, June.
    15. Rosenbaum, Paul R., 2007. "Interference Between Units in Randomized Experiments," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 191-200, March.
    16. Racine, Jeff, 2000. "Consistent cross-validatory model-selection for dependent data: hv-block cross-validation," Journal of Econometrics, Elsevier, vol. 99(1), pages 39-61, November.
    17. Richardson, Eugene T. & Morrow, Carl D. & Ho, Theodore & Fürst, Nicole & Cohelia, Rebekkah & Tram, Khai Hoan & Farmer, Paul E. & Wood, Robin, 2016. "Forced removals embodied as tuberculosis," Social Science & Medicine, Elsevier, vol. 161(C), pages 13-18.
    18. Laura J. van 't Veer & Hongyue Dai & Marc J. van de Vijver & Yudong D. He & Augustinus A. M. Hart & Mao Mao & Hans L. Peterse & Karin van der Kooy & Matthew J. Marton & Anke T. Witteveen & George J. S, 2002. "Gene expression profiling predicts clinical outcome of breast cancer," Nature, Nature, vol. 415(6871), pages 530-536, January.
    19. Peter M. Aronow & Cyrus Samii, 2016. "Does Regression Produce Representative Estimates of Causal Effects?," American Journal of Political Science, John Wiley & Sons, vol. 60(1), pages 250-267, January.
    20. Anna Alexandrova, 2005. "Subjective Well-Being and Kahneman’s ‘Objective Happiness’," Journal of Happiness Studies, Springer, vol. 6(3), pages 301-324, September.
    21. Julian Reiss, 2012. "The explanation paradox," Journal of Economic Methodology, Taylor & Francis Journals, vol. 19(1), pages 43-62, March.
    22. Bouk, Dan, 2015. "How Our Days Became Numbered," University of Chicago Press Economics Books, University of Chicago Press, edition 1, number 9780226259178.
    23. Strathern, Marilyn, 1997. "‘Improving ratings’: audit in the British University system," European Review, Cambridge University Press, vol. 5(3), pages 305-321, July.
    24. Jeremy Ginsberg & Matthew H. Mohebbi & Rajan S. Patel & Lynnette Brammer & Mark S. Smolinski & Larry Brilliant, 2009. "Detecting influenza epidemics using search engine query data," Nature, Nature, vol. 457(7232), pages 1012-1014, February.
    25. Zhenglong Zhou & Chaz Firestone, 2019. "Humans can decipher adversarial images," Nature Communications, Nature, vol. 10(1), pages 1-9, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Meir Russ, 2021. "The Individual and the Organizational Model of Quantum Decision-Making and Learning: An Introduction and the Application of the Quadruple Loop Learning," Merits, MDPI, vol. 1(1), pages 1-13, June.
    2. Merel Noorman & Brenda Espinosa Apráez & Saskia Lavrijssen, 2023. "AI and Energy Justice," Energies, MDPI, vol. 16(5), pages 1-16, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nicolaj N. Mühlbach, 2020. "Tree-based Synthetic Control Methods: Consequences of moving the US Embassy," CREATES Research Papers 2020-04, Department of Economics and Business Economics, Aarhus University.
    2. Nicolaj S{o}ndergaard Muhlbach & Mikkel Slot Nielsen, 2019. "Tree-based Synthetic Control Methods: Consequences of moving the US Embassy," Papers 1909.03968, arXiv.org, revised Feb 2021.
    3. Lundberg, Ian & Brand, Jennie E. & Jeon, Nanum, 2022. "Researcher reasoning meets computational capacity: Machine learning for social science," SocArXiv s5zc8, Center for Open Science.
    4. Naguib, Costanza, 2019. "Estimating the Heterogeneous Impact of the Free Movement of Persons on Relative Wage Mobility," Economics Working Paper Series 1903, University of St. Gallen, School of Economics and Political Science.
    5. Gary S. Anderson & Alena Audzeyeva, 2019. "A Coherent Framework for Predicting Emerging Market Credit Spreads with Support Vector Regression," Finance and Economics Discussion Series 2019-074, Board of Governors of the Federal Reserve System (U.S.).
    6. Edward L. Glaeser & Scott Duke Kominers & Michael Luca & Nikhil Naik, 2018. "Big Data And Big Cities: The Promises And Limitations Of Improved Measures Of Urban Life," Economic Inquiry, Western Economic Association International, vol. 56(1), pages 114-137, January.
    7. McFall, Liz, 2015. "Is digital disruption the end of health insurance? Some thoughts on the devising of risk," economic sociology. perspectives and conversations, Max Planck Institute for the Study of Societies, vol. 17(1), pages 32-44.
    8. Evangelos Spiliotis & Fotios Petropoulos & Vassilios Assimakopoulos, 2023. "On the Disagreement of Forecasting Model Selection Criteria," Forecasting, MDPI, vol. 5(2), pages 1-12, June.
    9. Massimiliano Caporin & Francesco Poli, 2017. "Building News Measures from Textual Data and an Application to Volatility Forecasting," Econometrics, MDPI, vol. 5(3), pages 1-46, August.
    10. Falco J. Bargagli Stoffi & Kenneth De Beckker & Joana E. Maldonado & Kristof De Witte, 2021. "Assessing Sensitivity of Machine Learning Predictions.A Novel Toolbox with an Application to Financial Literacy," Papers 2102.04382, arXiv.org.
    11. Nicolas Brisset, 2018. "Models as speech acts: the telling case of financial models," Journal of Economic Methodology, Taylor & Francis Journals, vol. 25(1), pages 21-41, January.
    12. Zhaonan Qu & Ruoxuan Xiong & Jizhou Liu & Guido Imbens, 2021. "Semiparametric Estimation of Treatment Effects in Observational Studies with Heterogeneous Partial Interference," Papers 2107.12420, arXiv.org, revised Jun 2024.
    13. Andree,Bo Pieter Johannes & Chamorro Elizondo,Andres Fernando & Kraay,Aart C. & Spencer,Phoebe Girouard & Wang,Dieter, 2020. "Predicting Food Crises," Policy Research Working Paper Series 9412, The World Bank.
    14. repec:bny:wpaper:0077 is not listed on IDEAS
    15. Huntington-Klein Nick, 2020. "Instruments with Heterogeneous Effects: Bias, Monotonicity, and Localness," Journal of Causal Inference, De Gruyter, vol. 8(1), pages 182-208, January.
    16. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    17. Md Saiful Islam & Md Sarowar Morshed & Gary J Young & Md Noor-E-Alam, 2019. "Robust policy evaluation from large-scale observational studies," PLOS ONE, Public Library of Science, vol. 14(10), pages 1-19, October.
    18. Adriano Koshiyama & Nick Firoozye & Philip Treleaven, 2021. "Generative adversarial networks for financial trading strategies fine-tuning and combination," Quantitative Finance, Taylor & Francis Journals, vol. 21(5), pages 797-813, May.
    19. Filip Stanek, 2021. "Optimal Out-of-Sample Forecast Evaluation under Stationarity," CERGE-EI Working Papers wp712, The Center for Economic Research and Graduate Education - Economics Institute, Prague.
    20. Michael Joffe, 2017. "Causal theories, models and evidence in economics—some reflections from the natural sciences," Cogent Economics & Finance, Taylor & Francis Journals, vol. 5(1), pages 1280983-128, January.
    21. O. Didkovskyi & N. Jean & G. Le Pera & C. Nordio, 2024. "Cross-Domain Behavioral Credit Modeling: transferability from private to central data," Papers 2401.09778, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2002.05193. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.