IDEAS home Printed from https://ideas.repec.org/a/bla/jecsur/v36y2022i5p1251-1287.html
   My bibliography  Save this article

Statistical modeling and inference in the era of Data Science and Graphical Causal modeling

Author

Listed:
  • Aris Spanos

Abstract

The paper discusses four paradigm shifts in statistics since the 1920s with a view to compare their similarities and differences, and evaluate their effectiveness in giving rise to ‘learning from data' about phenomena of interest. The first is Fisher's 1922 recasting of Karl Pearson's descriptive statistics into a model‐based [Mθ(x)$\mathcal {M}_{{\bm {\theta }}}(\mathbf {x})$] statistical induction that dominates current statistics (frequentist and Bayesian). A crucial departure was Fisher's replacing the curve‐fitting perspective guided by goodness‐of‐fit measures with a model‐based perspective guided by the statistical adequacy: the validity of the probabilistic assumptions comprising Mθ(x)$\mathcal {M}_{{\bm {\theta }}}(\mathbf { x})$. Statistical adequacy is pivotal in securing trustworthy evidence since it underwrites the reliability of inference. The second is the nonparametric turn in the 1970s aiming to broaden Mθ(x)$\mathcal {M}_{{\bm {\theta }}}(\mathbf {x })$ by replacing its distribution assumption with weaker mathematical conditions relating to the unknown density function underlying Mθ(x)$\mathcal {M}_{ {\bm {\theta }}}(\mathbf {x})$. The third is a two‐pronged development initiated in Artificial Intelligence (AI) in the 1990s that gave rise to Data Science (DS) and Graphical Causal (GC) modeling. The primary objective of the paper is to compare and evaluate the other competing approaches with a refined/enhanced version of Fisher's model‐based approach in terms of their effectiveness in giving rise to genuine “learning from data;” excellent goodness‐of‐fit/prediction is neither necessary nor sufficient for statistical adequacy, or so it is argued.

Suggested Citation

  • Aris Spanos, 2022. "Statistical modeling and inference in the era of Data Science and Graphical Causal modeling," Journal of Economic Surveys, Wiley Blackwell, vol. 36(5), pages 1251-1287, December.
  • Handle: RePEc:bla:jecsur:v:36:y:2022:i:5:p:1251-1287
    DOI: 10.1111/joes.12483
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/joes.12483
    Download Restriction: no

    File URL: https://libkey.io/10.1111/joes.12483?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. P. Dorian Owen, 2017. "Evaluating Ingenious Instruments for Fundamental Determinants of Long-Run Economic Growth and Development," Econometrics, MDPI, vol. 5(3), pages 1-33, September.
    2. Spanos, Aris, 2010. "Statistical adequacy and the trustworthiness of empirical evidence: Statistical vs. substantive information," Economic Modelling, Elsevier, vol. 27(6), pages 1436-1452, November.
    3. Spanos,Aris, 1986. "Statistical Foundations of Econometric Modelling," Cambridge Books, Cambridge University Press, number 9780521269124, October.
    4. Aris Spanos & Anya McGuirk, 2001. "The Model Specification Problem from a Probabilistic Reduction Perspective," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 83(5), pages 1168-1176.
    5. Phillips, P.C.B., 1983. "Exact small sample theory in the simultaneous equations model," Handbook of Econometrics, in: Z. Griliches† & M. D. Intriligator (ed.), Handbook of Econometrics, edition 1, volume 1, chapter 8, pages 449-516, Elsevier.
    6. Spanos, Aris, 1990. "The simultaneous-equations model revisited : Statistical adequacy and identification," Journal of Econometrics, Elsevier, vol. 44(1-2), pages 87-105.
    7. Hendry, David F., 1976. "The structure of simultaneous equations estimators," Journal of Econometrics, Elsevier, vol. 4(1), pages 51-88, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Aris Spanos, 2016. "Transforming structural econometrics: substantive vs. statistical premises of inference," Review of Political Economy, Taylor & Francis Journals, vol. 28(3), pages 426-437, July.
    2. Niraj Poudyal & Aris Spanos, 2022. "Model Validation and DSGE Modeling," Econometrics, MDPI, vol. 10(2), pages 1-25, April.
    3. Spanos, Aris, 2010. "Akaike-type criteria and the reliability of inference: Model selection versus statistical model specification," Journal of Econometrics, Elsevier, vol. 158(2), pages 204-220, October.
    4. Aris Spanos, 2009. "Statistical Misspecification and the Reliability of Inference: The Simple T-Test in the Presence of Markov Dependence," Korean Economic Review, Korean Economic Association, vol. 25, pages 165-213.
    5. Aris Spanos, 2018. "Mis†Specification Testing In Retrospect," Journal of Economic Surveys, Wiley Blackwell, vol. 32(2), pages 541-577, April.
    6. Aris Spanos, 2022. "Frequentist Model-based Statistical Induction and the Replication Crisis," Journal of Quantitative Economics, Springer;The Indian Econometric Society (TIES), vol. 20(1), pages 133-159, September.
    7. David F. Hendry, 2011. "Empirical Economic Model Discovery and Theory Evaluation," Rationality, Markets and Morals, Frankfurt School Verlag, Frankfurt School of Finance & Management, vol. 2(46), October.
    8. Spanos, Aris, 2010. "Statistical adequacy and the trustworthiness of empirical evidence: Statistical vs. substantive information," Economic Modelling, Elsevier, vol. 27(6), pages 1436-1452, November.
    9. Owen, P. Dorian, 2018. "Replication to assess statistical adequacy," Economics - The Open-Access, Open-Assessment E-Journal (2007-2020), Kiel Institute for the World Economy (IfW Kiel), vol. 12, pages 1-16.
    10. Charles G. Renfro, 2009. "The Practice of Econometric Theory," Advanced Studies in Theoretical and Applied Econometrics, Springer, number 978-3-540-75571-5, July-Dece.
    11. Aris Spanos, 2021. "Yule–Simpson’s paradox: the probabilistic versus the empirical conundrum," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(2), pages 605-635, June.
    12. Aris Spanos & Niki Papadopoulou, 2013. "A Small Macroeconometric Model for the Cyprus Economy," Working Papers 2013-2, Central Bank of Cyprus.
    13. Spanos, Aris, 2009. "The Pre-Eminence of Theory versus the European CVAR Perspective in Macroeconometric Modeling," Economics - The Open-Access, Open-Assessment E-Journal (2007-2020), Kiel Institute for the World Economy (IfW Kiel), vol. 3, pages 1-14.
    14. Spanos, Aris, 1989. "On Rereading Haavelmo: A Retrospective View of Econometric Modeling," Econometric Theory, Cambridge University Press, vol. 5(3), pages 405-429, December.
    15. Ericsson, Neil R. & Maasoumi, Esfandiar & Mizon, Grayham E., 2001. "A retrospective on J.D. Sargan and his contribution to Econometrics," Discussion Paper Series In Economics And Econometrics 0108, Economics Division, School of Social Sciences, University of Southampton.
    16. Jaime Ortiz, 1999. "The role of interest groups in agricultural policy design: Chile 1960-1988," Journal of International Development, John Wiley & Sons, Ltd., vol. 11(2), pages 241-258.
    17. Francisco Estrada & Víctor Guerrero & Carlos Gay-García & Benjamín Martínez-López, 2013. "A cautionary note on automated statistical downscaling methods for climate change," Climatic Change, Springer, vol. 120(1), pages 263-276, September.
    18. Neil R. Ericsson & Esfandiar Maasoumi & Grayham E. Mizon, 2001. "A retrospective on J. Denis Sargan and his contributions to econometrics," International Finance Discussion Papers 700, Board of Governors of the Federal Reserve System (U.S.).
    19. Anya McGuirk & Aris Spanos, 2009. "Revisiting Error‐Autocorrelation Correction: Common Factor Restrictions and Granger Non‐Causality," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 71(2), pages 273-294, April.
    20. Aris Spanos, 2006. "Revisiting the omitted variables argument: Substantive vs. statistical adequacy," Journal of Economic Methodology, Taylor & Francis Journals, vol. 13(2), pages 179-218.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jecsur:v:36:y:2022:i:5:p:1251-1287. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0950-0804 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.