IDEAS home Printed from https://ideas.repec.org/p/osf/osfxxx/tmgqd.html
   My bibliography  Save this paper

Error statistics, Bayes-factor Tests and the Fallacy of Non-exhaustive Alternatives

Author

Listed:
  • Mayo, Deborah

Abstract

In this paper I discuss a fundamental contrast between two types of statistical tests now in use: those where the post-data inferential assessment is sensitive to the method’s error probabilities—error statistical methods (e.g., statistical significance tests), and those where it is insensitive (e.g., Bayes factors). It might be thought that if a method is insensitive to error probabilities that it escapes the inferential consequences of inflated error rates due to biasing selection effects. I will argue that this is not the case. I discuss a recent paper advocating subjective Bayes factors (BFs) by van Dongen, Sprenger, and Wagenmakers (VSW 2022). VSW claim that the comparatively more likely hypothesis H passes a stringent test, despite insensitivity to the error statistical properties of that test. I argue that the BF test rule they advocate can accord strong evidence to a claim H, even though little has been done to rule out H’s flaws. There are two reasons the BF test fails to satisfy the minimal requirement for stringency: its insensitivity to biasing selection effects, and the fact that H and its competitor need not exhaust the space of possibilities. Data can be much more probable under hypothesis H than under a chosen non-exhaustive competitor H’, even though H is poorly warranted. I will recommend VSW supplement their BF tests with a report of how severely H has passed, in the frequentist error statistical sense. I begin by responding to the criticisms VSW raise for a severe testing reformulation of statistical significance tests. A post-data severity concept can supply a transparent way for skeptical consumers, who are not steeped in technical machinery, to check if errors and biases are avoided in specific inferences that affect them.

Suggested Citation

  • Mayo, Deborah, 2024. "Error statistics, Bayes-factor Tests and the Fallacy of Non-exhaustive Alternatives," OSF Preprints tmgqd, Center for Open Science.
  • Handle: RePEc:osf:osfxxx:tmgqd
    DOI: 10.31219/osf.io/tmgqd
    as

    Download full text from publisher

    File URL: https://osf.io/download/66797cdd993d150533048c9e/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/tmgqd?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Andrew Gelman & Christian Hennig, 2017. "Beyond subjective and objective in statistics," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 180(4), pages 967-1033, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Delbianco Fernando & Tohmé Fernando, 2023. "What is a relevant control?: An algorithmic proposal," Asociación Argentina de Economía Política: Working Papers 4643, Asociación Argentina de Economía Política.
    2. Francesco De Pretis & Barbara Osimani, 2019. "New Insights in Computational Methods for Pharmacovigilance: E-Synthesis , a Bayesian Framework for Causal Assessment," IJERPH, MDPI, vol. 16(12), pages 1-19, June.
    3. Lutz Bornmann & Julian N. Marewski, 2024. "Opium in science and society: numbers and other quantifications," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(9), pages 5313-5346, September.
    4. Dimitris Korobilis & Kenichi Shimizu, 2022. "Bayesian Approaches to Shrinkage and Sparse Estimation," Foundations and Trends(R) in Econometrics, now publishers, vol. 11(4), pages 230-354, June.
    5. Elena Grimaccia & Alessia Naccarato, 2019. "Food Insecurity Individual Experience: A Comparison of Economic and Social Characteristics of the Most Vulnerable Groups in the World," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 143(1), pages 391-410, May.
    6. Jingjing He & Wei Wang & Min Huang & Shaohua Wang & Xuefei Guan, 2021. "Bayesian Inference under Small Sample Sizes Using General Noninformative Priors," Mathematics, MDPI, vol. 9(21), pages 1-20, November.
    7. Claudia Biancotti & Alfonso Rosolia & Giovanni Veronese & Robert Kirchner & Francois Mouriaux, 2021. "Covid-19 and official statistics: a wakeup call?," Questioni di Economia e Finanza (Occasional Papers) 605, Bank of Italy, Economic Research and International Relations Area.
    8. Helfgott, Ariella & Midgley, Gerald & Chaudhury, Abrar & Vervoort, Joost & Sova, Chase & Ryan, Alex, 2023. "Multi-level participation in integrative, systemic planning: The case of climate adaptation in Ghana," European Journal of Operational Research, Elsevier, vol. 309(3), pages 1201-1217.
    9. Wei-Chao Lin & Ching Chen, 2021. "Novel World University Rankings Combining Academic, Environmental and Resource Indicators," Sustainability, MDPI, vol. 13(24), pages 1-15, December.
    10. Taisei Sugiyama & Nicolas Schweighofer & Jun Izawa, 2023. "Reinforcement learning establishes a minimal metacognitive process to monitor and control motor learning performance," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    11. Jan Hannig & Hari Iyer, 2022. "Testing for calibration discrepancy of reported likelihood ratios in forensic science," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(1), pages 267-301, January.
    12. John Deke & Mariel Finucane & Daniel Thal, "undated". "The BASIE (BAyeSian Interpretation of Estimates) Framework for Interpreting Findings from Impact Evaluations: A Practical Guide for Education Researchers," Mathematica Policy Research Reports 5a0d5dff375d42048799878be, Mathematica Policy Research.
    13. Brian H. MacGillivray, 2019. "Null Hypothesis Testing ≠ Scientific Inference: A Critique of the Shaky Premise at the Heart of the Science and Values Debate, and a Defense of Value‐Neutral Risk Assessment," Risk Analysis, John Wiley & Sons, vol. 39(7), pages 1520-1532, July.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:osfxxx:tmgqd. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://osf.io/preprints/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.