IDEAS home Printed from https://ideas.repec.org/a/bla/scjsta/v50y2023i1p54-88.html
   My bibliography  Save this article

Divergence versus decision P‐values: A distinction worth making in theory and keeping in practice: Or, how divergence P‐values measure evidence even when decision P‐values do not

Author

Listed:
  • Sander Greenland

Abstract

There are two distinct definitions of “P‐value” for evaluating a proposed hypothesis or model for the process generating an observed dataset. The original definition starts with a measure of the divergence of the dataset from what was expected under the model, such as a sum of squares or a deviance statistic. A P‐value is then the ordinal location of the measure in a reference distribution computed from the model and the data, and is treated as a unit‐scaled index of compatibility between the data and the model. In the other definition, a P‐value is a random variable on the unit interval whose realizations can be compared to a cutoff α to generate a decision rule with known error rates under the model and specific alternatives. It is commonly assumed that realizations of such decision P‐values always correspond to divergence P‐values. But this need not be so: Decision P‐values can violate intuitive single‐sample coherence criteria where divergence P‐values do not. It is thus argued that divergence and decision P‐values should be carefully distinguished in teaching, and that divergence P‐values are the relevant choice when the analysis goal is to summarize evidence rather than implement a decision rule.

Suggested Citation

  • Sander Greenland, 2023. "Divergence versus decision P‐values: A distinction worth making in theory and keeping in practice: Or, how divergence P‐values measure evidence even when decision P‐values do not," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 50(1), pages 54-88, March.
  • Handle: RePEc:bla:scjsta:v:50:y:2023:i:1:p:54-88
    DOI: 10.1111/sjos.12625
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/sjos.12625
    Download Restriction: no

    File URL: https://libkey.io/10.1111/sjos.12625?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. David R. Bickel, 2021. "Null Hypothesis Significance Testing Defended and Calibrated by Bayesian Model Checking," The American Statistician, Taylor & Francis Journals, vol. 75(3), pages 249-255, July.
    2. Rubin, Mark, 2021. "When to adjust alpha during multiple testing: A consideration of disjunction, conjunction, and individual testing," MetaArXiv tj6pm, Center for Open Science.
    3. Victor Fossaluza & Rafael Izbicki & Gustavo Miranda da Silva & Luís Gustavo Esteves, 2017. "Coherent Hypothesis Testing," The American Statistician, Taylor & Francis Journals, vol. 71(3), pages 242-248, July.
    4. Erik W. van Zwet & Eric A. Cator, 2021. "The significance filter, the winner's curse and the need to shrink," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 75(4), pages 437-452, November.
    5. Hubbard R. & Bayarri M.J., 2003. "Confusion Over Measures of Evidence (ps) Versus Errors (alphas) in Classical Statistical Testing," The American Statistician, American Statistical Association, vol. 57, pages 171-178, August.
    6. Glenn Shafer, 2021. "Testing by betting: A strategy for statistical and scientific communication," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(2), pages 407-431, April.
    7. Blakeley B. McShane & David Gal & Andrew Gelman & Christian Robert & Jennifer L. Tackett, 2019. "Abandon Statistical Significance," The American Statistician, Taylor & Francis Journals, vol. 73(S1), pages 235-245, March.
    8. Spencer Hansen & Ken Rice, 2023. "Coherent Tests for Interval Null Hypotheses," The American Statistician, Taylor & Francis Journals, vol. 77(1), pages 20-28, January.
    9. Valentin Amrhein & David Trafimow & Sander Greenland, 2019. "Inferential Statistics as Descriptive Statistics: There Is No Replication Crisis if We Don’t Expect Replication," The American Statistician, Taylor & Francis Journals, vol. 73(S1), pages 262-270, March.
    10. Sander Greenland, 2021. "Sander Greenland’s contribution to the Discussion of ‘Testing by betting: A strategy for statistical and scientific communication’ by Glenn Shafer," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(2), pages 450-451, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Markku Maula & Wouter Stam, 2020. "Enhancing Rigor in Quantitative Entrepreneurship Research," Entrepreneurship Theory and Practice, , vol. 44(6), pages 1059-1090, November.
    2. Austin Chia & Margaret L. Kern, 2021. "Subjective Wellbeing and the Social Responsibilities of Business: an Exploratory Investigation of Australian Perspectives," Applied Research in Quality of Life, Springer;International Society for Quality-of-Life Studies, vol. 16(5), pages 1881-1908, October.
    3. Zachary Van Winkle & Anette Fasang, 2021. "The complexity of employment and family life courses across 20th century Europe: More evidence for larger cross-national differences but little change across 1916‒1966 birth cohorts," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 44(32), pages 775-810.
    4. Patrick Vu, 2022. "Can the Replication Rate Tell Us About Publication Bias?," Papers 2206.15023, arXiv.org, revised Jul 2022.
    5. Wang, Feipeng & Wong, Wing-Keung & Wang, Zheng & Albasher, Gadah & Alsultan, Nouf & Fatemah, Ambreen, 2023. "Emerging pathways to sustainable economic development: An interdisciplinary exploration of resource efficiency, technological innovation, and ecosystem resilience in resource-rich regions," Resources Policy, Elsevier, vol. 85(PA).
    6. Rinne, Sonja, 2024. "Estimating the merit-order effect using coarsened exact matching: Reconciling theory with the empirical results to improve policy implications," Energy Policy, Elsevier, vol. 185(C).
    7. Fang, Ximeng & Goette, Lorenz & Rockenbach, Bettina & Sutter, Matthias & Tiefenbeck, Verena & Schoeb, Samuel & Staake, Thorsten, 2023. "Complementarities in behavioral interventions: Evidence from a field experiment on resource conservation," Journal of Public Economics, Elsevier, vol. 228(C).
    8. Armstrong, J. Scott, 2007. "Significance tests harm progress in forecasting," International Journal of Forecasting, Elsevier, vol. 23(2), pages 321-327.
    9. Turner, Rosanne J. & Grünwald, Peter D., 2023. "Exact anytime-valid confidence intervals for contingency tables and beyond," Statistics & Probability Letters, Elsevier, vol. 198(C).
    10. Fanelli, Daniele, 2020. "Metascientific reproducibility patterns revealed by informatic measure of knowledge," MetaArXiv 5vnhj, Center for Open Science.
    11. Diana W. Thomas & Michael D. Thomas, 2020. "Behavioral symmetry, rent seeking, and the Republic of Science," Public Choice, Springer, vol. 183(3), pages 443-459, June.
    12. G. Christopher Crawford & Vitaliy Skorodziyevskiy & Casey J. Frid & Thomas E. Nelson & Zahra Booyavi & Diana M. Hechavarria & Xuanye Li & Paul D. Reynolds & Ehsan Teymourian, 2022. "Advancing Entrepreneurship Theory Through Replication: A Case Study on Contemporary Methodological Challenges, Future Best Practices, and an Entreaty for Communality," Entrepreneurship Theory and Practice, , vol. 46(3), pages 779-799, May.
    13. Geva, Sharon & Fulman, Nir & Ben-Elia, Eran, 2022. "Getting the prices right: Drivers' cruising choices in a serious parking game," Transportation Research Part A: Policy and Practice, Elsevier, vol. 165(C), pages 54-75.
    14. Ruodu Wang & Aaditya Ramdas, 2022. "False discovery rate control with e‐values," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(3), pages 822-852, July.
    15. Bertoni, M.; & Marin-Lopez, B.A.; & Sanz-de-Galdeano, A.;, 2023. "Subjective Gender-Based Patterns in ADHD Diagnosis," Health, Econometrics and Data Group (HEDG) Working Papers 23/17, HEDG, c/o Department of Economics, University of York.
    16. Beecham, Roger & Lovelace, Robin, 2022. "A framework for inserting visually-supported inferences into geographical analysis workflow: application to road safety research," OSF Preprints mfja8, Center for Open Science.
    17. David J. Hand, 2022. "Trustworthiness of statistical inference," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(1), pages 329-347, January.
    18. Helmut Wasserbacher & Martin Spindler, 2024. "Credit Ratings: Heterogeneous Effect on Capital Structure," Papers 2406.18936, arXiv.org.
    19. William J. Matthews, 2011. "What would judgment and decision making research be like if we took a Bayesian approach to hypothesis testing?," Judgment and Decision Making, Society for Judgment and Decision Making, vol. 6(8), pages 843-856, December.
    20. Asatryan, Zareh & Havlik, Annika & Heinemann, Friedrich & Nover, Justus, 2020. "Biases in fiscal multiplier estimates," European Journal of Political Economy, Elsevier, vol. 63(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:scjsta:v:50:y:2023:i:1:p:54-88. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0303-6898 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.