IDEAS home Printed from https://ideas.repec.org/a/taf/jnlasa/v115y2020i529p393-402.html
   My bibliography  Save this article

Cauchy Combination Test: A Powerful Test With Analytic p-Value Calculation Under Arbitrary Dependency Structures

Author

Listed:
  • Yaowu Liu
  • Jun Xie

Abstract

Abstract–Combining individual p-values to aggregate multiple small effects has a long-standing interest in statistics, dating back to the classic Fisher’s combination test. In modern large-scale data analysis, correlation and sparsity are common features and efficient computation is a necessary requirement for dealing with massive data. To overcome these challenges, we propose a new test that takes advantage of the Cauchy distribution. Our test statistic has a simple form and is defined as a weighted sum of Cauchy transformation of individual p-values. We prove a nonasymptotic result that the tail of the null distribution of our proposed test statistic can be well approximated by a Cauchy distribution under arbitrary dependency structures. Based on this theoretical result, the p-value calculation of our proposed test is not only accurate, but also as simple as the classic z-test or t-test, making our test well suited for analyzing massive data. We further show that the power of the proposed test is asymptotically optimal in a strong sparsity setting. Extensive simulations demonstrate that the proposed test has both strong power against sparse alternatives and a good accuracy with respect to p-value calculations, especially for very small p-values. The proposed test has also been applied to a genome-wide association study of Crohn’s disease and compared with several existing tests. Supplementary materials for this article are available online.

Suggested Citation

  • Yaowu Liu & Jun Xie, 2020. "Cauchy Combination Test: A Powerful Test With Analytic p-Value Calculation Under Arbitrary Dependency Structures," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(529), pages 393-402, January.
  • Handle: RePEc:taf:jnlasa:v:115:y:2020:i:529:p:393-402
    DOI: 10.1080/01621459.2018.1554485
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/01621459.2018.1554485
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/01621459.2018.1554485?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. David Ardia & S'ebastien Laurent & Rosnel Sessinou, 2024. "High-Dimensional Mean-Variance Spanning Tests," Papers 2403.17127, arXiv.org.
    2. Xiong, Peihan & Hu, Taizhong, 2022. "On Samuel’s p-value model and the Simes test under dependence," Statistics & Probability Letters, Elsevier, vol. 187(C).
    3. Chen Wang & Tianying Wang & Krzysztof Kiryluk & Ying Wei & Hugues Aschard & Iuliana Ionita-Laza, 2024. "Genome-wide discovery for biomarkers using quantile regression at biobank scale," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    4. Choi, Woohyun & Kim, Ilmun, 2023. "Averaging p-values under exchangeability," Statistics & Probability Letters, Elsevier, vol. 194(C).
    5. Nabil Bouamara & S'ebastien Laurent & Shuping Shi, 2023. "Sequential Cauchy Combination Test for Multiple Testing Problems with Financial Applications," Papers 2303.13406, arXiv.org, revised Jun 2023.
    6. Zichen Zhang & Ye Eun Bae & Jonathan R. Bradley & Lang Wu & Chong Wu, 2022. "SUMMIT: An integrative approach for better transcriptomic data imputation improves causal gene identification," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    7. Haque Md Rejuan & Kubatko Laura, 2024. "A global test of hybrid ancestry from genome-scale data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 23(1), pages 1-18, January.
    8. Juan Antonio Villatoro-García & Jordi Martorell-Marugán & Daniel Toro-Domínguez & Yolanda Román-Montoya & Pedro Femia & Pedro Carmona-Sáez, 2022. "DExMA: An R Package for Performing Gene Expression Meta-Analysis with Missing Genes," Mathematics, MDPI, vol. 10(18), pages 1-15, September.
    9. Xiaoyu Song & Jiayi Ji & Joseph H. Rothstein & Stacey E. Alexeeff & Lori C. Sakoda & Adriana Sistig & Ninah Achacoso & Eric Jorgenson & Alice S. Whittemore & Robert J. Klein & Laurel A. Habel & Pei Wa, 2023. "MiXcan: a framework for cell-type-aware transcriptome-wide association studies with an application to breast cancer," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    10. Joaquim Fernando Pinto da Costa & Manuel Cabral, 2022. "Statistical Methods with Applications in Data Mining: A Review of the Most Recent Works," Mathematics, MDPI, vol. 10(6), pages 1-22, March.
    11. Yuyu Chen & Ruodu Wang, 2024. "Infinite-mean models in risk management: Discussions and recent advances," Papers 2408.08678, arXiv.org, revised Oct 2024.
    12. Remo Monti & Pia Rautenstrauch & Mahsa Ghanbari & Alva Rani James & Matthias Kirchler & Uwe Ohler & Stefan Konigorski & Christoph Lippert, 2022. "Identifying interpretable gene-biomarker associations with functionally informed kernel-based tests in 190,000 exomes," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    13. Hong Zhang & Zheyang Wu, 2023. "The generalized Fisher's combination and accurate p‐value calculation under dependence," Biometrics, The International Biometric Society, vol. 79(2), pages 1159-1172, June.
    14. William R. Reay & Dylan J. Kiltschewskij & Maria A. Biase & Zachary F. Gerring & Kousik Kundu & Praveen Surendran & Laura A. Greco & Erin D. Clarke & Clare E. Collins & Alison M. Mondul & Demetrius Al, 2024. "Genetic influences on circulating retinol and its relationship to human health," Nature Communications, Nature, vol. 15(1), pages 1-20, December.
    15. William R. Reay & Michael P. Geaghan & Murray J. Cairns, 2022. "The genetic architecture of pneumonia susceptibility implicates mucin biology and a relationship with psychiatric illness," Nature Communications, Nature, vol. 13(1), pages 1-16, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:jnlasa:v:115:y:2020:i:529:p:393-402. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/UASA20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.