IDEAS home Printed from https://ideas.repec.org/a/ibn/ijspjl/v11y2022i6p12.html
   My bibliography  Save this article

Combining Correlated P-values From Primary Data Analyses

Author

Listed:
  • Jai Won Choi
  • Balgobin Nandram
  • Boseung Choi

Abstract

Research results on the same subject, extracted from scientific papers or clinical trials, are combined to determine a consensus. We are primarily concerned with combining p-values from experiments that may be correlated. We have two methods, a non-Bayesian method and a Bayesian method. We use a model to combine these results and assume the combined results follow a certain distribution, for example, chi-square or normal. The distribution requires independent and identically distributed (iid) random variables. When the data are correlated or non-iid, we cannot assume such distribution. In order to do so, the combined results from the model need to be adjusted, and the adjustment is done “indirectly” through two test statistics. Specifically, one test statistic (TS** ) is obtained for the non-iid data and the other is the test statistic (TS) is obtained for iid data. We use the ratio between the two test statistics to adjust the model test statistic (TS**) for its non-iid violation. The adjusted TS** is named as “effective test statistics” (ETS), which is then used for statistical inferences with the assumed distribution. As it is difficult to estimate the correlation, to provide a more coherent method for combining p-values, we also introduce a novel Bayesian method for both iid data and non-iid data. The examples are used to illustrate the non-Bayesian method and additional examples are given to illustrate the Bayesian method.

Suggested Citation

  • Jai Won Choi & Balgobin Nandram & Boseung Choi, 2022. "Combining Correlated P-values From Primary Data Analyses," International Journal of Statistics and Probability, Canadian Center of Science and Education, vol. 11(6), pages 1-12, November.
  • Handle: RePEc:ibn:ijspjl:v:11:y:2022:i:6:p:12
    as

    Download full text from publisher

    File URL: https://ccsenet.org/journal/index.php/ijsp/article/download/0/0/47906/51443
    Download Restriction: no

    File URL: https://ccsenet.org/journal/index.php/ijsp/article/view/0/47906
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Jai Won Choi & Balgobin Nandram, 2021. "Large Sample Problems," International Journal of Statistics and Probability, Canadian Center of Science and Education, vol. 10(2), pages 1-81, March.
    2. Hari K. Iyer & C.M. Jack Wang & Thomas Mathew, 2004. "Models and Confidence Intervals for True Values in Interlaboratory Trials," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 1060-1071, December.
    3. Loughin, Thomas M., 2004. "A systematic comparison of methods for combining p-values from independent tests," Computational Statistics & Data Analysis, Elsevier, vol. 47(3), pages 467-485, October.
    4. Balgobin Nandram & Jai Won Choi & Yang Liu, 2021. "Integration of Nonprobability and Probability Samples via Survey Weights," International Journal of Statistics and Probability, Canadian Center of Science and Education, vol. 10(6), pages 1-5, December.
    5. N A Heard & P Rubin-Delanchy, 2018. "Choosing between methods of combining $p$-values," Biometrika, Biometrika Trust, vol. 105(1), pages 239-246.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Song, Zhi & Mukherjee, Amitava & Zhang, Jiujun, 2021. "Some robust approaches based on copula for monitoring bivariate processes and component-wise assessment," European Journal of Operational Research, Elsevier, vol. 289(1), pages 177-196.
    2. Marco Marozzi, 2012. "A combined test for differences in scale based on the interquantile range," Statistical Papers, Springer, vol. 53(1), pages 61-72, February.
    3. Li, Xinmin & Wang, Juan & Liang, Hua, 2011. "Comparison of several means: A fiducial based approach," Computational Statistics & Data Analysis, Elsevier, vol. 55(5), pages 1993-2002, May.
    4. Alexander Kaever & Manuel Landesfeind & Kirstin Feussner & Burkhard Morgenstern & Ivo Feussner & Peter Meinicke, 2014. "Meta-Analysis of Pathway Enrichment: Combining Independent and Dependent Omics Data Sets," PLOS ONE, Public Library of Science, vol. 9(2), pages 1-12, February.
    5. Lan Cheng & Xuguang Simon Sheng, 2017. "Combination of “combinations of p values”," Empirical Economics, Springer, vol. 53(1), pages 329-350, August.
    6. Tian, Lili, 2006. "Testing equality of inverse Gaussian means under heterogeneity, based on generalized test variable," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 1156-1162, November.
    7. Doyle, John R. & Chen, Catherine H., 2013. "Patterns in stock market movements tested as random number generators," European Journal of Operational Research, Elsevier, vol. 227(1), pages 122-132.
    8. Yu, Xiufan & Yao, Jiawei & Xue, Lingzhou, 2024. "Power enhancement for testing multi-factor asset pricing models via Fisher’s method," Journal of Econometrics, Elsevier, vol. 239(2).
    9. Xuguang Sheng & Jingyun Yang, 2013. "Truncated Product Methods for Panel Unit Root Tests," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 75(4), pages 624-636, August.
    10. Xiong, Peihan & Hu, Taizhong, 2022. "On Samuel’s p-value model and the Simes test under dependence," Statistics & Probability Letters, Elsevier, vol. 187(C).
    11. Yoav Benjamini & Ruth Heller, 2008. "Screening for Partial Conjunction Hypotheses," Biometrics, The International Biometric Society, vol. 64(4), pages 1215-1222, December.
    12. Zimmermann, Paul, 2021. "The role of the leverage effect in the price discovery process of credit markets," Journal of Economic Dynamics and Control, Elsevier, vol. 122(C).
    13. Paulo C. Rodrigues & Vanda M. Lourenço, 2020. "Comments on: Hierarchical Inference for genome-wide association studies: a view on methodology with software by Paulo C. Rodrigues and Vanda M. Lourenço," Computational Statistics, Springer, vol. 35(1), pages 57-58, March.
    14. Juan Antonio Villatoro-García & Jordi Martorell-Marugán & Daniel Toro-Domínguez & Yolanda Román-Montoya & Pedro Femia & Pedro Carmona-Sáez, 2022. "DExMA: An R Package for Performing Gene Expression Meta-Analysis with Missing Genes," Mathematics, MDPI, vol. 10(18), pages 1-15, September.
    15. Xin Yuan & Yanran Ma & Ruitian Gao & Shuya Cui & Yifan Wang & Botao Fa & Shiyang Ma & Ting Wei & Shuangge Ma & Zhangsheng Yu, 2024. "HEARTSVG: a fast and accurate method for identifying spatially variable genes in large-scale spatial transcriptomics," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    16. Kechris Katerina J & Biehs Brian & Kornberg Thomas B, 2010. "Generalizing Moving Averages for Tiling Arrays Using Combined P-Value Statistics," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 9(1), pages 1-31, August.
    17. Gunasekera, Sumith, 2018. "Inference for the Burr XII reliability under progressive censoring with random removals," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 144(C), pages 182-195.
    18. Chen, Zhongxue & Nadarajah, Saralees, 2014. "On the optimally weighted z-test for combining probabilities from independent studies," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 387-394.
    19. Patrick B. Langthaler & Riccardo Ceccato & Luigi Salmaso & Rosa Arboretti & Arne C. Bathke, 2023. "Permutation testing for thick data when the number of variables is much greater than the sample size: recent developments and some recommendations," Computational Statistics, Springer, vol. 38(1), pages 101-132, March.
    20. Sexton, Joseph & Blomhoff, Rune & Karlsen, Anette & Laake, Petter, 2012. "Adaptive combination of dependent tests," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1935-1943.

    More about this item

    JEL classification:

    • R00 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General - - - General
    • Z0 - Other Special Topics - - General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ibn:ijspjl:v:11:y:2022:i:6:p:12. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Canadian Center of Science and Education (email available below). General contact details of provider: https://edirc.repec.org/data/cepflch.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.