IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i10p1741-d819173.html
   My bibliography  Save this article

A Two-Sample Test of High Dimensional Means Based on Posterior Bayes Factor

Author

Listed:
  • Yuanyuan Jiang

    (School of Mathematics and Statistics, Beijing Institute of Technology, Beijing 100081, China)

  • Xingzhong Xu

    (School of Mathematics and Statistics, Beijing Institute of Technology, Beijing 100081, China
    Beijing Key Laboratory on MCAACI, Beijing Institute of Technology, Beijing 100081, China)

Abstract

In classical statistics, the primary test statistic is the likelihood ratio. However, for high dimensional data, the likelihood ratio test is no longer effective and sometimes does not work altogether. By replacing the maximum likelihood with the integral of the likelihood, the Bayes factor is obtained. The posterior Bayes factor is the ratio of the integrals of the likelihood function with respect to the posterior. In this paper, we investigate the performance of the posterior Bayes factor in high dimensional hypothesis testing through the problem of testing the equality of two multivariate normal mean vectors. The asymptotic normality of the linear function of the logarithm of the posterior Bayes factor is established. Then we construct a test with an asymptotically nominal significance level. The asymptotic power of the test is also derived. Simulation results and an application example are presented, which show good performance of the test. Hence, taking the posterior Bayes factor as a statistic in high dimensional hypothesis testing is a reasonable methodology.

Suggested Citation

  • Yuanyuan Jiang & Xingzhong Xu, 2022. "A Two-Sample Test of High Dimensional Means Based on Posterior Bayes Factor," Mathematics, MDPI, vol. 10(10), pages 1-23, May.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:10:p:1741-:d:819173
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/10/1741/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/10/1741/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Thulin, Måns, 2014. "A high-dimensional two-sample test for the mean using random subspaces," Computational Statistics & Data Analysis, Elsevier, vol. 74(C), pages 26-38.
    2. Srivastava, Muni S., 2009. "A test for the mean vector with fewer observations than the dimension under non-normality," Journal of Multivariate Analysis, Elsevier, vol. 100(3), pages 518-532, March.
    3. Roger S. Zoh & Abhra Sarkar & Raymond J. Carroll & Bani K. Mallick, 2018. "A Powerful Bayesian Test for Equality of Means in High Dimensions," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(524), pages 1733-1741, October.
    4. Chen, Song Xi & Qin, Yingli, 2010. "A Two Sample Test for High Dimensional Data with Applications to Gene-set Testing," MPRA Paper 59642, University Library of Munich, Germany.
    5. T. Tony Cai & Weidong Liu & Yin Xia, 2014. "Two-sample test of high dimensional means under dependence," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(2), pages 349-372, March.
    6. Karl Bruce Gregory & Raymond J. Carroll & Veerabhadran Baladandayuthapani & Soumendra N. Lahiri, 2015. "A Two-Sample Test for Equality of Means in High Dimension," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 837-849, June.
    7. Zhang, Liang & Zhu, Tianming & Zhang, Jin-Ting, 2020. "A Simple Scale-Invariant Two-Sample Test for High-dimensional Data," Econometrics and Statistics, Elsevier, vol. 14(C), pages 131-144.
    8. Wang, Rui & Xu, Xingzhong, 2018. "On two-sample mean tests under spiked covariances," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 225-249.
    9. Jin-Ting Zhang & Jia Guo & Bu Zhou & Ming-Yen Cheng, 2020. "A Simple Two-Sample Test in High Dimensions Based on L2-Norm," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(530), pages 1011-1027, April.
    10. Srivastava, Muni S. & Du, Meng, 2008. "A test for the mean vector with fewer observations than the dimension," Journal of Multivariate Analysis, Elsevier, vol. 99(3), pages 386-402, March.
    11. Srivastava, Muni S. & Katayama, Shota & Kano, Yutaka, 2013. "A two sample test in high dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 114(C), pages 349-358.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Harrar, Solomon W. & Kong, Xiaoli, 2022. "Recent developments in high-dimensional inference for multivariate data: Parametric, semiparametric and nonparametric approaches," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    2. Huang, Yuan & Li, Changcheng & Li, Runze & Yang, Songshan, 2022. "An overview of tests on high-dimensional means," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    3. Zhang, Huaiyu & Wang, Haiyan, 2021. "A more powerful test of equality of high-dimensional two-sample means," Computational Statistics & Data Analysis, Elsevier, vol. 164(C).
    4. Tzviel Frostig & Yoav Benjamini, 2022. "Testing the equality of multivariate means when $$p>n$$ p > n by combining the Hotelling and Simes tests," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(2), pages 390-415, June.
    5. Feng, Long & Sun, Fasheng, 2015. "A note on high-dimensional two-sample test," Statistics & Probability Letters, Elsevier, vol. 105(C), pages 29-36.
    6. Zhang, Jin-Ting & Guo, Jia & Zhou, Bu, 2017. "Linear hypothesis testing in high-dimensional one-way MANOVA," Journal of Multivariate Analysis, Elsevier, vol. 155(C), pages 200-216.
    7. Ghosh, Santu & Ayyala, Deepak Nag & Hellebuyck, Rafael, 2021. "Two-sample high dimensional mean test based on prepivots," Computational Statistics & Data Analysis, Elsevier, vol. 163(C).
    8. Zhang, Jin-Ting & Zhu, Tianming, 2022. "A new normal reference test for linear hypothesis testing in high-dimensional heteroscedastic one-way MANOVA," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    9. Zhao, Junguang & Xu, Xingzhong, 2016. "A generalized likelihood ratio test for normal mean when p is greater than n," Computational Statistics & Data Analysis, Elsevier, vol. 99(C), pages 91-104.
    10. Ayyala, Deepak Nag & Park, Junyong & Roy, Anindya, 2017. "Mean vector testing for high-dimensional dependent observations," Journal of Multivariate Analysis, Elsevier, vol. 153(C), pages 136-155.
    11. Li, Yang & Wang, Zhaojun & Zou, Changliang, 2016. "A simpler spatial-sign-based two-sample test for high-dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 149(C), pages 192-198.
    12. Jiang Hu & Zhidong Bai & Chen Wang & Wei Wang, 2017. "On testing the equality of high dimensional mean vectors with unequal covariance matrices," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(2), pages 365-387, April.
    13. Zhang, Jie & Pan, Meng, 2016. "A high-dimension two-sample test for the mean using cluster subspaces," Computational Statistics & Data Analysis, Elsevier, vol. 97(C), pages 87-97.
    14. Dong, Kai & Pang, Herbert & Tong, Tiejun & Genton, Marc G., 2016. "Shrinkage-based diagonal Hotelling’s tests for high-dimensional small sample size data," Journal of Multivariate Analysis, Elsevier, vol. 143(C), pages 127-142.
    15. Yin, Yanqing, 2021. "Test for high-dimensional mean vector under missing observations," Journal of Multivariate Analysis, Elsevier, vol. 186(C).
    16. Li, Jun, 2023. "Finite sample t-tests for high-dimensional means," Journal of Multivariate Analysis, Elsevier, vol. 196(C).
    17. Zhengbang Li & Fuxiang Liu & Luanjie Zeng & Guoxin Zuo, 2021. "A stationary bootstrap test about two mean vectors comparison with somewhat dense differences and fewer sample size than dimension," Computational Statistics, Springer, vol. 36(2), pages 941-960, June.
    18. Cai, T. Tony & Xia, Yin, 2014. "High-dimensional sparse MANOVA," Journal of Multivariate Analysis, Elsevier, vol. 131(C), pages 174-196.
    19. Amanda Plunkett & Junyong Park, 2019. "Two-sample test for sparse high-dimensional multinomial distributions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(3), pages 804-826, September.
    20. Pini, Alessia & Stamm, Aymeric & Vantini, Simone, 2018. "Hotelling’s T2 in separable Hilbert spaces," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 284-305.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:10:p:1741-:d:819173. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.