IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v68y2006i3p477-493.html
   My bibliography  Save this article

Testing against a high dimensional alternative

Author

Listed:
  • Jelle J. Goeman
  • Sara A. Van De Geer
  • Hans C. Van Houwelingen

Abstract

Summary. As the dimensionality of the alternative hypothesis increases, the power of classical tests tends to diminish quite rapidly. This is especially true for high dimensional data in which there are more parameters than observations. We discuss a score test on a hyperparameter in an empirical Bayesian model as an alternative to classical tests. It gives a general test statistic which can be used to test a point null hypothesis against a high dimensional alternative, even when the number of parameters exceeds the number of samples. This test will be shown to have optimal power on average in a neighbourhood of the null hypothesis, which makes it a proper generalization of the locally most powerful test to multiple dimensions. To illustrate this new locally most powerful test we investigate the case of testing the global null hypothesis in a linear regression model in more detail. The score test is shown to have significantly more power than the F‐test whenever under the alternative the large variance principal components of the design matrix explain substantially more of the variance of the outcome than do the small variance principal components. The score test is also useful for detecting sparse alternatives in truly high dimensional data, where its power is comparable with the test based on the maximum absolute t‐statistic.

Suggested Citation

  • Jelle J. Goeman & Sara A. Van De Geer & Hans C. Van Houwelingen, 2006. "Testing against a high dimensional alternative," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(3), pages 477-493, June.
  • Handle: RePEc:bla:jorssb:v:68:y:2006:i:3:p:477-493
    DOI: 10.1111/j.1467-9868.2006.00551.x
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1467-9868.2006.00551.x
    Download Restriction: no

    File URL: https://libkey.io/10.1111/j.1467-9868.2006.00551.x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wang, Siyang & Cui, Hengjian, 2015. "A new test for part of high dimensional regression coefficients," Journal of Multivariate Analysis, Elsevier, vol. 137(C), pages 187-203.
    2. Hong Guo & Changliang Zou & Zhaojun Wang & Bin Chen, 2014. "Empirical likelihood for high-dimensional linear regression models," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 77(7), pages 921-945, October.
    3. Zihuai He & Min Zhang & Seunggeun Lee & Jennifer A. Smith & Sharon L. R. Kardia & V. Diez Roux & Bhramar Mukherjee, 2017. "Set-Based Tests for the Gene–Environment Interaction in Longitudinal Studies," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 966-978, July.
    4. Ma, Yingying & Lan, Wei & Wang, Hansheng, 2015. "Testing predictor significance with ultra high dimensional multivariate responses," Computational Statistics & Data Analysis, Elsevier, vol. 83(C), pages 275-286.
    5. Peter Jensen, 2010. "Testing the null of a low dimensional growth model," Empirical Economics, Springer, vol. 38(1), pages 193-215, February.
    6. Konstantin Glombek, 2014. "Statistical Inference for High-Dimensional Global Minimum Variance Portfolios," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 41(4), pages 845-865, December.
    7. Lebrec Jeremie J & Stijnen Theo & van Houwelingen Hans C, 2010. "Dealing with Heterogeneity between Cohorts in Genomewide SNP Association Studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 9(1), pages 1-22, January.
    8. Tianxi Cai & Giulia Tonini & Xihong Lin, 2011. "Kernel Machine Approach to Testing the Significance of Multiple Genetic Markers for Risk Prediction," Biometrics, The International Biometric Society, vol. 67(3), pages 975-986, September.
    9. Chaturvedi Nimisha & Menezes Renée X. de & Goeman Jelle J. & Wieringen Wessel van, 2018. "A test for detecting differential indirect trans effects between two groups of samples," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 17(5), pages 1-11, October.
    10. Ningning Xu & Aldo Solari & Jelle J. Goeman, 2023. "Closed testing with Globaltest, with application in metabolomics," Biometrics, The International Biometric Society, vol. 79(2), pages 1103-1113, June.
    11. Ping-Shou Zhong & Tao Hu & Jun Li, 2015. "Tests for Coefficients in High-dimensional Additive Hazard Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(3), pages 649-664, September.
    12. Long Qu & Tobias Guennel & Scott L. Marshall, 2013. "Linear Score Tests for Variance Components in Linear Mixed Models and Applications to Genetic Association Studies," Biometrics, The International Biometric Society, vol. 69(4), pages 883-892, December.
    13. Jiang Dandan & Sun Jianguo, 2017. "Group Tests for High-dimensional Failure Time Data with the Additive Hazards Models," The International Journal of Biostatistics, De Gruyter, vol. 13(1), pages 1-10, May.
    14. Zang, Yangguang & Zhang, Sanguo & Li, Qizhai & Zhang, Qingzhao, 2016. "Jackknife empirical likelihood test for high-dimensional regression coefficients," Computational Statistics & Data Analysis, Elsevier, vol. 94(C), pages 302-316.
    15. Lan, Wei & Zhong, Ping-Shou & Li, Runze & Wang, Hansheng & Tsai, Chih-Ling, 2016. "Testing a single regression coefficient in high dimensional linear models," Journal of Econometrics, Elsevier, vol. 195(1), pages 154-168.
    16. Liu, Yang & Sun, Wei & Hsu, Li & He, Qianchuan, 2022. "Statistical inference for high-dimensional pathway analysis with multiple responses," Computational Statistics & Data Analysis, Elsevier, vol. 169(C).
    17. Long Feng & Changliang Zou & Zhaojun Wang, 2016. "Multivariate-Sign-Based High-Dimensional Tests for the Two-Sample Location Problem," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 721-735, April.
    18. He, Yi & Jaidee, Sombut & Gao, Jiti, 2023. "Most powerful test against a sequence of high dimensional local alternatives," Journal of Econometrics, Elsevier, vol. 234(1), pages 151-177.
    19. Albert Vexler & Sergey Tarima, 2011. "An optimal approach for hypothesis testing in the presence of incomplete data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 63(6), pages 1141-1163, December.
    20. Kaiqiong Zhao & Karim Oualkacha & Lajmi Lakhal‐Chaieb & Aurélie Labbe & Kathleen Klein & Antonio Ciampi & Marie Hudson & Inés Colmegna & Tomi Pastinen & Tieyuan Zhang & Denise Daley & Celia M.T. Green, 2021. "A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation," Biometrics, The International Biometric Society, vol. 77(2), pages 424-438, June.
    21. Gong, Siliang & Zhang, Kai & Liu, Yufeng, 2018. "Efficient test-based variable selection for high-dimensional linear models," Journal of Multivariate Analysis, Elsevier, vol. 166(C), pages 17-31.
    22. Yi He & Sombut Jaidee & Jiti Gao, 2020. "Most Powerful Test against High Dimensional Free Alternatives," Monash Econometrics and Business Statistics Working Papers 13/20, Monash University, Department of Econometrics and Business Statistics.
    23. Rui Wang & Xingzhong Xu, 2021. "A Bayesian-motivated test for high-dimensional linear regression models with fixed design matrix," Statistical Papers, Springer, vol. 62(4), pages 1821-1852, August.
    24. Jesse Hemerik & Jelle J. Goeman & Livio Finos, 2020. "Robust testing in generalized linear models by sign flipping score contributions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(3), pages 841-864, July.
    25. Bin Guo & Song Xi Chen, 2016. "Tests for high dimensional generalized linear models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(5), pages 1079-1102, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:68:y:2006:i:3:p:477-493. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.