IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v197y2024ics0167947324000574.html
   My bibliography  Save this article

FDR control for linear log-contrast models with high-dimensional compositional covariates

Author

Listed:
  • Yuan, Panxu
  • Jin, Changhan
  • Li, Gaorong

Abstract

Linear log-contrast models have been widely used to describe the relationship between the response of interest and the compositional covariates, in which one central task is to identify the significant compositional covariates while controlling the false discovery rate (FDR) at a nominal level. To achieve this goal, a new FDR control method is proposed for linear log-contrast models with high-dimensional compositional covariates. An appealing feature of the proposed method is that it completely bypasses the traditional p-values and utilizes only the symmetry property of the test statistic for the unimportant compositional covariates to give an upper bound of the FDR. Under some regularity conditions, the FDR can be asymptotically controlled at the nominal level for the proposed method in theory, and the theoretical power is also proven to approach one as the sample size tends to infinity. The finite-sample performance of the proposed method is evaluated through extensive simulation studies, and applications to microbiome compositional datasets are also provided.

Suggested Citation

  • Yuan, Panxu & Jin, Changhan & Li, Gaorong, 2024. "FDR control for linear log-contrast models with high-dimensional compositional covariates," Computational Statistics & Data Analysis, Elsevier, vol. 197(C).
  • Handle: RePEc:eee:csdana:v:197:y:2024:i:c:s0167947324000574
    DOI: 10.1016/j.csda.2024.107973
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947324000574
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2024.107973?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Yoav Benjamini & Abba M. Krieger & Daniel Yekutieli, 2006. "Adaptive linear step-up procedures that control the false discovery rate," Biometrika, Biometrika Trust, vol. 93(3), pages 491-507, September.
    2. Dongxiao Han & Jian Huang & Yuanyuan Lin & Lei Liu & Lianqiang Qu & Liuquan Sun, 2023. "Robust Signal Recovery for High-Dimensional Linear Log-Contrast Models with Compositional Covariates," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(3), pages 957-967, July.
    3. Xianyang Zhang & Jun Chen, 2022. "Covariate Adaptive False Discovery Rate Control With Applications to Omics-Wide Multiple Testing," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 117(537), pages 411-427, January.
    4. Pixu Shi & Yuchen Zhou & Anru R Zhang, 2022. "High-dimensional log-error-in-variable regression with applications to microbial compositional data analysis [Log contrast models for experiments with mixtures]," Biometrika, Biometrika Trust, vol. 109(2), pages 405-420.
    5. Wei Lin & Pixu Shi & Rui Feng & Hongzhe Li, 2014. "Variable selection in regression with compositional covariates," Biometrika, Biometrika Trust, vol. 101(4), pages 785-797.
    6. Jiarui Lu & Pixu Shi & Hongzhe Li, 2019. "Generalized linear models with linear constraints for microbiome compositional data," Biometrics, The International Biometric Society, vol. 75(1), pages 235-244, March.
    7. Jianqing Fan & Jinchi Lv, 2008. "Sure independence screening for ultrahigh dimensional feature space," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(5), pages 849-911, November.
    8. Chenguang Dai & Buyu Lin & Xin Xing & Jun S. Liu, 2023. "False Discovery Rate Control via Data Splitting," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(544), pages 2503-2520, October.
    9. Xin Xing & Zhigen Zhao & Jun S. Liu, 2023. "Controlling False Discovery Rate Using Gaussian Mirrors," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(541), pages 222-241, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Arun Srinivasan & Lingzhou Xue & Xiang Zhan, 2021. "Compositional knockoff filter for high‐dimensional regression analysis of microbiome data," Biometrics, The International Biometric Society, vol. 77(3), pages 984-995, September.
    2. Haixiang Zhang & Jun Chen & Zhigang Li & Lei Liu, 2021. "Testing for Mediation Effect with Application to Human Microbiome Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 13(2), pages 313-328, July.
    3. Lingjing Jiang & Niina Haiminen & Anna‐Paola Carrieri & Shi Huang & Yoshiki Vázquez‐Baeza & Laxmi Parida & Ho‐Cheol Kim & Austin D. Swafford & Rob Knight & Loki Natarajan, 2022. "Utilizing stability criteria in choosing feature selection methods yields reproducible results in microbiome data," Biometrics, The International Biometric Society, vol. 78(3), pages 1155-1167, September.
    4. Jordi Saperas-Riera & Glòria Mateu-Figueras & Josep Antoni Martín-Fernández, 2024. "L p -Norm for Compositional Data: Exploring the CoDa L 1 -Norm in Penalised Regression," Mathematics, MDPI, vol. 12(9), pages 1-16, May.
    5. Jingxuan Luo & Lili Yue & Gaorong Li, 2023. "Overview of High-Dimensional Measurement Error Regression Models," Mathematics, MDPI, vol. 11(14), pages 1-22, July.
    6. Sean M Devlin & Axel Martin & Irina Ostrovnaya, 2021. "Identifying prognostic pairwise relationships among bacterial species in microbiome studies," PLOS Computational Biology, Public Library of Science, vol. 17(11), pages 1-12, November.
    7. G. S. Monti & P. Filzmoser, 2022. "Robust logistic zero-sum regression for microbiome compositional data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(2), pages 301-324, June.
    8. Shuichi Kawano, 2014. "Selection of tuning parameters in bridge regression models via Bayesian information criterion," Statistical Papers, Springer, vol. 55(4), pages 1207-1223, November.
    9. Michael L. Anderson & Fangwen Lu, 2017. "Learning to Manage and Managing to Learn: The Effects of Student Leadership Service," Management Science, INFORMS, vol. 63(10), pages 3246-3261, October.
    10. Daniel Yekutieli, 2008. "Comments on: Control of the false discovery rate under dependence using the bootstrap and subsampling," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 17(3), pages 458-460, November.
    11. Emilio Depetris-Chauvin & Ömer Özak, 2020. "The origins of the division of labor in pre-industrial times," Journal of Economic Growth, Springer, vol. 25(3), pages 297-340, September.
    12. Fernández Guerrico, Sofía, 2021. "The effects of trade-induced worker displacement on health and mortality in Mexico," Journal of Health Economics, Elsevier, vol. 80(C).
    13. Daniel Bjorkegren & Joshua Blumenstock & Omowunmi Folajimi-Senjobi & Jacqueline Mauro & Suraj R. Nair, 2022. "Instant Loans Can Lift Subjective Well-Being: A Randomized Evaluation of Digital Credit in Nigeria," Papers 2202.13540, arXiv.org.
    14. Cattaneo, Cristina & Grieco, Daniela, 2021. "Turning opposition into support to immigration: The role of narratives," Journal of Economic Behavior & Organization, Elsevier, vol. 190(C), pages 785-801.
    15. Blimpo, Moussa P. & Pugatch, Todd, 2021. "Entrepreneurship education and teacher training in Rwanda," Journal of Development Economics, Elsevier, vol. 149(C).
    16. Sauvenier, Mathieu & Van Bellegem, Sébastien, 2023. "Direction Identification and Minimax Estimation by Generalized Eigenvalue Problem in High Dimensional Sparse Regression," LIDAM Discussion Papers CORE 2023005, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    17. Girum Abebe & A Stefano Caria & Marcel Fafchamps & Paolo Falco & Simon Franklin & Simon Quinn, 2021. "Anonymity or Distance? Job Search and Labour Market Exclusion in a Growing African City [Endogenous Stratification in Randomized Experiments]," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 88(3), pages 1279-1310.
    18. Ahmed Ismaïl & Hartikainen Anna-Liisa & Järvelin Marjo-Riitta & Richardson Sylvia, 2011. "False Discovery Rate Estimation for Stability Selection: Application to Genome-Wide Association Studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-20, November.
    19. Hanck, Christoph, 2011. "Now, whose schools are really better (or weaker) than Germany's? A multiple testing approach," Economic Modelling, Elsevier, vol. 28(4), pages 1739-1746, July.
    20. Luca Barbaglia & Sergio Consoli & Sebastiano Manzan, 2024. "Forecasting GDP in Europe with textual data," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(2), pages 338-355, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:197:y:2024:i:c:s0167947324000574. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.