IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1000352.html
   My bibliography  Save this article

Statistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples

Author

Listed:
  • James Robert White
  • Niranjan Nagarajan
  • Mihai Pop

Abstract

Numerous studies are currently underway to characterize the microbial communities inhabiting our world. These studies aim to dramatically expand our understanding of the microbial biosphere and, more importantly, hope to reveal the secrets of the complex symbiotic relationship between us and our commensal bacterial microflora. An important prerequisite for such discoveries are computational tools that are able to rapidly and accurately compare large datasets generated from complex bacterial communities to identify features that distinguish them.We present a statistical method for comparing clinical metagenomic samples from two treatment populations on the basis of count data (e.g. as obtained through sequencing) to detect differentially abundant features. Our method, Metastats, employs the false discovery rate to improve specificity in high-complexity environments, and separately handles sparsely-sampled features using Fisher's exact test. Under a variety of simulations, we show that Metastats performs well compared to previously used methods, and significantly outperforms other methods for features with sparse counts. We demonstrate the utility of our method on several datasets including a 16S rRNA survey of obese and lean human gut microbiomes, COG functional profiles of infant and mature gut microbiomes, and bacterial and viral metabolic subsystem data inferred from random sequencing of 85 metagenomes. The application of our method to the obesity dataset reveals differences between obese and lean subjects not reported in the original study. For the COG and subsystem datasets, we provide the first statistically rigorous assessment of the differences between these populations. The methods described in this paper are the first to address clinical metagenomic datasets comprising samples from multiple subjects. Our methods are robust across datasets of varied complexity and sampling level. While designed for metagenomic applications, our software can also be applied to digital gene expression studies (e.g. SAGE). A web server implementation of our methods and freely available source code can be found at http://metastats.cbcb.umd.edu/.Author Summary: The emerging field of metagenomics aims to understand the structure and function of microbial communities solely through DNA analysis. Current metagenomics studies comparing communities resemble large-scale clinical trials with multiple subjects from two general populations (e.g. sick and healthy). To improve analyses of this type of experimental data, we developed a statistical methodology for detecting differentially abundant features between microbial communities, that is, features that are enriched or depleted in one population versus another. We show our methods are applicable to various metagenomic data ranging from taxonomic information to functional annotations. We also provide an assessment of taxonomic differences in gut microbiota between lean and obese humans, as well as differences between the functional capacities of mature and infant gut microbiomes, and those of microbial and viral metagenomes. Our methods are the first to statistically address differential abundance in comparative metagenomics studies with multiple subjects, and we hope will give researchers a more complete picture of how exactly two environments differ.

Suggested Citation

  • James Robert White & Niranjan Nagarajan & Mihai Pop, 2009. "Statistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples," PLOS Computational Biology, Public Library of Science, vol. 5(4), pages 1-11, April.
  • Handle: RePEc:plo:pcbi00:1000352
    DOI: 10.1371/journal.pcbi.1000352
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1000352
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1000352&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1000352?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Peter J. Turnbaugh & Ruth E. Ley & Michael A. Mahowald & Vincent Magrini & Elaine R. Mardis & Jeffrey I. Gordon, 2006. "An obesity-associated gut microbiome with increased capacity for energy harvest," Nature, Nature, vol. 444(7122), pages 1027-1031, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhenqiu Liu & Dechang Chen & Li Sheng & Amy Y Liu, 2013. "Class Prediction and Feature Selection with Linear Optimization for Metagenomic Count Data," PLOS ONE, Public Library of Science, vol. 8(3), pages 1-7, March.
    2. Pengfei Song & Wen Qin & YanGan Huang & Lei Wang & Zhenyuan Cai & Tongzuo Zhang, 2020. "Grazing Management Influences Gut Microbial Diversity of Livestock in the Same Area," Sustainability, MDPI, vol. 12(10), pages 1-12, May.
    3. Shilan Li & Jianxin Shi & Paul Albert & Hong-Bin Fang, 2022. "Dependence Structure Analysis and Its Application in Human Microbiome," Mathematics, MDPI, vol. 11(1), pages 1-14, December.
    4. Kai Qiu & Huiyi Cai & Xin Wang & Guohua Liu, 2023. "Effects of Peroral Microbiota Transplantation on the Establishment of Intestinal Microorganisms in a Newly-Hatched Chick Model," Agriculture, MDPI, vol. 13(5), pages 1-13, April.
    5. Paul J McMurdie & Susan Holmes, 2014. "Waste Not, Want Not: Why Rarefying Microbiome Data Is Inadmissible," PLOS Computational Biology, Public Library of Science, vol. 10(4), pages 1-12, April.
    6. Allison G. White & George S. Watts & Zhenqiang Lu & Maria M. Meza-Montenegro & Eric A. Lutz & Philip Harber & Jefferey L. Burgess, 2014. "Environmental Arsenic Exposure and Microbiota in Induced Sputum," IJERPH, MDPI, vol. 11(2), pages 1-15, February.
    7. Qi Xu & Xiaoya Yuan & Tiantian Gu & Yang Li & Wangcheng Dai & Xiaokun Shen & Yadong Song & Yang Zhang & Wenming Zhao & Guobin Chang & Guohong Chen, 2017. "Comparative characterization of bacterial communities in geese fed all-grass or high-grain diets," PLOS ONE, Public Library of Science, vol. 12(10), pages 1-14, October.
    8. Patricio S La Rosa & Berkley Shands & Elena Deych & Yanjiao Zhou & Erica Sodergren & George Weinstock & William D Shannon, 2012. "Statistical Object Data Analysis of Taxonomic Trees from Human Microbiome Data," PLOS ONE, Public Library of Science, vol. 7(11), pages 1-12, November.
    9. Amirhossein Shamsaddini & Kimia Dadkhah & Patrick M Gillevet, 2020. "BiomMiner: An advanced exploratory microbiome analysis and visualization pipeline," PLOS ONE, Public Library of Science, vol. 15(6), pages 1-13, June.
    10. Tsapekos, Panagiotis & Khoshnevisan, Benyamin & Zhu, Xinyu & Treu, Laura & Alfaro, Natalia & Kougias, Panagiotis G. & Angelidaki, Irini, 2022. "Lab- and pilot-scale anaerobic digestion of municipal bio-waste and potential of digestate for biogas upgrading sustained by microbial analysis," Renewable Energy, Elsevier, vol. 201(P1), pages 344-353.
    11. Edoardo Pasolli & Duy Tin Truong & Faizan Malik & Levi Waldron & Nicola Segata, 2016. "Machine Learning Meta-analysis of Large Metagenomic Datasets: Tools and Biological Insights," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-26, July.
    12. Gregor Gorkiewicz & Gerhard G Thallinger & Slave Trajanoski & Stefan Lackner & Gernot Stocker & Thomas Hinterleitner & Christian Gülly & Christoph Högenauer, 2013. "Alterations in the Colonic Microbiota in Response to Osmotic Diarrhea," PLOS ONE, Public Library of Science, vol. 8(2), pages 1-17, February.
    13. Gabriele Bellotti & Eren Taskin & Simone Sello & Cristina Sudiro & Rossella Bortolaso & Francesca Bandini & Maria Chiara Guerrieri & Pier Sandro Cocconcelli & Francesco Vuolo & Edoardo Puglisi, 2022. "LABs Fermentation Side-Product Positively Influences Rhizosphere and Plant Growth in Greenhouse Lettuce and Tomatoes," Land, MDPI, vol. 11(9), pages 1-15, September.
    14. Bin Wang, 2020. "A Zipf-plot based normalization method for high-throughput RNA-seq data," PLOS ONE, Public Library of Science, vol. 15(4), pages 1-15, April.
    15. Wu, Peijie & Chen, Tianyi & Diew Wong, Yiik & Meng, Xianghai & Wang, Xueqin & Liu, Wei, 2023. "Exploring key spatio-temporal features of crash risk hot spots on urban road network: A machine learning approach," Transportation Research Part A: Policy and Practice, Elsevier, vol. 173(C).
    16. Hongjian Wei & Yongqi Wang & Juming Zhang & Liangfa Ge & Tianzeng Liu, 2022. "Changes in Soil Bacterial Community Structure in Bermudagrass Turf under Short-Term Traffic Stress," Agriculture, MDPI, vol. 12(5), pages 1-18, May.
    17. Yuan Ge & Joshua P Schimel & Patricia A Holden, 2014. "Analysis of Run-to-Run Variation of Bar-Coded Pyrosequencing for Evaluating Bacterial Community Shifts and Individual Taxa Dynamics," PLOS ONE, Public Library of Science, vol. 9(6), pages 1-6, June.
    18. Zheng Sun & Jiang Liu & Meng Zhang & Tong Wang & Shi Huang & Scott T. Weiss & Yang-Yu Liu, 2023. "Removal of false positives in metagenomics-based taxonomy profiling via targeting Type IIB restriction sites," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    19. Yong Li & Jiejie Zhang & Jianqiang Zhang & Wenlai Xu & Zishen Mou, 2019. "Microbial Community Structure in the Sediments and Its Relation to Environmental Factors in Eutrophicated Sancha Lake," IJERPH, MDPI, vol. 16(11), pages 1-15, May.
    20. Monica Vera-Lise Tulstrup & Ellen Gerd Christensen & Vera Carvalho & Caroline Linninge & Siv Ahrné & Ole Højberg & Tine Rask Licht & Martin Iain Bahl, 2015. "Antibiotic Treatment Affects Intestinal Permeability and Gut Microbial Composition in Wistar Rats Dependent on Antibiotic Class," PLOS ONE, Public Library of Science, vol. 10(12), pages 1-17, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Doratha A Byrd & Jun Chen & Emily Vogtmann & Autumn Hullings & Se Jin Song & Amnon Amir & Muhammad G Kibriya & Habibul Ahsan & Yu Chen & Heidi Nelson & Rob Knight & Jianxin Shi & Nicholas Chia & Rashm, 2019. "Reproducibility, stability, and accuracy of microbial profiles by fecal sample collection method in three distinct populations," PLOS ONE, Public Library of Science, vol. 14(11), pages 1-19, November.
    2. Kiran Konain & Sadia & Turfa Nadeem & Adeed Khan & Warda Iqbal & Arsalan & Amir Javed & Ruby Khan & Kainat Jamil & Kainat Jamil, 2018. "Importance of Probiotics in Gastrointestinal Tract," Journal of Asian Scientific Research, Asian Economic and Social Society, vol. 8(3), pages 128-143, March.
    3. Marjolein Heddes & Baraa Altaha & Yunhui Niu & Sandra Reitmeier & Karin Kleigrewe & Dirk Haller & Silke Kiessling, 2022. "The intestinal clock drives the microbiome to maintain gastrointestinal homeostasis," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    4. Hui Xia & Beijia Zhou & Jing Sui & Wenqing Ma & Shaokang Wang & Ligang Yang & Guiju Sun, 2022. "Lycium barbarum Polysaccharide Regulates the Lipid Metabolism and Alters Gut Microbiota in High-Fat Diet Induced Obese Mice," IJERPH, MDPI, vol. 19(19), pages 1-12, September.
    5. Davillas, Apostolos & Pudney, Stephen, 2016. "Concordance of health states in couples. Analysis of self-reported, nurse administered and blood-based biomarker data in Understanding Society," ISER Working Paper Series 2016-15, Institute for Social and Economic Research.
    6. Vinod Nikhra, 2019. "Therapeutic Potential of Gut Microbiome Manipulation: Concepts in Fecal Microbiota Transplantation," Current Research in Diabetes & Obesity Journal, Juniper Publishers Inc., vol. 11(1), pages 1-9, June.
    7. Cristiane R. S. Câmara & Carlos A. Urrea & Vicki Schlegel, 2013. "Pinto Beans ( Phaseolus vulgaris L.) as a Functional Food: Implications on Human Health," Agriculture, MDPI, vol. 3(1), pages 1-22, February.
    8. Vinod Nikhra, 2019. "The Novel Dimensions of Cardio-Metabolic Health Gut Microbiota, Dysbiosis and its Fallouts," Current Research in Diabetes & Obesity Journal, Juniper Publishers Inc., vol. 11(1), pages 28-37, June.
    9. Shinji Fukuda & Yumiko Nakanishi & Eisuke Chikayama & Hiroshi Ohno & Tsuneo Hino & Jun Kikuchi, 2009. "Evaluation and Characterization of Bacterial Metabolic Dynamics with a Novel Profiling Technique, Real-Time Metabolotyping," PLOS ONE, Public Library of Science, vol. 4(3), pages 1-10, March.
    10. Zemin Zheng & Jinchi Lv & Wei Lin, 2021. "Nonsparse Learning with Latent Variables," Operations Research, INFORMS, vol. 69(1), pages 346-359, January.
    11. Karen D. Corbin & Elvis A. Carnero & Blake Dirks & Daria Igudesman & Fanchao Yi & Andrew Marcus & Taylor L. Davis & Richard E. Pratley & Bruce E. Rittmann & Rosa Krajmalnik-Brown & Steven R. Smith, 2023. "Host-diet-gut microbiome interactions influence human energy balance: a randomized clinical trial," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    12. Michael DiMarzio & Brigida Rusconi & Neela H Yennawar & Mark Eppinger & Andrew D Patterson & Edward G Dudley, 2017. "Identification of a mouse Lactobacillus johnsonii strain with deconjugase activity against the FXR antagonist T-β-MCA," PLOS ONE, Public Library of Science, vol. 12(9), pages 1-15, September.
    13. Hanan H. Wahid & Ayesha Bahez & Mohammed I. A. Mustafa Mahmud & Farih N. Hashim & Norhidayah Kamarudin & Roesnita Baharuddin & Ahmad M. Ahmad Mustafa & Hamizah Ismail, 2022. "Maternal Risk Factors For Group B Streptococcus (Gbs) Vaginal Colonization," Acta Scientifica Malaysia (ASM), Zibeline International Publishing, vol. 6(2), pages 55-58, August.
    14. Tamar Ringel-Kulka & Jing Cheng & Yehuda Ringel & Jarkko Salojärvi & Ian Carroll & Airi Palva & Willem M de Vos & Reetta Satokari, 2013. "Intestinal Microbiota in Healthy U.S. Young Children and Adults—A High Throughput Microarray Analysis," PLOS ONE, Public Library of Science, vol. 8(5), pages 1-10, May.
    15. Hannah Lees & Jonathan Swann & Simon M Poucher & Jeremy K Nicholson & Elaine Holmes & Ian D Wilson & Julian R Marchesi, 2014. "Age and Microenvironment Outweigh Genetic Influence on the Zucker Rat Microbiome," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-11, September.
    16. Barbara Emmenegger & Julien Massoni & Christine M. Pestalozzi & Miriam Bortfeld-Miller & Benjamin A. Maier & Julia A. Vorholt, 2023. "Identifying microbiota community patterns important for plant protection using synthetic communities and machine learning," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    17. Liat Shenhav & Ori Furman & Leah Briscoe & Mike Thompson & Justin D Silverman & Itzhak Mizrahi & Eran Halperin, 2019. "Modeling the temporal dynamics of the gut microbial community in adults and infants," PLOS Computational Biology, Public Library of Science, vol. 15(6), pages 1-21, June.
    18. Isaac G. Crusoe & Ian C. Chiwaya & Tasnim I. Habib, 2024. "Immune Control of Gut Microbiota Prevents Obesity and the Effect of Antibiotic on Microbial Population," International Journal of Research and Scientific Innovation, International Journal of Research and Scientific Innovation (IJRSI), vol. 11(5), pages 1-9, May.
    19. Daniel Beck & James A Foster, 2014. "Machine Learning Techniques Accurately Classify Microbial Communities by Bacterial Vaginosis Characteristics," PLOS ONE, Public Library of Science, vol. 9(2), pages 1-8, February.
    20. Chihiro Morita & Hirokazu Tsuji & Tomokazu Hata & Motoharu Gondo & Shu Takakura & Keisuke Kawai & Kazufumi Yoshihara & Kiyohito Ogata & Koji Nomoto & Kouji Miyazaki & Nobuyuki Sudo, 2015. "Gut Dysbiosis in Patients with Anorexia Nervosa," PLOS ONE, Public Library of Science, vol. 10(12), pages 1-13, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1000352. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.