IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1001061.html
   My bibliography  Save this article

PhylOTU: A High-Throughput Procedure Quantifies Microbial Community Diversity and Resolves Novel Taxa from Metagenomic Data

Author

Listed:
  • Thomas J Sharpton
  • Samantha J Riesenfeld
  • Steven W Kembel
  • Joshua Ladau
  • James P O'Dwyer
  • Jessica L Green
  • Jonathan A Eisen
  • Katherine S Pollard

Abstract

Microbial diversity is typically characterized by clustering ribosomal RNA (SSU-rRNA) sequences into operational taxonomic units (OTUs). Targeted sequencing of environmental SSU-rRNA markers via PCR may fail to detect OTUs due to biases in priming and amplification. Analysis of shotgun sequenced environmental DNA, known as metagenomics, avoids amplification bias but generates fragmentary, non-overlapping sequence reads that cannot be clustered by existing OTU-finding methods. To circumvent these limitations, we developed PhylOTU, a computational workflow that identifies OTUs from metagenomic SSU-rRNA sequence data through the use of phylogenetic principles and probabilistic sequence profiles. Using simulated metagenomic data, we quantified the accuracy with which PhylOTU clusters reads into OTUs. Comparisons of PCR and shotgun sequenced SSU-rRNA markers derived from the global open ocean revealed that while PCR libraries identify more OTUs per sequenced residue, metagenomic libraries recover a greater taxonomic diversity of OTUs. In addition, we discover novel species, genera and families in the metagenomic libraries, including OTUs from phyla missed by analysis of PCR sequences. Taken together, these results suggest that PhylOTU enables characterization of part of the biosphere currently hidden from PCR-based surveys of diversity?Author Summary: Microorganisms comprise the majority of the biodiversity on the planet. Because the overwhelming majority of microbes are not readily cultured in the laboratory, researchers often rely on PCR-based investigations of genomic sequence to characterize microbial diversity. These analyses have dramatically expanded our understanding of biodiversity, but due to methodological biases PCR-based approaches may only reveal part of the microbial biosphere. Shotgun sequencing of environmental DNA, known as metagenomics, avoids the biases associated with targeted amplification of genomic sequence and can provide insight into the diversity hidden from traditional investigations. However, the fragmentary, non-overlapping nature of shotgun sequence data makes it intractable to analyze with existing tools. Here, we present PhylOTU, a novel computational method that enables accurate characterization of microbial diversity from metagenomic data. We process over 10 million metagenomic sequences obtained from the global open ocean to identify novel Bacterial taxa and reveal the presence of microorganisms overlooked by investigation of PCR-based sequences from the same samples. These results suggest that to fully characterize microbial biodiversity requires a novel bioinformatics toolbox for analysis of shotgun metagenomic data.

Suggested Citation

  • Thomas J Sharpton & Samantha J Riesenfeld & Steven W Kembel & Joshua Ladau & James P O'Dwyer & Jessica L Green & Jonathan A Eisen & Katherine S Pollard, 2011. "PhylOTU: A High-Throughput Procedure Quantifies Microbial Community Diversity and Resolves Novel Taxa from Metagenomic Data," PLOS Computational Biology, Public Library of Science, vol. 7(1), pages 1-13, January.
  • Handle: RePEc:plo:pcbi00:1001061
    DOI: 10.1371/journal.pcbi.1001061
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1001061
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1001061&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1001061?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Alexandra M Schnoes & Shoshana D Brown & Igor Dodevski & Patricia C Babbitt, 2009. "Annotation Error in Public Databases: Misannotation of Molecular Function in Enzyme Superfamilies," PLOS Computational Biology, Public Library of Science, vol. 5(12), pages 1-13, December.
    2. Peter J. Turnbaugh & Micah Hamady & Tanya Yatsunenko & Brandi L. Cantarel & Alexis Duncan & Ruth E. Ley & Mitchell L. Sogin & William J. Jones & Bruce A. Roe & Jason P. Affourtit & Michael Egholm & Be, 2009. "A core gut microbiome in obese and lean twins," Nature, Nature, vol. 457(7228), pages 480-484, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wei Chen & Clarence K Zhang & Yongmei Cheng & Shaowu Zhang & Hongyu Zhao, 2013. "A Comparison of Methods for Clustering 16S rRNA Sequences into OTUs," PLOS ONE, Public Library of Science, vol. 8(8), pages 1-10, August.
    2. Yemin Lan & Qiong Wang & James R Cole & Gail L Rosen, 2012. "Using the RDP Classifier to Predict Taxonomic Novelty and Reduce the Search Space for Finding Novel Organisms," PLOS ONE, Public Library of Science, vol. 7(3), pages 1-15, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. John Molloy & Katrina Allen & Fiona Collier & Mimi L. K. Tang & Alister C. Ward & Peter Vuillermin, 2013. "The Potential Link between Gut Microbiota and IgE-Mediated Food Allergy in Early Life," IJERPH, MDPI, vol. 10(12), pages 1-22, December.
    2. Pirjo Wacklin & Harri Mäkivuokko & Noora Alakulppi & Janne Nikkilä & Heli Tenkanen & Jarkko Räbinä & Jukka Partanen & Kari Aranko & Jaana Mättö, 2011. "Secretor Genotype (FUT2 gene) Is Strongly Associated with the Composition of Bifidobacteria in the Human Intestine," PLOS ONE, Public Library of Science, vol. 6(5), pages 1-10, May.
    3. Sanjeena Subedi & Drew Neish & Stephen Bak & Zeny Feng, 2020. "Cluster analysis of microbiome data by using mixtures of Dirichlet–multinomial regression models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(5), pages 1163-1187, November.
    4. Shinji Fukuda & Yumiko Nakanishi & Eisuke Chikayama & Hiroshi Ohno & Tsuneo Hino & Jun Kikuchi, 2009. "Evaluation and Characterization of Bacterial Metabolic Dynamics with a Novel Profiling Technique, Real-Time Metabolotyping," PLOS ONE, Public Library of Science, vol. 4(3), pages 1-10, March.
    5. Akira R Kinjo & Haruki Nakamura, 2012. "Composite Structural Motifs of Binding Sites for Delineating Biological Functions of Proteins," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-11, February.
    6. Hannah Lees & Jonathan Swann & Simon M Poucher & Jeremy K Nicholson & Elaine Holmes & Ian D Wilson & Julian R Marchesi, 2014. "Age and Microenvironment Outweigh Genetic Influence on the Zucker Rat Microbiome," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-11, September.
    7. Liat Shenhav & Ori Furman & Leah Briscoe & Mike Thompson & Justin D Silverman & Itzhak Mizrahi & Eran Halperin, 2019. "Modeling the temporal dynamics of the gut microbial community in adults and infants," PLOS Computational Biology, Public Library of Science, vol. 15(6), pages 1-21, June.
    8. Huang Lin & Merete Eggesbø & Shyamal Das Peddada, 2022. "Linear and nonlinear correlation estimators unveil undescribed taxa interactions in microbiome data," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    9. Li, Jie & Shen, Xuzhu & Li, YaoTang, 2021. "Modeling the temporal dynamics of gut microbiota from a local community perspective," Ecological Modelling, Elsevier, vol. 460(C).
    10. Lena Takayasu & Wataru Suda & Eiichiro Watanabe & Shinji Fukuda & Kageyasu Takanashi & Hiroshi Ohno & Misako Takayasu & Hideki Takayasu & Masahira Hattori, 2017. "A 3-dimensional mathematical model of microbial proliferation that generates the characteristic cumulative relative abundance distributions in gut microbiomes," PLOS ONE, Public Library of Science, vol. 12(8), pages 1-20, August.
    11. Claudia Sala & Enrico Giampieri & Silvia Vitali & Paolo Garagnani & Daniel Remondini & Armando Bazzani & Claudio Franceschi & Gastone C Castellani, 2020. "Gut microbiota ecology: Biodiversity estimated from hybrid neutral-niche model increases with health status and aging," PLOS ONE, Public Library of Science, vol. 15(10), pages 1-23, October.
    12. Wing-Cheong Wong & Sebastian Maurer-Stroh & Frank Eisenhaber, 2010. "More Than 1,001 Problems with Protein Domain Databases: Transmembrane Regions, Signal Peptides and the Issue of Sequence Homology," PLOS Computational Biology, Public Library of Science, vol. 6(7), pages 1-19, July.
    13. Alessandra N. Bazzano & Kaitlin S. Potts & Lydia A. Bazzano & John B. Mason, 2017. "The Life Course Implications of Ready to Use Therapeutic Food for Children in Low-Income Countries," IJERPH, MDPI, vol. 14(4), pages 1-19, April.
    14. Fiona B. Tamburini & Dylan Maghini & Ovokeraye H. Oduaran & Ryan Brewster & Michaella R. Hulley & Venesa Sahibdeen & Shane A. Norris & Stephen Tollman & Kathleen Kahn & Ryan G. Wagner & Alisha N. Wade, 2022. "Short- and long-read metagenomics of urban and rural South African gut microbiomes reveal a transitional composition and undescribed taxa," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    15. Xiao-Tao Jiang & Hai Zhang & Hua-Fang Sheng & Yu Wang & Yan He & Fei Zou & Hong-Wei Zhou, 2012. "Two-Stage Clustering (TSC): A Pipeline for Selecting Operational Taxonomic Units for the High-Throughput Sequencing of PCR Amplicons," PLOS ONE, Public Library of Science, vol. 7(1), pages 1-8, January.
    16. Liangwu Qiu & Fuhong Gong & Jiang Wu & Dingyun You & Yinzhou Zhao & Lianwu Xu & Xue Cao & Fukai Bao, 2022. "Exercise Interventions Improved Sleep Quality through Regulating Intestinal Microbiota Composition," IJERPH, MDPI, vol. 19(19), pages 1-14, September.
    17. Toby Kenney & Hong Gu & Tianshu Huang, 2021. "Poisson PCA: Poisson measurement error corrected PCA, with application to microbiome data," Biometrics, The International Biometric Society, vol. 77(4), pages 1369-1384, December.
    18. Doris Vandeputte & Lindsey Commer & Raul Y. Tito & Gunter Kathagen & João Sabino & Séverine Vermeire & Karoline Faust & Jeroen Raes, 2021. "Temporal variability in quantitative human gut microbiome profiles and implications for clinical research," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    19. Kai Markus Schneider & Antje Mohs & Wenfang Gui & Eric J. C. Galvez & Lena Susanna Candels & Lisa Hoenicke & Uthayakumar Muthukumarasamy & Christian H. Holland & Carsten Elfers & Konrad Kilic & Caroli, 2022. "Imbalanced gut microbiota fuels hepatocellular carcinoma development by shaping the hepatic inflammatory microenvironment," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    20. Yuval Bussi & Ruti Kapon & Ziv Reich, 2021. "Large-scale k-mer-based analysis of the informational properties of genomes, comparative genomics and taxonomy," PLOS ONE, Public Library of Science, vol. 16(10), pages 1-27, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1001061. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.