A Comparison of Methods for Clustering 16S rRNA Sequences into OTUs

Author

Listed:
  • Wei Chen
  • Clarence K Zhang
  • Yongmei Cheng
  • Shaowu Zhang
  • Hongyu Zhao

Abstract

Recent studies of 16S rRNA sequences through next-generation sequencing have revolutionized our understanding of microbial community composition and structure. One common approach to exploring the genetic diversity of a microbial community with these data is to cluster the 16S rRNA sequences into Operational Taxonomic Units (OTUs) based on sequence similarity. The inferred OTUs can then be used to estimate species diversity, composition, and richness. Although a number of methods have been developed and are commonly used to cluster sequences into OTUs, relatively little guidance is available on their relative performance or on the choice of key parameters for each method. In this study, we conducted a comprehensive evaluation of ten existing OTU inference methods. We found that the appropriate dissimilarity value for defining distinct OTUs depends not only on the specific method but also on the sample complexity. For data sets with low complexity, all the algorithms need a higher dissimilarity threshold to define OTUs. Some methods, such as CROP and SLP, are more robust to the specific choice of threshold than other methods, especially for shorter reads. For high-complexity data sets, hierarchical clustering methods need a stricter dissimilarity threshold to define OTUs, because the commonly used dissimilarity threshold of 3% often leads to an underestimation of the number of OTUs. In general, hierarchical clustering methods perform better at lower dissimilarity thresholds. Our results also show that sequence abundance plays an important role in OTU inference. We conclude that care is needed in choosing thresholds for both dissimilarity and abundance in OTU inference.
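
To make the role of the two thresholds concrete, the following is a minimal toy sketch, not one of the ten methods evaluated in the paper. It assumes pre-aligned reads of equal length, uses the fraction of mismatched positions as the dissimilarity, and applies a simple greedy seed-based clustering. The names `dissimilarity`, `greedy_otus`, `mutate`, and `min_abundance`, as well as the synthetic reads, are hypothetical and chosen only for illustration.

```python
from collections import Counter


def dissimilarity(a: str, b: str) -> float:
    """Fraction of mismatched positions between two equal-length, pre-aligned reads."""
    if len(a) != len(b):
        raise ValueError("toy distance expects aligned reads of equal length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def greedy_otus(reads, threshold=0.03, min_abundance=1):
    """Toy greedy clustering: each unique read joins the first seed within `threshold`.

    Unique reads seen fewer than `min_abundance` times are discarded first.
    Returns a dict mapping each seed sequence to the total reads in its OTU.
    """
    counts = Counter(reads)
    # Most abundant unique sequences are visited first, so they become seeds.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    otus = {}
    for seq, n in ordered:
        if n < min_abundance:
            continue
        for seed in otus:
            if dissimilarity(seq, seed) <= threshold:
                otus[seed] += n
                break
        else:
            # No existing seed is close enough: this read starts a new OTU.
            otus[seq] = n
    return otus


def mutate(seq, positions):
    """Return a copy of `seq` with a substitution at each listed position."""
    swap = {"A": "C", "C": "G", "G": "T", "T": "A"}
    chars = list(seq)
    for p in positions:
        chars[p] = swap[chars[p]]
    return "".join(chars)


# Synthetic 100-nt reads: one abundant sequence plus two rarer variants.
base = "ACGT" * 25
near = mutate(base, [3, 50])              # 2% dissimilar to base
far = mutate(base, [1, 20, 40, 60, 80])   # 5% dissimilar to base
reads = [base] * 5 + [near] * 2 + [far] * 1

for t in (0.01, 0.03, 0.06):
    print(f"threshold={t:.2f}: {len(greedy_otus(reads, threshold=t))} OTUs")
print("threshold=0.03, min_abundance=2:",
      len(greedy_otus(reads, threshold=0.03, min_abundance=2)), "OTUs")
```

On this synthetic input the OTU count changes with both settings: a stricter dissimilarity threshold splits the variants into separate OTUs, while an abundance cutoff removes the singleton variant before clustering. Real pipelines differ in how they align reads, compute distances, and link clusters, so the numbers here only illustrate the sensitivity described in the abstract.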

Suggested Citation

  • Wei Chen & Clarence K Zhang & Yongmei Cheng & Shaowu Zhang & Hongyu Zhao, 2013. "A Comparison of Methods for Clustering 16S rRNA Sequences into OTUs," PLOS ONE, Public Library of Science, vol. 8(8), pages 1-10, August.
  • Handle: RePEc:plo:pone00:0070837
    DOI: 10.1371/journal.pone.0070837

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0070837
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0070837&type=printable
    Download Restriction: no

    Citations

    Cited by:

    1. Jeongsu Oh & Chi-Hwan Choi & Min-Kyu Park & Byung Kwon Kim & Kyuin Hwang & Sang-Heon Lee & Soon Gyu Hong & Arshan Nasir & Wan-Sup Cho & Kyung Mo Kim, 2016. "CLUSTOM-CLOUD: In-Memory Data Grid-Based Software for Clustering 16S rRNA Sequence Data in the Cloud Environment," PLOS ONE, Public Library of Science, vol. 11(3), pages 1-20, March.
    2. Stilianos Louca & Florent Mazel & Michael Doebeli & Laura Wegener Parfrey, 2019. "A census-based estimate of Earth's bacterial and archaeal diversity," PLOS Biology, Public Library of Science, vol. 17(2), pages 1-30, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pirjo Wacklin & Harri Mäkivuokko & Noora Alakulppi & Janne Nikkilä & Heli Tenkanen & Jarkko Räbinä & Jukka Partanen & Kari Aranko & Jaana Mättö, 2011. "Secretor Genotype (FUT2 gene) Is Strongly Associated with the Composition of Bifidobacteria in the Human Intestine," PLOS ONE, Public Library of Science, vol. 6(5), pages 1-10, May.
    2. Mariana F. Fernández & Iris Reina-Pérez & Juan Manuel Astorga & Andrea Rodríguez-Carrillo & Julio Plaza-Díaz & Luis Fontana, 2018. "Breast Cancer and Its Relationship with the Microbiota," IJERPH, MDPI, vol. 15(8), pages 1-20, August.
    3. Hannah Lees & Jonathan Swann & Simon M Poucher & Jeremy K Nicholson & Elaine Holmes & Ian D Wilson & Julian R Marchesi, 2014. "Age and Microenvironment Outweigh Genetic Influence on the Zucker Rat Microbiome," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-11, September.
    4. Chihiro Morita & Hirokazu Tsuji & Tomokazu Hata & Motoharu Gondo & Shu Takakura & Keisuke Kawai & Kazufumi Yoshihara & Kiyohito Ogata & Koji Nomoto & Kouji Miyazaki & Nobuyuki Sudo, 2015. "Gut Dysbiosis in Patients with Anorexia Nervosa," PLOS ONE, Public Library of Science, vol. 10(12), pages 1-13, December.
    5. Dongyang Yang & Wei Xu, 2023. "Estimation of Mediation Effect on Zero-Inflated Microbiome Mediators," Mathematics, MDPI, vol. 11(13), pages 1-16, June.
    6. Liangwu Qiu & Fuhong Gong & Jiang Wu & Dingyun You & Yinzhou Zhao & Lianwu Xu & Xue Cao & Fukai Bao, 2022. "Exercise Interventions Improved Sleep Quality through Regulating Intestinal Microbiota Composition," IJERPH, MDPI, vol. 19(19), pages 1-14, September.
    7. Jeffrey D Galley & Michael Bailey & Claire Kamp Dush & Sarah Schoppe-Sullivan & Lisa M Christian, 2014. "Maternal Obesity Is Associated with Alterations in the Gut Microbiome in Toddlers," PLOS ONE, Public Library of Science, vol. 9(11), pages 1-13, November.
    8. Qian Zhang & Lijuan Zhang & Yang Lyu & Yutao Shi & Liangyun Zhu & Min Zhang & Yuyan Zhao & Di Zhao & Lei Wang & Dan Yi & Yongqing Hou & Tao Wu, 2022. "Dietary supplementation of Lactobacillus zeae regulated the gut microbiome in piglets infected with enterotoxigenic Escherichia coli," Czech Journal of Animal Science, Czech Academy of Agricultural Sciences, vol. 67(1), pages 27-38.
    9. Patrick D Schloss, 2009. "A High-Throughput DNA Sequence Aligner for Microbial Ecology Studies," PLOS ONE, Public Library of Science, vol. 4(12), pages 1-9, December.
    10. John Molloy & Katrina Allen & Fiona Collier & Mimi L. K. Tang & Alister C. Ward & Peter Vuillermin, 2013. "The Potential Link between Gut Microbiota and IgE-Mediated Food Allergy in Early Life," IJERPH, MDPI, vol. 10(12), pages 1-22, December.
    11. Bharati Patel & Kadamb Patel & Shabbir Moochhala, 2020. "Diet-Derived Post-Biotic Metabolites to Promote Microbiota Function and Human Health," Biomedical Journal of Scientific & Technical Research, Biomedical Research Network+, LLC, vol. 28(2), pages 21520-21524, June.
    12. Ahmed A Metwally & Philip S Yu & Derek Reiman & Yang Dai & Patricia W Finn & David L Perkins, 2019. "Utilizing longitudinal microbiome taxonomic profiles to predict food allergy via Long Short-Term Memory networks," PLOS Computational Biology, Public Library of Science, vol. 15(2), pages 1-16, February.
    13. Kiran Konain & Sadia & Turfa Nadeem & Adeed Khan & Warda Iqbal & Arsalan & Amir Javed & Ruby Khan & Kainat Jamil, 2018. "Importance of Probiotics in Gastrointestinal Tract," Journal of Asian Scientific Research, Asian Economic and Social Society, vol. 8(3), pages 128-143, March.
    14. Yunxi Liu & R. A. Leo Elworth & Michael D. Jochum & Kjersti M. Aagaard & Todd J. Treangen, 2022. "De novo identification of microbial contaminants in low microbial biomass microbiomes with Squeegee," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    15. C. E. Dubé & M. Ziegler & A. Mercière & E. Boissin & S. Planes & C. A. -F. Bourmaud & C. R. Voolstra, 2021. "Naturally occurring fire coral clones demonstrate a genetic and environmental basis of microbiome composition," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    16. Frisha Abkar & Sajjad ur Rahman & Ahsan Naveed & Hira Rasheed & Syed Ashar Mehfooz, 2019. "Evaluation of Oral Microflora in Obese and Non- Obese Humans from District Faisalabad, Pakistan," Current Research in Diabetes & Obesity Journal, Juniper Publishers Inc., vol. 10(1), pages 12-16, March.
    17. Frederick A Matsen IV & Steven N Evans, 2013. "Edge Principal Components and Squash Clustering: Using the Special Structure of Phylogenetic Placement Data for Sample Comparison," PLOS ONE, Public Library of Science, vol. 8(3), pages 1-15, March.
    18. Thomas J Sharpton & Samantha J Riesenfeld & Steven W Kembel & Joshua Ladau & James P O'Dwyer & Jessica L Green & Jonathan A Eisen & Katherine S Pollard, 2011. "PhylOTU: A High-Throughput Procedure Quantifies Microbial Community Diversity and Resolves Novel Taxa from Metagenomic Data," PLOS Computational Biology, Public Library of Science, vol. 7(1), pages 1-13, January.
    19. Sanjeena Subedi & Drew Neish & Stephen Bak & Zeny Feng, 2020. "Cluster analysis of microbiome data by using mixtures of Dirichlet–multinomial regression models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(5), pages 1163-1187, November.
    20. Vinod Nikhra, 2019. "The Novel Dimensions of Cardio-Metabolic Health Gut Microbiota, Dysbiosis and its Fallouts," Current Research in Diabetes & Obesity Journal, Juniper Publishers Inc., vol. 11(1), pages 28-37, June.
