IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v14y2023i1d10.1038_s41467-023-41099-8.html
   My bibliography  Save this article

Removal of false positives in metagenomics-based taxonomy profiling via targeting Type IIB restriction sites

Author

Listed:
  • Zheng Sun

    (Brigham and Women’s Hospital and Harvard Medical School)

  • Jiang Liu

    (Qingdao OE Biotechnology Company Limited)

  • Meng Zhang

    (Inner Mongolia Agricultural University)

  • Tong Wang

    (Brigham and Women’s Hospital and Harvard Medical School)

  • Shi Huang

    (The University of Hong Kong)

  • Scott T. Weiss

    (Brigham and Women’s Hospital and Harvard Medical School)

  • Yang-Yu Liu

    (Brigham and Women’s Hospital and Harvard Medical School
    University of Illinois at Urbana-Champaign)

Abstract

Accurate species identification and abundance estimation are critical for the interpretation of whole metagenome sequencing (WMS) data. Yet, existing metagenomic profilers suffer from false-positive identifications, which can account for more than 90% of total identified species. Here, by leveraging species-specific Type IIB restriction endonuclease digestion sites as reference instead of universal markers or whole microbial genomes, we present a metagenomic profiler, MAP2B (MetAgenomic Profiler based on type IIB restriction sites), to resolve those issues. We first illustrate the pitfalls of using relative abundance as the only feature in determining false positives. We then propose a feature set to distinguish false positives from true positives, and using simulated metagenomes from CAMI2, we establish a false-positive recognition model. By benchmarking the performance in metagenomic profiling using a simulation dataset with varying sequencing depth and species richness, we illustrate the superior performance of MAP2B over existing metagenomic profilers in species identification. We further test the performance of MAP2B using real WMS data from an ATCC mock community, confirming its superior precision against sequencing depth. Finally, by leveraging WMS data from an IBD cohort, we demonstrate the taxonomic features generated by MAP2B can better discriminate IBD and predict metabolomic profiles.

Suggested Citation

  • Zheng Sun & Jiang Liu & Meng Zhang & Tong Wang & Shi Huang & Scott T. Weiss & Yang-Yu Liu, 2023. "Removal of false positives in metagenomics-based taxonomy profiling via targeting Type IIB restriction sites," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
  • Handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-41099-8
    DOI: 10.1038/s41467-023-41099-8
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-023-41099-8
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-023-41099-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. James Robert White & Niranjan Nagarajan & Mihai Pop, 2009. "Statistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples," PLOS Computational Biology, Public Library of Science, vol. 5(4), pages 1-11, April.
    2. Gregory D. Poore & Evguenia Kopylova & Qiyun Zhu & Carolina Carpenter & Serena Fraraccio & Stephen Wandro & Tomasz Kosciolek & Stefan Janssen & Jessica Metcalf & Se Jin Song & Jad Kanbar & Sandrine Mi, 2020. "RETRACTED ARTICLE: Microbiome analyses of blood and tissues suggest cancer diagnostic approach," Nature, Nature, vol. 579(7800), pages 567-574, March.
    3. Peter Menzel & Kim Lee Ng & Anders Krogh, 2016. "Fast and sensitive taxonomic classification for metagenomics with Kaiju," Nature Communications, Nature, vol. 7(1), pages 1-9, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kaiyuan Zhu & Alejandro A. Schäffer & Welles Robinson & Junyan Xu & Eytan Ruppin & A. Funda Ergun & Yuzhen Ye & S. Cenk Sahinalp, 2022. "Strain level microbial detection and quantification with applications to single cell metagenomics," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    2. Pengfei Song & Wen Qin & YanGan Huang & Lei Wang & Zhenyuan Cai & Tongzuo Zhang, 2020. "Grazing Management Influences Gut Microbial Diversity of Livestock in the Same Area," Sustainability, MDPI, vol. 12(10), pages 1-12, May.
    3. Shilan Li & Jianxin Shi & Paul Albert & Hong-Bin Fang, 2022. "Dependence Structure Analysis and Its Application in Human Microbiome," Mathematics, MDPI, vol. 11(1), pages 1-14, December.
    4. Allison G. White & George S. Watts & Zhenqiang Lu & Maria M. Meza-Montenegro & Eric A. Lutz & Philip Harber & Jefferey L. Burgess, 2014. "Environmental Arsenic Exposure and Microbiota in Induced Sputum," IJERPH, MDPI, vol. 11(2), pages 1-15, February.
    5. Leandro C. Hermida & E. Michael Gertz & Eytan Ruppin, 2022. "Predicting cancer prognosis and drug response from the tumor microbiome," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    6. Yunmin Yang & Binbin Chu & Jiayi Cheng & Jiali Tang & Bin Song & Houyu Wang & Yao He, 2022. "Bacteria eat nanoprobes for aggregation-enhanced imaging and killing diverse microorganisms," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    7. Amanda Sörensen Ristinmaa & Albert Tafur Rangel & Alexander Idström & Sebastian Valenzuela & Eduard J. Kerkhoven & Phillip B. Pope & Merima Hasani & Johan Larsbrink, 2023. "Resin acids play key roles in shaping microbial communities during degradation of spruce bark," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    8. Yong Li & Jiejie Zhang & Jianqiang Zhang & Wenlai Xu & Zishen Mou, 2019. "Microbial Community Structure in the Sediments and Its Relation to Environmental Factors in Eutrophicated Sancha Lake," IJERPH, MDPI, vol. 16(11), pages 1-15, May.
    9. Paul J McMurdie & Susan Holmes, 2014. "Waste Not, Want Not: Why Rarefying Microbiome Data Is Inadmissible," PLOS Computational Biology, Public Library of Science, vol. 10(4), pages 1-12, April.
    10. Candice R. Gurbatri & Georgette A. Radford & Laura Vrbanac & Jongwon Im & Elaine M. Thomas & Courtney Coker & Samuel R. Taylor & YoungUk Jang & Ayelet Sivan & Kyu Rhee & Anas A. Saleh & Tiffany Chien , 2024. "Engineering tumor-colonizing E. coli Nissle 1917 for detection and treatment of colorectal neoplasia," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    11. Bin Wang, 2020. "A Zipf-plot based normalization method for high-throughput RNA-seq data," PLOS ONE, Public Library of Science, vol. 15(4), pages 1-15, April.
    12. Hongjian Wei & Yongqi Wang & Juming Zhang & Liangfa Ge & Tianzeng Liu, 2022. "Changes in Soil Bacterial Community Structure in Bermudagrass Turf under Short-Term Traffic Stress," Agriculture, MDPI, vol. 12(5), pages 1-18, May.
    13. Lharbi Dridi & Fernando Altamura & Emmanuel Gonzalez & Olivia Lui & Ryszard Kubinski & Reilly Pidgeon & Adrian Montagut & Jasmine Chong & Jianguo Xia & Corinne F. Maurice & Bastien Castagner, 2023. "Identifying glycan consumers in human gut microbiota samples using metabolic labeling coupled with fluorescence-activated cell sorting," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    14. Lingjing Jiang & Niina Haiminen & Anna‐Paola Carrieri & Shi Huang & Yoshiki Vázquez‐Baeza & Laxmi Parida & Ho‐Cheol Kim & Austin D. Swafford & Rob Knight & Loki Natarajan, 2022. "Utilizing stability criteria in choosing feature selection methods yields reproducible results in microbiome data," Biometrics, The International Biometric Society, vol. 78(3), pages 1155-1167, September.
    15. Kai Qiu & Huiyi Cai & Xin Wang & Guohua Liu, 2023. "Effects of Peroral Microbiota Transplantation on the Establishment of Intestinal Microorganisms in a Newly-Hatched Chick Model," Agriculture, MDPI, vol. 13(5), pages 1-13, April.
    16. Amirhossein Shamsaddini & Kimia Dadkhah & Patrick M Gillevet, 2020. "BiomMiner: An advanced exploratory microbiome analysis and visualization pipeline," PLOS ONE, Public Library of Science, vol. 15(6), pages 1-13, June.
    17. Gregor Gorkiewicz & Gerhard G Thallinger & Slave Trajanoski & Stefan Lackner & Gernot Stocker & Thomas Hinterleitner & Christian Gülly & Christoph Högenauer, 2013. "Alterations in the Colonic Microbiota in Response to Osmotic Diarrhea," PLOS ONE, Public Library of Science, vol. 8(2), pages 1-17, February.
    18. Gabriele Bellotti & Eren Taskin & Simone Sello & Cristina Sudiro & Rossella Bortolaso & Francesca Bandini & Maria Chiara Guerrieri & Pier Sandro Cocconcelli & Francesco Vuolo & Edoardo Puglisi, 2022. "LABs Fermentation Side-Product Positively Influences Rhizosphere and Plant Growth in Greenhouse Lettuce and Tomatoes," Land, MDPI, vol. 11(9), pages 1-15, September.
    19. Mason. R. Stothart & Philip. D. McLoughlin & Sarah. A. Medill & Ruth. J. Greuel & Alastair. J. Wilson & Jocelyn. Poissant, 2024. "Methanogenic patterns in the gut microbiome are associated with survival in a population of feral horses," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    20. Yuan Ge & Joshua P Schimel & Patricia A Holden, 2014. "Analysis of Run-to-Run Variation of Bar-Coded Pyrosequencing for Evaluating Bacterial Community Shifts and Individual Taxa Dynamics," PLOS ONE, Public Library of Science, vol. 9(6), pages 1-6, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-41099-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.