IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v13y2022i1d10.1038_s41467-022-33869-7.html
   My bibliography  Save this article

Strain level microbial detection and quantification with applications to single cell metagenomics

Author

Listed:
  • Kaiyuan Zhu

    (National Cancer Institute, National Institutes of Health
    UC San Diego
    Indiana University)

  • Alejandro A. Schäffer

    (National Cancer Institute, National Institutes of Health)

  • Welles Robinson

    (National Cancer Institute, National Institutes of Health
    National Cancer Institute, National Institutes of Health)

  • Junyan Xu

    (National Cancer Institute, National Institutes of Health)

  • Eytan Ruppin

    (National Cancer Institute, National Institutes of Health)

  • A. Funda Ergun

    (Indiana University)

  • Yuzhen Ye

    (Indiana University)

  • S. Cenk Sahinalp

    (National Cancer Institute, National Institutes of Health
    Indiana University)

Abstract

Computational identification and quantification of distinct microbes from high throughput sequencing data is crucial for our understanding of human health. Existing methods either use accurate but computationally expensive alignment-based approaches or less accurate but computationally fast alignment-free approaches, which often fail to correctly assign reads to genomes. Here we introduce CAMMiQ, a combinatorial optimization framework to identify and quantify distinct genomes (specified by a database) in a metagenomic dataset. As a key methodological innovation, CAMMiQ uses substrings of variable length and those that appear in two genomes in the database, as opposed to the commonly used fixed-length, unique substrings. These substrings allow to accurately decouple mixtures of highly similar genomes resulting in higher accuracy than the leading alternatives, without requiring additional computational resources, as demonstrated on commonly used benchmarking datasets. Importantly, we show that CAMMiQ can distinguish closely related bacterial strains in simulated metagenomic and real single-cell metatranscriptomic data.

Suggested Citation

  • Kaiyuan Zhu & Alejandro A. Schäffer & Welles Robinson & Junyan Xu & Eytan Ruppin & A. Funda Ergun & Yuzhen Ye & S. Cenk Sahinalp, 2022. "Strain level microbial detection and quantification with applications to single cell metagenomics," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
  • Handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-33869-7
    DOI: 10.1038/s41467-022-33869-7
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-022-33869-7
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-022-33869-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Anna Aulicino & Kevin C. Rue-Albrecht & Lorena Preciado-Llanes & Giorgio Napolitani & Neil Ashley & Adam Cribbs & Jana Koth & B. Christoffer Lagerholm & Tim Ambrose & Melita A. Gordon & David Sims & A, 2018. "Invasive Salmonella exploits divergent immune evasion strategies in infected and bystander dendritic cell subsets," Nature Communications, Nature, vol. 9(1), pages 1-17, December.
    2. Gregory D. Poore & Evguenia Kopylova & Qiyun Zhu & Carolina Carpenter & Serena Fraraccio & Stephen Wandro & Tomasz Kosciolek & Stefan Janssen & Jessica Metcalf & Se Jin Song & Jad Kanbar & Sandrine Mi, 2020. "RETRACTED ARTICLE: Microbiome analyses of blood and tissues suggest cancer diagnostic approach," Nature, Nature, vol. 579(7800), pages 567-574, March.
    3. Peter Menzel & Kim Lee Ng & Anders Krogh, 2016. "Fast and sensitive taxonomic classification for metagenomics with Kaiju," Nature Communications, Nature, vol. 7(1), pages 1-9, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zheng Sun & Jiang Liu & Meng Zhang & Tong Wang & Shi Huang & Scott T. Weiss & Yang-Yu Liu, 2023. "Removal of false positives in metagenomics-based taxonomy profiling via targeting Type IIB restriction sites," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    2. Leandro C. Hermida & E. Michael Gertz & Eytan Ruppin, 2022. "Predicting cancer prognosis and drug response from the tumor microbiome," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    3. Yunmin Yang & Binbin Chu & Jiayi Cheng & Jiali Tang & Bin Song & Houyu Wang & Yao He, 2022. "Bacteria eat nanoprobes for aggregation-enhanced imaging and killing diverse microorganisms," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    4. Amanda Sörensen Ristinmaa & Albert Tafur Rangel & Alexander Idström & Sebastian Valenzuela & Eduard J. Kerkhoven & Phillip B. Pope & Merima Hasani & Johan Larsbrink, 2023. "Resin acids play key roles in shaping microbial communities during degradation of spruce bark," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    5. Peng Liu & Jessica Ewald & Zhiqiang Pang & Elena Legrand & Yeon Seon Jeon & Jonathan Sangiovanni & Orcun Hacariz & Guangyan Zhou & Jessica A. Head & Niladri Basu & Jianguo Xia, 2023. "ExpressAnalyst: A unified platform for RNA-sequencing analysis in non-model species," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    6. Candice R. Gurbatri & Georgette A. Radford & Laura Vrbanac & Jongwon Im & Elaine M. Thomas & Courtney Coker & Samuel R. Taylor & YoungUk Jang & Ayelet Sivan & Kyu Rhee & Anas A. Saleh & Tiffany Chien , 2024. "Engineering tumor-colonizing E. coli Nissle 1917 for detection and treatment of colorectal neoplasia," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    7. Wei Ding & Shougang Wang & Peng Qin & Shen Fan & Xiaoyan Su & Peiyan Cai & Jie Lu & Han Cui & Meng Wang & Yi Shu & Yongming Wang & Hui-Hui Fu & Yu-Zhong Zhang & Yong-Xin Li & Weipeng Zhang, 2023. "Anaerobic thiosulfate oxidation by the Roseobacter group is prevalent in marine biofilms," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    8. Daniela S. Aliaga Goltsman & Lisa M. Alexander & Jyun-Liang Lin & Rodrigo Fregoso Ocampo & Benjamin Freeman & Rebecca C. Lamothe & Andres Perez Rivas & Morayma M. Temoche-Diaz & Shailaja Chadha & Nata, 2022. "Compact Cas9d and HEARO enzymes for genome editing discovered from uncultivated microbes," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    9. Ernestina Hauptfeld & Nikolaos Pappas & Sandra Iwaarden & Basten L. Snoek & Andrea Aldas-Vargas & Bas E. Dutilh & F. A. Bastiaan Meijenfeldt, 2024. "Integrating taxonomic signals from MAGs and contigs improves read annotation and taxonomic profiling of metagenomes," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    10. J. L. Rolando & M. Kolton & T. Song & Y. Liu & P. Pinamang & R. Conrad & J. T. Morris & K. T. Konstantinidis & J. E. Kostka, 2024. "Sulfur oxidation and reduction are coupled to nitrogen fixation in the roots of the salt marsh foundation plant Spartina alterniflora," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    11. Omary Mzava & Alexandre Pellan Cheng & Adrienne Chang & Sami Smalling & Liz-Audrey Kounatse Djomnang & Joan Sesing Lenz & Randy Longman & Amy Steadman & Luis G. Gómez-Escobar & Edward J. Schenck & Mir, 2022. "A metagenomic DNA sequencing assay that is robust against environmental DNA contamination," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    12. Lharbi Dridi & Fernando Altamura & Emmanuel Gonzalez & Olivia Lui & Ryszard Kubinski & Reilly Pidgeon & Adrian Montagut & Jasmine Chong & Jianguo Xia & Corinne F. Maurice & Bastien Castagner, 2023. "Identifying glycan consumers in human gut microbiota samples using metabolic labeling coupled with fluorescence-activated cell sorting," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    13. Alexa M. Nicolas & Ella T. Sieradzki & Jennifer Pett-Ridge & Jillian F. Banfield & Michiko E. Taga & Mary K. Firestone & Steven J. Blazewicz, 2023. "A subset of viruses thrives following microbial resuscitation during rewetting of a seasonally dry California grassland soil," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    14. Lingjing Jiang & Niina Haiminen & Anna‐Paola Carrieri & Shi Huang & Yoshiki Vázquez‐Baeza & Laxmi Parida & Ho‐Cheol Kim & Austin D. Swafford & Rob Knight & Loki Natarajan, 2022. "Utilizing stability criteria in choosing feature selection methods yields reproducible results in microbiome data," Biometrics, The International Biometric Society, vol. 78(3), pages 1155-1167, September.
    15. Márk Szuhaj & Roland Wirth & Zoltán Bagi & Gergely Maróti & Gábor Rákhely & Kornél L. Kovács, 2021. "Development of Stable Mixed Microbiota for High Yield Power to Methane Conversion," Energies, MDPI, vol. 14(21), pages 1-17, November.
    16. Manoshi S. Datta & Idan Yelin & Ori Hochwald & Imad Kassis & Liron Borenstein-Levin & Amir Kugelman & Roy Kishony, 2021. "Rapid methicillin resistance diversification in Staphylococcus epidermidis colonizing human neonates," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    17. Heleen T. Ouboter & Rob Mesman & Tom Sleutels & Jelle Postma & Martijn Wissink & Mike S. M. Jetten & Annemiek Ter Heijne & Tom Berben & Cornelia U. Welte, 2024. "Mechanisms of extracellular electron transfer in anaerobic methanotrophic archaea," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    18. Marc Schoeler & Sandrine Ellero-Simatos & Till Birkner & Jordi Mayneris-Perxachs & Lisa Olsson & Harald Brolin & Ulrike Loeber & Jamie D. Kraft & Arnaud Polizzi & Marian Martí-Navas & Josep Puig & Ant, 2023. "The interplay between dietary fatty acids and gut microbiota influences host metabolism and hepatic steatosis," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    19. Ziye Wang & Ronghui You & Haitao Han & Wei Liu & Fengzhu Sun & Shanfeng Zhu, 2024. "Effective binning of metagenomic contigs using contrastive multi-view representation learning," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    20. Harold P. Hodgins & Pengsheng Chen & Briallen Lobb & Xin Wei & Benjamin J. M. Tremblay & Michael J. Mansfield & Victoria C. Y. Lee & Pyung-Gang Lee & Jeffrey Coffin & Ana T. Duggan & Alexis E. Dolphin, 2023. "Ancient Clostridium DNA and variants of tetanus neurotoxins associated with human archaeological remains," Nature Communications, Nature, vol. 14(1), pages 1-15, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-33869-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.