IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v13y2022i1d10.1038_s41467-022-29843-y.html
   My bibliography  Save this article

A deep siamese neural network improves metagenome-assembled genomes in microbiome datasets across different environments

Author

Listed:
  • Shaojun Pan

    (Fudan University
    Ministry of Education)

  • Chengkai Zhu

    (Fudan University
    Ministry of Education
    Fudan University)

  • Xing-Ming Zhao

    (Fudan University
    Ministry of Education
    Fudan University
    Zhangjiang Fudan International Innovation Center)

  • Luis Pedro Coelho

    (Fudan University
    Ministry of Education)

Abstract

Metagenomic binning is the step in building metagenome-assembled genomes (MAGs) when sequences predicted to originate from the same genome are automatically grouped together. The most widely-used methods for binning are reference-independent, operating de novo and enable the recovery of genomes from previously unsampled clades. However, they do not leverage the knowledge in existing databases. Here, we introduce SemiBin, an open source tool that uses deep siamese neural networks to implement a semi-supervised approach, i.e. SemiBin exploits the information in reference genomes, while retaining the capability of reconstructing high-quality bins that are outside the reference dataset. Using simulated and real microbiome datasets from several different habitats from GMGCv1 (Global Microbial Gene Catalog), including the human gut, non-human guts, and environmental habitats (ocean and soil), we show that SemiBin outperforms existing state-of-the-art binning methods. In particular, compared to other methods, SemiBin returns more high-quality bins with larger taxonomic diversity, including more distinct genera and species.

Suggested Citation

  • Shaojun Pan & Chengkai Zhu & Xing-Ming Zhao & Luis Pedro Coelho, 2022. "A deep siamese neural network improves metagenome-assembled genomes in microbiome datasets across different environments," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
  • Handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-29843-y
    DOI: 10.1038/s41467-022-29843-y
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-022-29843-y
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-022-29843-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Chirag Jain & Luis M. Rodriguez-R & Adam M. Phillippy & Konstantinos T. Konstantinidis & Srinivas Aluru, 2018. "High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries," Nature Communications, Nature, vol. 9(1), pages 1-8, December.
    2. Junjie Qin & Ruiqiang Li & Jeroen Raes & Manimozhiyan Arumugam & Kristoffer Solvsten Burgdorf & Chaysavanh Manichanh & Trine Nielsen & Nicolas Pons & Florence Levenez & Takuji Yamada & Daniel R. Mende, 2010. "A human gut microbial gene catalogue established by metagenomic sequencing," Nature, Nature, vol. 464(7285), pages 59-65, March.
    3. Luis Pedro Coelho & Renato Alves & Álvaro Rodríguez Río & Pernille Neve Myers & Carlos P. Cantalapiedra & Joaquín Giner-Lamia & Thomas Sebastian Schmidt & Daniel R. Mende & Askarbek Orakov & Ivica Let, 2022. "Towards the biogeography of prokaryotic genes," Nature, Nature, vol. 601(7892), pages 252-256, January.
    4. Erica C. Pehrsson & Pablo Tsukayama & Sanket Patel & Melissa Mejía-Bautista & Giordano Sosa-Soto & Karla M. Navarrete & Maritza Calderon & Lilia Cabrera & William Hoyos-Arango & M. Teresita Bertoli & , 2016. "Interconnected microbiomes and resistomes in low-income human habitats," Nature, Nature, vol. 533(7602), pages 212-216, May.
    5. Alexandre Almeida & Alex L. Mitchell & Miguel Boland & Samuel C. Forster & Gregory B. Gloor & Aleksandra Tarkowska & Trevor D. Lawley & Robert D. Finn, 2019. "A new genomic blueprint of the human gut microbiota," Nature, Nature, vol. 568(7753), pages 499-504, April.
    6. Frederik Schulz & Simon Roux & David Paez-Espino & Sean Jungbluth & David A. Walsh & Vincent J. Denef & Katherine D. McMahon & Konstantinos T. Konstantinidis & Emiley A. Eloe-Fadrosh & Nikos C. Kyrpid, 2020. "Giant virus diversity and host interactions through global metagenomics," Nature, Nature, vol. 578(7795), pages 432-436, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yuxuan Du & Fengzhu Sun, 2023. "MetaCC allows scalable and integrative analyses of both long-read and short-read metagenomic Hi-C data," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    2. Ziye Wang & Ronghui You & Haitao Han & Wei Liu & Fengzhu Sun & Shanfeng Zhu, 2024. "Effective binning of metagenomic contigs using contrastive multi-view representation learning," Nature Communications, Nature, vol. 15(1), pages 1-14, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xiyang Dong & Yongyi Peng & Muhua Wang & Laura Woods & Wenxue Wu & Yong Wang & Xi Xiao & Jiwei Li & Kuntong Jia & Chris Greening & Zongze Shao & Casey R. J. Hubert, 2023. "Evolutionary ecology of microbial populations inhabiting deep sea sediments associated with cold seeps," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    2. Xuanji Li & Asker Brejnrod & Jonathan Thorsen & Trine Zachariasen & Urvish Trivedi & Jakob Russel & Gisle Alberg Vestergaard & Jakob Stokholm & Morten Arendt Rasmussen & Søren Johannes Sørensen, 2023. "Differential responses of the gut microbiome and resistome to antibiotic exposures in infants and adults," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    3. Chiranjib Chakraborty & Ashish Ranjan Sharma & Garima Sharma & Manojit Bhattacharya & Sang-Soo Lee, 2023. "Exploring the status of global terrestrial and aquatic microbial diversity through ‘Biodiversity Informatics’," Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, Springer, vol. 25(10), pages 10567-10598, October.
    4. Fiona B. Tamburini & Dylan Maghini & Ovokeraye H. Oduaran & Ryan Brewster & Michaella R. Hulley & Venesa Sahibdeen & Shane A. Norris & Stephen Tollman & Kathleen Kahn & Ryan G. Wagner & Alisha N. Wade, 2022. "Short- and long-read metagenomics of urban and rural South African gut microbiomes reveal a transitional composition and undescribed taxa," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    5. Georges P. Schmartz & Jacqueline Rehner & Madline P. Gund & Verena Keller & Leidy-Alejandra G. Molano & Stefan Rupf & Matthias Hannig & Tim Berger & Elias Flockerzi & Berthold Seitz & Sara Fleser & Sa, 2024. "Decoding the diagnostic and therapeutic potential of microbiota using pan-body pan-disease microbiomics," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    6. Shuqin Zeng & Dhrati Patangia & Alexandre Almeida & Zhemin Zhou & Dezhi Mu & R. Paul Ross & Catherine Stanton & Shaopu Wang, 2022. "A compendium of 32,277 metagenome-assembled genomes and over 80 million genes from the early-life human gut microbiome," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    7. Mingyue Cheng & Shuai Luo & Peng Zhang & Guangzhou Xiong & Kai Chen & Chuanqi Jiang & Fangdian Yang & Hanhui Huang & Pengshuo Yang & Guanxi Liu & Yuhao Zhang & Sang Ba & Ping Yin & Jie Xiong & Wei Mia, 2024. "A genome and gene catalog of the aquatic microbiomes of the Tibetan Plateau," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    8. Sigal Leviatan & Saar Shoer & Daphna Rothschild & Maria Gorodetski & Eran Segal, 2022. "An expanded reference map of the human gut microbiome reveals hundreds of previously unknown species," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    9. Eric W. Seabloom & Maria C. Caldeira & Kendi F. Davies & Linda Kinkel & Johannes M. H. Knops & Kimberly J. Komatsu & Andrew S. MacDougall & Georgiana May & Michael Millican & Joslin L. Moore & Luis I., 2023. "Globally consistent response of plant microbiome diversity across hosts and continents to soil nutrients and herbivores," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    10. Lulu Li & Hehong Zhang & Zihang Yang & Chen Wang & Shanshan Li & Chen Cao & Tongsong Yao & Zhongyan Wei & Yanjun Li & Jianping Chen & Zongtao Sun, 2022. "Independently evolved viral effectors convergently suppress DELLA protein SLR1-mediated broad-spectrum antiviral immunity in rice," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    11. Jianhua Wang & Guan-Zhu Han, 2023. "Genome mining shows that retroviruses are pervasively invading vertebrate genomes," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    12. Li Zhang & Karen R. Jonscher & Zuyuan Zhang & Yi Xiong & Ryan S. Mueller & Jacob E. Friedman & Chongle Pan, 2022. "Islet autoantibody seroconversion in type-1 diabetes is associated with metagenome-assembled genomes in infant gut microbiomes," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    13. Jae-Chang Cho, 2021. "Human microbiome privacy risks associated with summary statistics," PLOS ONE, Public Library of Science, vol. 16(4), pages 1-11, April.
    14. Max E. Schön & Vasily V. Zlatogursky & Rohan P. Singh & Camille Poirier & Susanne Wilken & Varsha Mathur & Jürgen F. H. Strassert & Jarone Pinhassi & Alexandra Z. Worden & Patrick J. Keeling & Thijs J, 2021. "Single cell genomics reveals plastid-lacking Picozoa are close relatives of red algae," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    15. Alexandra Bessenay & Hugo Bisio & Lucid Belmudes & Yohann Couté & Lionel Bertaux & Jean-Michel Claverie & Chantal Abergel & Sandra Jeudy & Matthieu Legendre, 2024. "Complex transcriptional regulations of a hyperparasitic quadripartite system in giant viruses infecting protists," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    16. Ashwag Shami & Rewaa S. Jalal & Ruba A. Ashy & Haneen W. Abuauf & Lina Baz & Mohammed Y. Refai & Aminah A. Barqawi & Hanadi M. Baeissa & Manal A. Tashkandi & Sahar Alshareef & Aala A. Abulfaraj, 2022. "Use of Metagenomic Whole Genome Shotgun Sequencing Data in Taxonomic Assignment of Dipterygium glaucum Rhizosphere and Surrounding Bulk Soil Microbiomes, and Their Response to Watering," Sustainability, MDPI, vol. 14(14), pages 1-21, July.
    17. Manish Boolchandani & Kevin S. Blake & Drake H. Tilley & Miguel M. Cabada & Drew J. Schwartz & Sanket Patel & Maria Luisa Morales & Rina Meza & Giselle Soto & Sandra D. Isidean & Chad K. Porter & Mark, 2022. "Impact of international travel and diarrhea on gut microbiome and resistome dynamics," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    18. Nenad Macesic & Jane Hawkey & Ben Vezina & Jessica A. Wisniewski & Hugh Cottingham & Luke V. Blakeway & Taylor Harshegyi & Katherine Pragastis & Gnei Zweena Badoordeen & Amanda Dennison & Denis W. Spe, 2023. "Genomic dissection of endemic carbapenem resistance reveals metallo-beta-lactamase dissemination through clonal, plasmid and integron transfer," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    19. Minghui Cheng & Yingjie Xu & Xiao Cui & Xin Wei & Yundi Chang & Jun Xu & Cheng Lei & Lei Xue & Yifan Zheng & Zhang Wang & Lingtong Huang & Min Zheng & Hong Luo & Yuxin Leng & Chao Jiang, 2024. "Deep longitudinal lower respiratory tract microbiome profiling reveals genome-resolved functional and evolutionary dynamics in critical illness," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    20. Xinzhu Yi & Jie-Liang Liang & Ping Wen & Pu Jia & Shi-wei Feng & Shen-yan Liu & Yuan-yue Zhuang & Yu-qian Guo & Jing-li Lu & Sheng-ji Zhong & Bin Liao & Zhang Wang & Wen-sheng Shu & Jin-tian Li, 2024. "Giant viruses as reservoirs of antibiotic resistance genes," Nature Communications, Nature, vol. 15(1), pages 1-15, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-29843-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.