IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0075448.html
   My bibliography  Save this article

NeSSM: A Next-Generation Sequencing Simulator for Metagenomics

Author

Listed:
  • Ben Jia
  • Liming Xuan
  • Kaiye Cai
  • Zhiqiang Hu
  • Liangxiao Ma
  • Chaochun Wei

Abstract

Background: Metagenomics can reveal the vast majority of microbes that have been missed by traditional cultivation-based methods. Due to its extremely wide range of application areas, fast metagenome sequencing simulation systems with high fidelity are in great demand to facilitate the development and comparison of metagenomics analysis tools. Results: We present here a customizable metagenome simulation system: NeSSM (Next-generation Sequencing Simulator for Metagenomics). Combining complete genomes currently available, a community composition table, and sequencing parameters, it can simulate metagenome sequencing better than existing systems. Sequencing error models based on the explicit distribution of errors at each base and sequencing coverage bias are incorporated in the simulation. In order to improve the fidelity of simulation, tools are provided by NeSSM to estimate the sequencing error models, sequencing coverage bias and the community composition directly from existing metagenome sequencing data. Currently, NeSSM supports single-end and pair-end sequencing for both 454 and Illumina platforms. In addition, a GPU (graphics processing units) version of NeSSM is also developed to accelerate the simulation. By comparing the simulated sequencing data from NeSSM with experimental metagenome sequencing data, we have demonstrated that NeSSM performs better in many aspects than existing popular metagenome simulators, such as MetaSim, GemSIM and Grinder. The GPU version of NeSSM is more than one-order of magnitude faster than MetaSim. Conclusions: NeSSM is a fast simulation system for high-throughput metagenome sequencing. It can be helpful to develop tools and evaluate strategies for metagenomics analysis and it’s freely available for academic users at http://cbb.sjtu.edu.cn/~ccwei/pub/software/NeSSM.php.

Suggested Citation

  • Ben Jia & Liming Xuan & Kaiye Cai & Zhiqiang Hu & Liangxiao Ma & Chaochun Wei, 2013. "NeSSM: A Next-Generation Sequencing Simulator for Metagenomics," PLOS ONE, Public Library of Science, vol. 8(10), pages 1-10, October.
  • Handle: RePEc:plo:pone00:0075448
    DOI: 10.1371/journal.pone.0075448
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0075448
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0075448&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0075448?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Gene W. Tyson & Jarrod Chapman & Philip Hugenholtz & Eric E. Allen & Rachna J. Ram & Paul M. Richardson & Victor V. Solovyev & Edward M. Rubin & Daniel S. Rokhsar & Jillian F. Banfield, 2004. "Community structure and metabolism through reconstruction of microbial genomes from the environment," Nature, Nature, vol. 428(6978), pages 37-43, March.
    2. Marvin Mundry & Erich Bornberg-Bauer & Michael Sammeth & Philine G D Feulner, 2012. "Evaluating Characteristics of De Novo Assembly Software on 454 Transcriptome Data: A Simulation Approach," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-10, February.
    3. Peng Jia & Liming Xuan & Lei Liu & Chaochun Wei, 2011. "MetaBinG: Using GPUs to Accelerate Metagenomic Sequence Classification," PLOS ONE, Public Library of Science, vol. 6(11), pages 1-5, November.
    4. Ravi K Patel & Mukesh Jain, 2012. "NGS QC Toolkit: A Toolkit for Quality Control of Next Generation Sequencing Data," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-7, February.
    5. Marcel Margulies & Michael Egholm & William E. Altman & Said Attiya & Joel S. Bader & Lisa A. Bemben & Jan Berka & Michael S. Braverman & Yi-Ju Chen & Zhoutao Chen & Scott B. Dewell & Lei Du & Joseph , 2005. "Genome sequencing in microfabricated high-density picolitre reactors," Nature, Nature, vol. 437(7057), pages 376-380, September.
    6. Dongying Wu & Philip Hugenholtz & Konstantinos Mavromatis & Rüdiger Pukall & Eileen Dalin & Natalia N. Ivanova & Victor Kunin & Lynne Goodwin & Martin Wu & Brian J. Tindall & Sean D. Hooper & Amrita P, 2009. "A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea," Nature, Nature, vol. 462(7276), pages 1056-1060, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Stephen A Stanhope, 2010. "Occupancy Modeling, Maximum Contig Size Probabilities and Designing Metagenomics Experiments," PLOS ONE, Public Library of Science, vol. 5(7), pages 1-10, July.
    2. Dongya Wu & Enhui Shen & Bowen Jiang & Yu Feng & Wei Tang & Sangting Lao & Lei Jia & Han-Yang Lin & Lingjuan Xie & Xifang Weng & Chenfeng Dong & Qinghong Qian & Feng Lin & Haiming Xu & Huabing Lu & Lu, 2022. "Genomic insights into the evolution of Echinochloa species as weed and orphan crop," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    3. Fernando Lopez-Rios & Barbara Angulo & Belen Gomez & Debbie Mair & Rebeca Martinez & Esther Conde & Felice Shieh & Jeffrey Vaks & Rachel Langland & H Jeffrey Lawrence & David Gonzalez de Castro, 2013. "Comparison of Testing Methods for the Detection of BRAF V600E Mutations in Malignant Melanoma: Pre-Approval Validation Study of the Companion Diagnostic Test for Vemurafenib," PLOS ONE, Public Library of Science, vol. 8(1), pages 1-7, January.
    4. Kelly J. Whaley-Martin & Lin-Xing Chen & Tara Colenbrander Nelson & Jennifer Gordon & Rose Kantor & Lauren E. Twible & Stephanie Marshall & Sam McGarry & Laura Rossi & Benoit Bessette & Christian Baro, 2023. "O2 partitioning of sulfur oxidizing bacteria drives acidity and thiosulfate distributions in mining waters," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    5. Jean-Sebastien Gounot & Minghao Chia & Denis Bertrand & Woei-Yuh Saw & Aarthi Ravikrishnan & Adrian Low & Yichen Ding & Amanda Hui Qi Ng & Linda Wei Lin Tan & Yik-Ying Teo & Henning Seedorf & Niranjan, 2022. "Genome-centric analysis of short and long read metagenomes reveals uncharacterized microbiome diversity in Southeast Asians," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    6. Xiaoquan Su & Weihua Pan & Baoxing Song & Jian Xu & Kang Ning, 2014. "Parallel-META 2.0: Enhanced Metagenomic Data Analysis with Functional Annotation, High Performance Computing and Advanced Visualization," PLOS ONE, Public Library of Science, vol. 9(3), pages 1-13, March.
    7. David J H F Knapp & Rachel A McGovern & Art F Y Poon & Xiaoyin Zhong & Dennison Chan & Luke C Swenson & Winnie Dong & P Richard Harrigan, 2014. "“Deep” Sequencing Accuracy and Reproducibility Using Roche/454 Technology for Inferring Co-Receptor Usage in HIV-1," PLOS ONE, Public Library of Science, vol. 9(6), pages 1-10, June.
    8. Chongqing Wen & Liyou Wu & Yujia Qin & Joy D Van Nostrand & Daliang Ning & Bo Sun & Kai Xue & Feifei Liu & Ye Deng & Yuting Liang & Jizhong Zhou, 2017. "Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform," PLOS ONE, Public Library of Science, vol. 12(4), pages 1-20, April.
    9. Abrar E Al-Shaer & George R Flentke & Mark E Berres & Ana Garic & Susan M Smith, 2019. "Exon level machine learning analyses elucidate novel candidate miRNA targets in an avian model of fetal alcohol spectrum disorder," PLOS Computational Biology, Public Library of Science, vol. 15(4), pages 1-25, April.
    10. Shibu Yooseph & Granger Sutton & Douglas B Rusch & Aaron L Halpern & Shannon J Williamson & Karin Remington & Jonathan A Eisen & Karla B Heidelberg & Gerard Manning & Weizhong Li & Lukasz Jaroszewski , 2007. "The Sorcerer II Global Ocean Sampling Expedition: Expanding the Universe of Protein Families," PLOS Biology, Public Library of Science, vol. 5(3), pages 1-35, March.
    11. Jiang Du & Robert D Bjornson & Zhengdong D Zhang & Yong Kong & Michael Snyder & Mark B Gerstein, 2009. "Integrating Sequencing Technologies in Personal Genomics: Optimal Low Cost Reconstruction of Structural Variants," PLOS Computational Biology, Public Library of Science, vol. 5(7), pages 1-15, July.
    12. Angelina Beavogui & Auriane Lacroix & Nicolas Wiart & Julie Poulain & Tom O. Delmont & Lucas Paoli & Patrick Wincker & Pedro H. Oliveira, 2024. "The defensome of complex bacterial communities," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    13. Wei Ding & Shougang Wang & Peng Qin & Shen Fan & Xiaoyan Su & Peiyan Cai & Jie Lu & Han Cui & Meng Wang & Yi Shu & Yongming Wang & Hui-Hui Fu & Yu-Zhong Zhang & Yong-Xin Li & Weipeng Zhang, 2023. "Anaerobic thiosulfate oxidation by the Roseobacter group is prevalent in marine biofilms," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    14. Natasha K. Dudek & Jesus G. Galaz-Montoya & Handuo Shi & Megan Mayer & Cristina Danita & Arianna I. Celis & Tobias Viehboeck & Gong-Her Wu & Barry Behr & Silvia Bulgheresi & Kerwyn Casey Huang & Wah C, 2023. "Previously uncharacterized rectangular bacterial structures in the dolphin mouth," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    15. Irene Stefanini & Monica Di Paola & Gianni Liti & Andrea Marranci & Federico Sebastiani & Enrico Casalone & Duccio Cavalieri, 2022. "Resistance to Arsenite and Arsenate in Saccharomyces cerevisiae Arises through the Subtelomeric Expansion of a Cluster of Yeast Genes," IJERPH, MDPI, vol. 19(13), pages 1-15, July.
    16. Mohan A V S K Katta & Aamir W Khan & Dadakhalandar Doddamani & Mahendar Thudi & Rajeev K Varshney, 2015. "NGS-QCbox and Raspberry for Parallel, Automated and Rapid Quality Control Analysis of Large-Scale Next Generation Sequencing (Illumina) Data," PLOS ONE, Public Library of Science, vol. 10(10), pages 1-9, October.
    17. Peri, Alessandro, 2020. "A hardware approach to value function iteration," Journal of Economic Dynamics and Control, Elsevier, vol. 114(C).
    18. Ágnes Becsei & Alessandro Fuschi & Saria Otani & Ravi Kant & Ilja Weinstein & Patricia Alba & József Stéger & Dávid Visontai & Christian Brinch & Miranda Graaf & Claudia M. E. Schapendonk & Antonio Ba, 2024. "Time-series sewage metagenomics distinguishes seasonal, human-derived and environmental microbial communities potentially allowing source-attributed surveillance," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    19. Luis E. Valentin-Alvarado & Kathryn E. Appler & Valerie Anda & Marie C. Schoelmerich & Jacob West-Roberts & Veronika Kivenson & Alexander Crits-Christoph & Lynn Ly & Rohan Sachdeva & Chris Greening & , 2024. "Asgard archaea modulate potential methanogenesis substrates in wetland soil," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    20. Wenxiu Wang & Weizhi Song & Marwan E. Majzoub & Xiaoyuan Feng & Bu Xu & Jianchang Tao & Yuanqing Zhu & Zhiyong Li & Pei-Yuan Qian & Nicole S. Webster & Torsten Thomas & Lu Fan, 2024. "Decoupling of strain- and intrastrain-level interactions of microbiomes in a sponge holobiont," Nature Communications, Nature, vol. 15(1), pages 1-17, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0075448. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.