
Comprehensive assessment of mRNA isoform detection methods for long-read sequencing data

Author

Listed:
  • Yaqi Su

    (Zhejiang University; University of California)

  • Zhejian Yu

    (Zhejiang University)

  • Siqian Jin

    (Zhejiang University)

  • Zhipeng Ai

    (Zhejiang University)

  • Ruihong Yuan

    (Zhejiang University)

  • Xinyi Chen

    (Zhejiang University)

  • Ziwei Xue

    (Zhejiang University)

  • Yixin Guo

    (Zhejiang University)

  • Di Chen

    (Zhejiang University)

  • Hongqing Liang

    (Zhejiang University)

  • Zuozhu Liu

    (Zhejiang University)

  • Wanlu Liu

    (Zhejiang University)

Abstract

Advances in long-read sequencing (LRS) techniques have increased read lengths to several kilobases, facilitating the identification of alternative splicing events and isoform expression. Numerous computational tools for isoform detection from long-read sequencing data have recently been developed. Nevertheless, comparative studies that systematically evaluate the performance of these tools, which implement different algorithms, under simulations covering the potential influencing factors are still lacking. In this study, we conducted a benchmark analysis of thirteen methods implemented in nine tools capable of identifying isoform structures from long-read RNA-seq data. We evaluated their performance using simulated data generated by an in-house simulator to represent diverse sequencing platforms, RNA sequins (sequencing spike-ins) data, and experimental data. Our findings show that IsoQuant is a highly effective tool for isoform detection with LRS, with Bambu and StringTie2 also exhibiting strong performance. These results offer valuable guidance for future research on alternative splicing analysis and the ongoing improvement of tools for isoform detection using LRS data.

Suggested Citation

  • Yaqi Su & Zhejian Yu & Siqian Jin & Zhipeng Ai & Ruihong Yuan & Xinyi Chen & Ziwei Xue & Yixin Guo & Di Chen & Hongqing Liang & Zuozhu Liu & Wanlu Liu, 2024. "Comprehensive assessment of mRNA isoform detection methods for long-read sequencing data," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-48117-3
    DOI: 10.1038/s41467-024-48117-3

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-48117-3
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-48117-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    References listed on IDEAS

    1. Eric T. Wang & Rickard Sandberg & Shujun Luo & Irina Khrebtukova & Lu Zhang & Christine Mayr & Stephen F. Kingsmore & Gary P. Schroth & Christopher B. Burge, 2008. "Alternative isoform regulation in human tissue transcriptomes," Nature, Nature, vol. 456(7221), pages 470-476, November.
    2. Xinyu Xiang & Yu Tao & Jonathan DiRusso & Fei-Man Hsu & Jinchun Zhang & Ziwei Xue & Julien Pontis & Didier Trono & Wanlu Liu & Amander T. Clark, 2022. "Human reproduction is regulated by retrotransposons derived from ancient Hominidae-specific viral infections," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    3. Yuchao Xia & Zijie Jin & Chengsheng Zhang & Linkun Ouyang & Yuhao Dong & Juan Li & Lvze Guo & Biyang Jing & Yang Shi & Susheng Miao & Ruibin Xi, 2023. "TAGET: a toolkit for analyzing full-length transcripts from long-read sequencing," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    4. Chenchen Zhu & Jingyan Wu & Han Sun & Francesca Briganti & Benjamin Meder & Wu Wei & Lars M. Steinmetz, 2021. "Single-molecule, full-length transcript isoform sequencing reveals disease-associated RNA isoforms in cardiomyocytes," Nature Communications, Nature, vol. 12(1), pages 1-9, December.
    5. Feng Yue & Yong Cheng & Alessandra Breschi & Jeff Vierstra & Weisheng Wu & Tyrone Ryba & Richard Sandstrom & Zhihai Ma & Carrie Davis & Benjamin D. Pope & Yin Shen & Dmitri D. Pervouchine & Sarah Djeb, 2014. "A comparative encyclopedia of DNA elements in the mouse genome," Nature, Nature, vol. 515(7527), pages 355-364, November.
    6. Alison D. Tang & Cameron M. Soulette & Marijke J. van Baren & Kevyn Hart & Eva Hrabeta-Robinson & Catherine J. Wu & Angela N. Brooks, 2020. "Full-length transcript characterization of SF3B1 mutation in chronic lymphocytic leukemia reveals downregulation of retained introns," Nature Communications, Nature, vol. 11(1), pages 1-12, December.

    Citations



    Cited by:

    1. Nicholas C. Gervais & Rebecca S. Shapiro, 2024. "Discovering the hidden function in fungal genomes," Nature Communications, Nature, vol. 15(1), pages 1-12, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jun Inamo & Akari Suzuki & Mahoko Takahashi Ueda & Kensuke Yamaguchi & Hiroshi Nishida & Katsuya Suzuki & Yuko Kaneko & Tsutomu Takeuchi & Hiroaki Hatano & Kazuyoshi Ishigaki & Yasushi Ishihama & Kazu, 2024. "Long-read sequencing for 29 immune cell subsets reveals disease-linked isoforms," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    2. Wei Hu & Yangjun Wu & Qili Shi & Jingni Wu & Deping Kong & Xiaohua Wu & Xianghuo He & Teng Liu & Shengli Li, 2022. "Systematic characterization of cancer transcriptome at transcript resolution," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    3. Gustavo Glusman & Juan Caballero & Max Robinson & Burak Kutlu & Leroy Hood, 2013. "Optimal Scaling of Digital Transcriptomes," PLOS ONE, Public Library of Science, vol. 8(11), pages 1-12, November.
    4. Xiaohong Li & Guy N Brock & Eric C Rouchka & Nigel G F Cooper & Dongfeng Wu & Timothy E O’Toole & Ryan S Gill & Abdallah M Eteleeb & Liz O’Brien & Shesh N Rai, 2017. "A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data," PLOS ONE, Public Library of Science, vol. 12(5), pages 1-22, May.
    5. Areum Han & Peter Stoilov & Anthony J Linares & Yu Zhou & Xiang-Dong Fu & Douglas L Black, 2014. "De Novo Prediction of PTBP1 Binding and Splicing Targets Reveals Unexpected Features of Its RNA Recognition and Function," PLOS Computational Biology, Public Library of Science, vol. 10(1), pages 1-18, January.
    6. Judith A Potashkin & Jose A Santiago & Bernard M Ravina & Arthur Watts & Alexey A Leontovich, 2012. "Biosignatures for Parkinson’s Disease and Atypical Parkinsonian Disorders Patients," PLOS ONE, Public Library of Science, vol. 7(8), pages 1-13, August.
    7. Mario Ivanković & Jeremias N. Brand & Luca Pandolfini & Thomas Brown & Martin Pippel & Andrei Rozanski & Til Schubert & Markus A. Grohme & Sylke Winkler & Laura Robledillo & Meng Zhang & Azzurra Codin, 2024. "A comparative analysis of planarian genomes reveals regulatory conservation in the face of rapid structural divergence," Nature Communications, Nature, vol. 15(1), pages 1-21, December.
    8. Tulsi Patel & Jennifer Hammelman & Siaresh Aziz & Sumin Jang & Michael Closser & Theodore L. Michaels & Jacob A. Blum & David K. Gifford & Hynek Wichterle, 2022. "Transcriptional dynamics of murine motor neuron maturation in vivo and in vitro," Nature Communications, Nature, vol. 13(1), pages 1-20, December.
    9. Hongchun Lin & Hui Peng & Yuxiang Sun & Meijun Si & Jiao Wu & Yanlin Wang & Sandhya S. Thomas & Zheng Sun & Zhaoyong Hu, 2023. "Reprogramming of cis-regulatory networks during skeletal muscle atrophy in male mice," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    10. Jianfei Hu & Eli Boritz & William Wylie & Daniel C Douek, 2017. "Stochastic principles governing alternative splicing of RNA," PLOS Computational Biology, Public Library of Science, vol. 13(9), pages 1-20, September.
    11. Hillary M. Heiling & Douglas R. Wilson & Naim U. Rashid & Wei Sun & Joseph G. Ibrahim, 2023. "Estimating cell type composition using isoform expression one gene at a time," Biometrics, The International Biometric Society, vol. 79(2), pages 854-865, June.
    12. Hanyong Jin & Ji-Hyun Yeom & Eunkyoung Shin & Yoonjie Ha & Haifeng Liu & Daeyoung Kim & Minju Joo & Yong-Hak Kim & Hak Kyun Kim & Minkyung Ryu & Hong-Man Kim & Jeongkyu Kim & Keun P. Kim & Yoonsoo Hah, 2024. "5′-tRNAGly(GCC) halves generated by IRE1α are linked to the ER stress response," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    13. Seungjae Lee & Yen-Chung Chen & Austin E. Gillen & J. Matthew Taliaferro & Bart Deplancke & Hongjie Li & Eric C. Lai, 2022. "Diverse cell-specific patterns of alternative polyadenylation in Drosophila," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    14. Yue Yuan & Qiang Huo & Ziru Zhang & Qun Wang & Juanxia Wang & Shuaikang Chang & Peng Cai & Karen M. Song & David W. Galbraith & Weixiao Zhang & Long Huang & Rentao Song & Zeyang Ma, 2024. "Decoding the gene regulatory network of endosperm differentiation in maize," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    15. Wei Sun & Yufeng Liu & James J. Crowley & Ting-Huei Chen & Hua Zhou & Haitao Chu & Shunping Huang & Pei-Fen Kuan & Yuan Li & Darla Miller & Ginger Shaw & Yichao Wu & Vasyl Zhabotynsky & Leonard McMill, 2015. "IsoDOT Detects Differential RNA-Isoform Expression/Usage With Respect to a Categorical or Continuous Covariate With High Sensitivity and Specificity," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 975-986, September.
    16. Justin Bo-Kai Hsu & Neil Arvin Bretaña & Tzong-Yi Lee & Hsien-Da Huang, 2011. "Incorporating Evolutionary Information and Functional Domains for Identifying RNA Splicing Factors in Humans," PLOS ONE, Public Library of Science, vol. 6(11), pages 1-11, November.
    17. Stacey D Wagner & Adam J Struck & Riti Gupta & Dylan R Farnsworth & Amy E Mahady & Katy Eichinger & Charles A Thornton & Eric T Wang & J Andrew Berglund, 2016. "Dose-Dependent Regulation of Alternative Splicing by MBNL Proteins Reveals Biomarkers for Myotonic Dystrophy," PLOS Genetics, Public Library of Science, vol. 12(9), pages 1-24, September.
    18. Sepideh Tavakoli & Mohammad Nabizadeh & Amr Makhamreh & Howard Gamper & Caroline A. McCormick & Neda K. Rezapour & Ya-Ming Hou & Meni Wanunu & Sara H. Rouhanifard, 2023. "Semi-quantitative detection of pseudouridine modifications and type I/II hypermodifications in human mRNAs using direct long-read sequencing," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    19. Christopher G Bell & Sarah Finer & Cecilia M Lindgren & Gareth A Wilson & Vardhman K Rakyan & Andrew E Teschendorff & Pelin Akan & Elia Stupka & Thomas A Down & Inga Prokopenko & Ian M Morison & Jonat, 2010. "Integrated Genetic and Epigenetic Analysis Identifies Haplotype-Specific Methylation in the FTO Type 2 Diabetes and Obesity Susceptibility Locus," PLOS ONE, Public Library of Science, vol. 5(11), pages 1-12, November.
    20. Yuchao Xia & Zijie Jin & Chengsheng Zhang & Linkun Ouyang & Yuhao Dong & Juan Li & Lvze Guo & Biyang Jing & Yang Shi & Susheng Miao & Ruibin Xi, 2023. "TAGET: a toolkit for analyzing full-length transcripts from long-read sequencing," Nature Communications, Nature, vol. 14(1), pages 1-12, December.

