IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v14y2023i1d10.1038_s41467-023-36585-y.html
   My bibliography  Save this article

RNA splicing analysis using heterogeneous and large RNA-seq datasets

Author

Listed:
  • Jorge Vaquero-Garcia

    (University of Pennsylvania)

  • Joseph K. Aicher

    (University of Pennsylvania
    Children’s Hospital of Philadelphia)

  • San Jewell

    (University of Pennsylvania)

  • Matthew R. Gazzara

    (University of Pennsylvania)

  • Caleb M. Radens

    (University of Pennsylvania)

  • Anupama Jha

    (University of Pennsylvania)

  • Scott S. Norton

    (University of Pennsylvania)

  • Nicholas F. Lahens

    (University of Pennsylvania)

  • Gregory R. Grant

    (University of Pennsylvania
    University of Pennsylvania)

  • Yoseph Barash

    (University of Pennsylvania
    University of Pennsylvania)

Abstract

The ubiquity of RNA-seq has led to many methods that use RNA-seq data to analyze variations in RNA splicing. However, available methods are not well suited for handling heterogeneous and large datasets. Such datasets scale to thousands of samples across dozens of experimental conditions, exhibit increased variability compared to biological replicates, and involve thousands of unannotated splice variants resulting in increased transcriptome complexity. We describe here a suite of algorithms and tools implemented in the MAJIQ v2 package to address challenges in detection, quantification, and visualization of splicing variations from such datasets. Using both large scale synthetic data and GTEx v8 as benchmark datasets, we assess the advantages of MAJIQ v2 compared to existing methods. We then apply MAJIQ v2 package to analyze differential splicing across 2,335 samples from 13 brain subregions, demonstrating its ability to offer insights into brain subregion-specific splicing regulation.

Suggested Citation

  • Jorge Vaquero-Garcia & Joseph K. Aicher & San Jewell & Matthew R. Gazzara & Caleb M. Radens & Anupama Jha & Scott S. Norton & Nicholas F. Lahens & Gregory R. Grant & Yoseph Barash, 2023. "RNA splicing analysis using heterogeneous and large RNA-seq datasets," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
  • Handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-36585-y
    DOI: 10.1038/s41467-023-36585-y
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-023-36585-y
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-023-36585-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Eric L. Nostrand & Peter Freese & Gabriel A. Pratt & Xiaofeng Wang & Xintao Wei & Rui Xiao & Steven M. Blue & Jia-Yu Chen & Neal A. L. Cody & Daniel Dominguez & Sara Olson & Balaji Sundararaman & Liju, 2020. "A large-scale binding and functional map of human RNA-binding proteins," Nature, Nature, vol. 583(7818), pages 711-719, July.
    2. Margaret K. R. Donovan & Agnieszka D’Antonio-Chronowska & Matteo D’Antonio & Kelly A. Frazer, 2020. "Author Correction: Cellular deconvolution of GTEx tissues powers discovery of disease and cell-type associated regulatory variants," Nature Communications, Nature, vol. 11(1), pages 1-1, December.
    3. Barry Slaff & Caleb M. Radens & Paul Jewell & Anupama Jha & Nicholas F. Lahens & Gregory R. Grant & Andrei Thomas-Tikhonenko & Kristen W. Lynch & Yoseph Barash, 2021. "MOCCASIN: a method for correcting for known and unknown confounders in RNA splicing analysis," Nature Communications, Nature, vol. 12(1), pages 1-9, December.
    4. Daniel Maticzka & Ibrahim Avsar Ilik & Tugce Aktas & Rolf Backofen & Asifa Akhtar, 2018. "uvCLAP is a fast and non-radioactive method to identify in vivo targets of RNA-binding proteins," Nature Communications, Nature, vol. 9(1), pages 1-13, December.
    5. Margaret K. R. Donovan & Agnieszka D’Antonio-Chronowska & Matteo D’Antonio & Kelly A. Frazer, 2020. "Cellular deconvolution of GTEx tissues powers discovery of disease and cell-type associated regulatory variants," Nature Communications, Nature, vol. 11(1), pages 1-14, December.
    6. Birk Diedenhofen & Jochen Musch, 2015. "cocor: A Comprehensive Solution for the Statistical Comparison of Correlations," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-12, April.
    7. Yoseph Barash & John A. Calarco & Weijun Gao & Qun Pan & Xinchen Wang & Ofer Shai & Benjamin J. Blencowe & Brendan J. Frey, 2010. "Deciphering the splicing code," Nature, Nature, vol. 465(7294), pages 53-59, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Aditya Kshirsagar & Svetlana Maslov Doroshev & Anna Gorelik & Tsviya Olender & Tamar Sapir & Daisuke Tsuboi & Irit Rosenhek-Goldian & Sergey Malitsky & Maxim Itkin & Amir Argoetti & Yael Mandel-Gutfre, 2023. "LIS1 RNA-binding orchestrates the mechanosensitive properties of embryonic stem cells in AGO2-dependent and independent ways," Nature Communications, Nature, vol. 14(1), pages 1-21, December.
    2. Paul Little & Si Liu & Vasyl Zhabotynsky & Yun Li & Dan-Yu Lin & Wei Sun, 2023. "A computational method for cell type-specific expression quantitative trait loci mapping using bulk RNA-seq data," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    3. Ryo Yamamoto & Ryan Chung & Juan Manuel Vazquez & Huanjie Sheng & Philippa L. Steinberg & Nilah M. Ioannidis & Peter H. Sudmant, 2022. "Tissue-specific impacts of aging and genetics on gene expression patterns in humans," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    4. Xiangbin Ruan & Kaining Hu & Xiaochang Zhang, 2023. "PIE-seq: identifying RNA-binding protein targets by dual RNA-deaminase editing and sequencing," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    5. Matteo D’Antonio & Jennifer P. Nguyen & Timothy D. Arthur & Hiroko Matsui & Agnieszka D’Antonio-Chronowska & Kelly A. Frazer, 2023. "Fine mapping spatiotemporal mechanisms of genetic variants underlying cardiac traits and disease," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    6. Yocelyn Recinos & Dmytro Ustianenko & Yow-Tyng Yeh & Xiaojian Wang & Martin Jacko & Lekha V. Yesantharao & Qiyang Wu & Chaolin Zhang, 2024. "CRISPR-dCas13d-based deep screening of proximal and distal splicing-regulatory elements," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    7. Xiaoyu Song & Jiayi Ji & Joseph H. Rothstein & Stacey E. Alexeeff & Lori C. Sakoda & Adriana Sistig & Ninah Achacoso & Eric Jorgenson & Alice S. Whittemore & Robert J. Klein & Laurel A. Habel & Pei Wa, 2023. "MiXcan: a framework for cell-type-aware transcriptome-wide association studies with an application to breast cancer," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    8. Ridvan Eksi & Hong-Dong Li & Rajasree Menon & Yuchen Wen & Gilbert S Omenn & Matthias Kretzler & Yuanfang Guan, 2013. "Systematically Differentiating Functions for Alternatively Spliced Isoforms through Integrating RNA-seq Data," PLOS Computational Biology, Public Library of Science, vol. 9(11), pages 1-16, November.
    9. Anthony Evans & Willem Sleegers & Žan Mlakar, 2020. "Individual differences in receptivity to scientific bullshit," Judgment and Decision Making, Society for Judgment and Decision Making, vol. 15(3), pages 401-412, May.
    10. Walters, William H., 2017. "Do subjective journal ratings represent whole journals or typical articles? Unweighted or weighted citation impact?," Journal of Informetrics, Elsevier, vol. 11(3), pages 730-744.
    11. Haofan Sun & Bin Fu & Xiaohong Qian & Ping Xu & Weijie Qin, 2024. "Nuclear and cytoplasmic specific RNA binding proteome enrichment and its changes upon ferroptosis induction," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    12. Kevin Handtke & Lisa Richter-Beuschel & Susanne Bögeholz, 2022. "Self-Efficacy Beliefs of Teaching ESD: A Theory-Driven Instrument and the Effectiveness of ESD in German Teacher Education," Sustainability, MDPI, vol. 14(11), pages 1-32, May.
    13. Robert Stojnic & Audrey Qiuyan Fu & Boris Adryan, 2012. "A Graphical Modelling Approach to the Dissection of Highly Correlated Transcription Factor Binding Site Profiles," PLOS Computational Biology, Public Library of Science, vol. 8(11), pages 1-13, November.
    14. Johanna Luige & Alexandros Armaos & Gian Gaetano Tartaglia & Ulf Andersson Vang Ørom, 2024. "Predicting nuclear G-quadruplex RNA-binding proteins with roles in transcription and phase separation," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    15. Areum Han & Peter Stoilov & Anthony J Linares & Yu Zhou & Xiang-Dong Fu & Douglas L Black, 2014. "De Novo Prediction of PTBP1 Binding and Splicing Targets Reveals Unexpected Features of Its RNA Recognition and Function," PLOS Computational Biology, Public Library of Science, vol. 10(1), pages 1-18, January.
    16. Les Sikos & Noortje J Venhuizen & Heiner Drenhaus & Matthew W Crocker, 2021. "Reevaluating pragmatic reasoning in language games," PLOS ONE, Public Library of Science, vol. 16(3), pages 1-33, March.
    17. Oliver J. Ziff & Jacob Neeves & Jamie Mitchell & Giulia Tyzack & Carlos Martinez-Ruiz & Raphaelle Luisier & Anob M. Chakrabarti & Nicholas McGranahan & Kevin Litchfield & Simon J. Boulton & Ammar Al-C, 2023. "Integrated transcriptome landscape of ALS identifies genome instability linked to TDP-43 pathology," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    18. Zili Song & Shuang Zhou & Hongjiao Zhang & Nancy P. Keller & Berl R. Oakley & Xiao Liu & Wen-Bing Yin, 2023. "Fungal secondary metabolism is governed by an RNA-binding protein CsdA/RsdA complex," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    19. Wei Hu & Yangjun Wu & Qili Shi & Jingni Wu & Deping Kong & Xiaohua Wu & Xianghuo He & Teng Liu & Shengli Li, 2022. "Systematic characterization of cancer transcriptome at transcript resolution," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    20. David Wang & Mathieu Quesnel-Vallieres & San Jewell & Moein Elzubeir & Kristen Lynch & Andrei Thomas-Tikhonenko & Yoseph Barash, 2023. "A Bayesian model for unsupervised detection of RNA splicing based subtypes in cancers," Nature Communications, Nature, vol. 14(1), pages 1-15, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-36585-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.