IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v14y2023i1d10.1038_s41467-023-36585-y.html
   My bibliography  Save this article

RNA splicing analysis using heterogeneous and large RNA-seq datasets

Author

Listed:
  • Jorge Vaquero-Garcia

    (University of Pennsylvania)

  • Joseph K. Aicher

    (University of Pennsylvania
    Children’s Hospital of Philadelphia)

  • San Jewell

    (University of Pennsylvania)

  • Matthew R. Gazzara

    (University of Pennsylvania)

  • Caleb M. Radens

    (University of Pennsylvania)

  • Anupama Jha

    (University of Pennsylvania)

  • Scott S. Norton

    (University of Pennsylvania)

  • Nicholas F. Lahens

    (University of Pennsylvania)

  • Gregory R. Grant

    (University of Pennsylvania
    University of Pennsylvania)

  • Yoseph Barash

    (University of Pennsylvania
    University of Pennsylvania)

Abstract

The ubiquity of RNA-seq has led to many methods that use RNA-seq data to analyze variations in RNA splicing. However, available methods are not well suited for handling heterogeneous and large datasets. Such datasets scale to thousands of samples across dozens of experimental conditions, exhibit increased variability compared to biological replicates, and involve thousands of unannotated splice variants resulting in increased transcriptome complexity. We describe here a suite of algorithms and tools implemented in the MAJIQ v2 package to address challenges in detection, quantification, and visualization of splicing variations from such datasets. Using both large scale synthetic data and GTEx v8 as benchmark datasets, we assess the advantages of MAJIQ v2 compared to existing methods. We then apply MAJIQ v2 package to analyze differential splicing across 2,335 samples from 13 brain subregions, demonstrating its ability to offer insights into brain subregion-specific splicing regulation.

Suggested Citation

  • Jorge Vaquero-Garcia & Joseph K. Aicher & San Jewell & Matthew R. Gazzara & Caleb M. Radens & Anupama Jha & Scott S. Norton & Nicholas F. Lahens & Gregory R. Grant & Yoseph Barash, 2023. "RNA splicing analysis using heterogeneous and large RNA-seq datasets," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
  • Handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-36585-y
    DOI: 10.1038/s41467-023-36585-y
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-023-36585-y
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-023-36585-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Barry Slaff & Caleb M. Radens & Paul Jewell & Anupama Jha & Nicholas F. Lahens & Gregory R. Grant & Andrei Thomas-Tikhonenko & Kristen W. Lynch & Yoseph Barash, 2021. "MOCCASIN: a method for correcting for known and unknown confounders in RNA splicing analysis," Nature Communications, Nature, vol. 12(1), pages 1-9, December.
    2. Birk Diedenhofen & Jochen Musch, 2015. "cocor: A Comprehensive Solution for the Statistical Comparison of Correlations," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-12, April.
    3. Yoseph Barash & John A. Calarco & Weijun Gao & Qun Pan & Xinchen Wang & Ofer Shai & Benjamin J. Blencowe & Brendan J. Frey, 2010. "Deciphering the splicing code," Nature, Nature, vol. 465(7294), pages 53-59, May.
    4. Eric L. Nostrand & Peter Freese & Gabriel A. Pratt & Xiaofeng Wang & Xintao Wei & Rui Xiao & Steven M. Blue & Jia-Yu Chen & Neal A. L. Cody & Daniel Dominguez & Sara Olson & Balaji Sundararaman & Liju, 2020. "A large-scale binding and functional map of human RNA-binding proteins," Nature, Nature, vol. 583(7818), pages 711-719, July.
    5. Daniel Maticzka & Ibrahim Avsar Ilik & Tugce Aktas & Rolf Backofen & Asifa Akhtar, 2018. "uvCLAP is a fast and non-radioactive method to identify in vivo targets of RNA-binding proteins," Nature Communications, Nature, vol. 9(1), pages 1-13, December.
    6. Margaret K. R. Donovan & Agnieszka D’Antonio-Chronowska & Matteo D’Antonio & Kelly A. Frazer, 2020. "Author Correction: Cellular deconvolution of GTEx tissues powers discovery of disease and cell-type associated regulatory variants," Nature Communications, Nature, vol. 11(1), pages 1-1, December.
    7. Margaret K. R. Donovan & Agnieszka D’Antonio-Chronowska & Matteo D’Antonio & Kelly A. Frazer, 2020. "Cellular deconvolution of GTEx tissues powers discovery of disease and cell-type associated regulatory variants," Nature Communications, Nature, vol. 11(1), pages 1-14, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xiangbin Ruan & Kaining Hu & Xiaochang Zhang, 2023. "PIE-seq: identifying RNA-binding protein targets by dual RNA-deaminase editing and sequencing," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    2. Matteo D’Antonio & Jennifer P. Nguyen & Timothy D. Arthur & Hiroko Matsui & Agnieszka D’Antonio-Chronowska & Kelly A. Frazer, 2023. "Fine mapping spatiotemporal mechanisms of genetic variants underlying cardiac traits and disease," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    3. Aditya Kshirsagar & Svetlana Maslov Doroshev & Anna Gorelik & Tsviya Olender & Tamar Sapir & Daisuke Tsuboi & Irit Rosenhek-Goldian & Sergey Malitsky & Maxim Itkin & Amir Argoetti & Yael Mandel-Gutfre, 2023. "LIS1 RNA-binding orchestrates the mechanosensitive properties of embryonic stem cells in AGO2-dependent and independent ways," Nature Communications, Nature, vol. 14(1), pages 1-21, December.
    4. Paul Little & Si Liu & Vasyl Zhabotynsky & Yun Li & Dan-Yu Lin & Wei Sun, 2023. "A computational method for cell type-specific expression quantitative trait loci mapping using bulk RNA-seq data," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    5. Yocelyn Recinos & Dmytro Ustianenko & Yow-Tyng Yeh & Xiaojian Wang & Martin Jacko & Lekha V. Yesantharao & Qiyang Wu & Chaolin Zhang, 2024. "CRISPR-dCas13d-based deep screening of proximal and distal splicing-regulatory elements," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    6. Ryo Yamamoto & Ryan Chung & Juan Manuel Vazquez & Huanjie Sheng & Philippa L. Steinberg & Nilah M. Ioannidis & Peter H. Sudmant, 2022. "Tissue-specific impacts of aging and genetics on gene expression patterns in humans," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    7. Xiaoyu Song & Jiayi Ji & Joseph H. Rothstein & Stacey E. Alexeeff & Lori C. Sakoda & Adriana Sistig & Ninah Achacoso & Eric Jorgenson & Alice S. Whittemore & Robert J. Klein & Laurel A. Habel & Pei Wa, 2023. "MiXcan: a framework for cell-type-aware transcriptome-wide association studies with an application to breast cancer," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    8. Ridvan Eksi & Hong-Dong Li & Rajasree Menon & Yuchen Wen & Gilbert S Omenn & Matthias Kretzler & Yuanfang Guan, 2013. "Systematically Differentiating Functions for Alternatively Spliced Isoforms through Integrating RNA-seq Data," PLOS Computational Biology, Public Library of Science, vol. 9(11), pages 1-16, November.
    9. Anthony Evans & Willem Sleegers & Žan Mlakar, 2020. "Individual differences in receptivity to scientific bullshit," Judgment and Decision Making, Society for Judgment and Decision Making, vol. 15(3), pages 401-412, May.
    10. Walters, William H., 2017. "Do subjective journal ratings represent whole journals or typical articles? Unweighted or weighted citation impact?," Journal of Informetrics, Elsevier, vol. 11(3), pages 730-744.
    11. Matvei Khoroshkin & Andrey Buyan & Martin Dodel & Albertas Navickas & Johnny Yu & Fathima Trejo & Anthony Doty & Rithvik Baratam & Shaopu Zhou & Sean B. Lee & Tanvi Joshi & Kristle Garcia & Benedict C, 2024. "Systematic identification of post-transcriptional regulatory modules," Nature Communications, Nature, vol. 15(1), pages 1-21, December.
    12. Haofan Sun & Bin Fu & Xiaohong Qian & Ping Xu & Weijie Qin, 2024. "Nuclear and cytoplasmic specific RNA binding proteome enrichment and its changes upon ferroptosis induction," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    13. Kathrin Leppek & Gun Woo Byeon & Wipapat Kladwang & Hannah K. Wayment-Steele & Craig H. Kerr & Adele F. Xu & Do Soon Kim & Ved V. Topkar & Christian Choe & Daphna Rothschild & Gerald C. Tiu & Roger We, 2022. "Combinatorial optimization of mRNA structure, stability, and translation for RNA-based therapeutics," Nature Communications, Nature, vol. 13(1), pages 1-22, December.
    14. Kevin Handtke & Lisa Richter-Beuschel & Susanne Bögeholz, 2022. "Self-Efficacy Beliefs of Teaching ESD: A Theory-Driven Instrument and the Effectiveness of ESD in German Teacher Education," Sustainability, MDPI, vol. 14(11), pages 1-32, May.
    15. Timofey A. Karginov & Antoine Ménoret & Anthony T. Vella, 2022. "Optimal CD8+ T cell effector function requires costimulation-induced RNA-binding proteins that reprogram the transcript isoform landscape," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    16. Anna Knörlein & Chris P. Sarnowski & Tebbe Vries & Moritz Stoltz & Michael Götze & Ruedi Aebersold & Frédéric H.-T. Allain & Alexander Leitner & Jonathan Hall, 2022. "Nucleotide-amino acid π-stacking interactions initiate photo cross-linking in RNA-protein complexes," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    17. repec:cup:judgdm:v:15:y:2020:i:2:p:193-202 is not listed on IDEAS
    18. Alan E. Stewart & Harrison E. Chapman & Jackson B. L. Davis, 2023. "Anxiety and Worry about Six Categories of Climate Change Impacts," IJERPH, MDPI, vol. 21(1), pages 1-24, December.
    19. Dang,Hai-Anh H. & Pullinger,John James & Serajuddin,Umar & Stacy,Brian William, 2024. "Reviewing Assessment Tools for Measuring Country Statistical Capacity," Policy Research Working Paper Series 10717, The World Bank.
    20. Miquel Anglada-Girotto & Ludovica Ciampi & Sophie Bonnal & Sarah A. Head & Samuel Miravet-Verde & Luis Serrano, 2024. "In silico RNA isoform screening to identify potential cancer driver exons with therapeutic applications," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    21. Robert Stojnic & Audrey Qiuyan Fu & Boris Adryan, 2012. "A Graphical Modelling Approach to the Dissection of Highly Correlated Transcription Factor Binding Site Profiles," PLOS Computational Biology, Public Library of Science, vol. 8(11), pages 1-13, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-36585-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.