IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-024-55185-y.html
   My bibliography  Save this article

Systematically developing a registry of splice-site creating variants utilizing massive publicly available transcriptome sequence data

Author

Listed:
  • Naoko Iida

    (National Cancer Center Research Institute)

  • Ai Okada

    (National Cancer Center Research Institute)

  • Yoshihisa Kobayashi

    (National Cancer Center Research Institute)

  • Kenichi Chiba

    (National Cancer Center Research Institute)

  • Yasushi Yatabe

    (National Cancer Center Research Institute)

  • Yuichi Shiraishi

    (National Cancer Center Research Institute)

Abstract

Genomic variants causing abnormal splicing play important roles in genetic disorders and cancer development. Among them, variants that cause the formation of novel splice-sites (splice-site creating variants, SSCVs) are particularly difficult to identify and often overlooked in genomic studies. Additionally, these SSCVs are frequently considered promising candidates for treatment with splice-switching antisense oligonucleotides (ASOs). To leverage massive transcriptome sequence data such as those available from the Sequence Read Archive, we develop a novel framework to screen for SSCVs solely using transcriptome data. We apply it to 322,072 publicly available transcriptomes and identify 30,130 SSCVs. Among them, 5121 SSCVs affect disease-causing variants. By utilizing this extensive collection of SSCVs, we reveal the characteristics of Alu exonization via SSCVs, especially the hotspots of SSCVs within Alu sequences and their evolutionary relationships. We discover novel gain-of-function SSCVs in the deep intronic region of the NOTCH1 gene and demonstrate that their activation can be suppressed using splice-switching ASOs. Collectively, we provide a systematic approach for automatically acquiring a registry of SSCVs, which facilitates the elucidation of novel biological mechanisms underlying splicing and serves as a valuable resource for drug discovery. The catalogs of SSCVs identified in this study are accessible on the SSCV DB ( https://sscvdb.io ).

Suggested Citation

  • Naoko Iida & Ai Okada & Yoshihisa Kobayashi & Kenichi Chiba & Yasushi Yatabe & Yuichi Shiraishi, 2025. "Systematically developing a registry of splice-site creating variants utilizing massive publicly available transcriptome sequence data," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-024-55185-y
    DOI: 10.1038/s41467-024-55185-y
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-55185-y
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-55185-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Alexander Lachmann & Denis Torre & Alexandra B. Keenan & Kathleen M. Jagodnik & Hoyjin J. Lee & Lily Wang & Moshe C. Silverstein & Avi Ma’ayan, 2018. "Massive mining of publicly available RNA-seq data from human and mouse," Nature Communications, Nature, vol. 9(1), pages 1-10, December.
    2. Yoshihisa Kobayashi & Geoffrey R. Oxnard & Elizabeth F. Cohen & Navin R. Mahadevan & Joao V. Alessi & Yin P. Hung & Arrien A. Bertram & David E. Heppner & Mauricio F. Ribeiro & Karina P. Sacardo & Rod, 2022. "Genomic and biological study of fusion genes as resistance mechanisms to EGFR inhibitors," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    3. Christian Mertes & Ines F. Scheller & Vicente A. Yépez & Muhammed H. Çelik & Yingjiqiong Liang & Laura S. Kremer & Mirjana Gusic & Holger Prokisch & Julien Gagneur, 2021. "Detection of aberrant splicing events in RNA-seq data using FRASER," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    4. Konrad J. Karczewski & Laurent C. Francioli & Grace Tiao & Beryl B. Cummings & Jessica Alföldi & Qingbo Wang & Ryan L. Collins & Kristen M. Laricchia & Andrea Ganna & Daniel P. Birnbaum & Laura D. Gau, 2020. "The mutational constraint spectrum quantified from variation in 141,456 humans," Nature, Nature, vol. 581(7809), pages 434-443, May.
    5. Song Cao & Daniel Cui Zhou & Clara Oh & Reyka G. Jayasinghe & Yanyan Zhao & Christopher J. Yoon & Matthew A. Wyczalkowski & Matthew H. Bailey & Terrence Tsou & Qingsong Gao & Andrew Malone & Sheila Re, 2020. "Discovery of driver non-coding splice-site-creating mutations in cancer," Nature Communications, Nature, vol. 11(1), pages 1-11, December.
    6. Ruebena Dawes & Himanshu Joshi & Sandra T. Cooper, 2022. "Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    7. Jinkuk Kim & Sijae Woo & Claudio M. Gusmao & Boxun Zhao & Diana H. Chin & Renata L. DiDonato & Minh A. Nguyen & Tojo Nakayama & Chunguang April Hu & Aubrie Soucy & Ashley Kuniholm & Jennifer Karlin Th, 2023. "A framework for individualized splice-switching oligonucleotide therapy," Nature, Nature, vol. 619(7971), pages 828-836, July.
    8. Tuuli Lappalainen & Michael Sammeth & Marc R. Friedländer & Peter A. C. ‘t Hoen & Jean Monlong & Manuel A. Rivas & Mar Gonzàlez-Porta & Natalja Kurbatova & Thasso Griebel & Pedro G. Ferreira & Matthia, 2013. "Transcriptome and genome sequencing uncovers functional variation in humans," Nature, Nature, vol. 501(7468), pages 506-511, September.
    9. Yuichi Shiraishi & Ai Okada & Kenichi Chiba & Asuka Kawachi & Ikuko Omori & Raúl Nicolás Mateos & Naoko Iida & Hirofumi Yamauchi & Kenjiro Kosaki & Akihide Yoshimi, 2022. "Systematic identification of intron retention associated variants from massive publicly available transcriptome sequencing data," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yuichi Shiraishi & Ai Okada & Kenichi Chiba & Asuka Kawachi & Ikuko Omori & Raúl Nicolás Mateos & Naoko Iida & Hirofumi Yamauchi & Kenjiro Kosaki & Akihide Yoshimi, 2022. "Systematic identification of intron retention associated variants from massive publicly available transcriptome sequencing data," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    2. Taek-Chin Cheong & Ahram Jang & Qi Wang & Giulia C. Leonardi & Biagio Ricciuti & Joao V. Alessi & Alessandro Di Federico & Mark M. Awad & Maria K. Lehtinen & Marian H. Harris & Roberto Chiarle, 2024. "Mechanistic patterns and clinical implications of oncogenic tyrosine kinase fusions in human cancers," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    3. Yirong Shi & Yiwei Niu & Peng Zhang & Huaxia Luo & Shuai Liu & Sijia Zhang & Jiajia Wang & Yanyan Li & Xinyue Liu & Tingrui Song & Tao Xu & Shunmin He, 2023. "Characterization of genome-wide STR variation in 6487 human genomes," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    4. Qingbo S. Wang & Ryuya Edahiro & Ho Namkoong & Takanori Hasegawa & Yuya Shirai & Kyuto Sonehara & Hiromu Tanaka & Ho Lee & Ryunosuke Saiki & Takayoshi Hyugaji & Eigo Shimizu & Kotoe Katayama & Masahir, 2022. "The whole blood transcriptional regulation landscape in 465 COVID-19 infected samples from Japan COVID-19 Task Force," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    5. Asmundur Oddsson & Patrick Sulem & Gardar Sveinbjornsson & Gudny A. Arnadottir & Valgerdur Steinthorsdottir & Gisli H. Halldorsson & Bjarni A. Atlason & Gudjon R. Oskarsson & Hannes Helgason & Henriet, 2023. "Deficit of homozygosity among 1.52 million individuals and genetic causes of recessive lethality," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    6. Vincent Michaud & Eulalie Lasseaux & David J. Green & Dave T. Gerrard & Claudio Plaisant & Tomas Fitzgerald & Ewan Birney & Benoît Arveiler & Graeme C. Black & Panagiotis I. Sergouniotis, 2022. "The contribution of common regulatory and protein-coding TYR variants to the genetic architecture of albinism," Nature Communications, Nature, vol. 13(1), pages 1-8, December.
    7. Natalie DeForest & Yuqi Wang & Zhiyi Zhu & Jacqueline S. Dron & Ryan Koesterer & Pradeep Natarajan & Jason Flannick & Tiffany Amariuta & Gina M. Peloso & Amit R. Majithia, 2024. "Genome-wide discovery and integrative genomic characterization of insulin resistance loci using serum triglycerides to HDL-cholesterol ratio as a proxy," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    8. Katharina T. Schmid & Barbara Höllbacher & Cristiana Cruceanu & Anika Böttcher & Heiko Lickert & Elisabeth B. Binder & Fabian J. Theis & Matthias Heinig, 2021. "scPower accelerates and optimizes the design of multi-sample single cell transcriptomic studies," Nature Communications, Nature, vol. 12(1), pages 1-18, December.
    9. Laura M. Mueller & Abigail Isaacson & Heather Wilson & Anna Salowka & Isabel Tay & Maolian Gong & Nancy Samir Elbarbary & Klemens Raile & Francesca M. Spagnoli, 2024. "Heterozygous missense variant in GLI2 impairs human endocrine pancreas development," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    10. Chi-Fen Chang & Shu-Pin Huang & Yu-Mei Hsueh & Jiun-Hung Geng & Chao-Yuan Huang & Bo-Ying Bao, 2022. "Genetic Analysis Implicates Dysregulation of SHANK2 in Renal Cell Carcinoma Progression," IJERPH, MDPI, vol. 19(19), pages 1-9, September.
    11. Alexendar R. Perez & Laura Sala & Richard K. Perez & Joana A. Vidigal, 2021. "CSC software corrects off-target mediated gRNA depletion in CRISPR-Cas9 essentiality screens," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    12. Kian Hong Kock & Patrick K. Kimes & Stephen S. Gisselbrecht & Sachi Inukai & Sabrina K. Phanor & James T. Anderson & Gayatri Ramakrishnan & Colin H. Lipper & Dongyuan Song & Jesse V. Kurland & Julia M, 2024. "DNA binding analysis of rare variants in homeodomains reveals homeodomain specificity-determining residues," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    13. Jonathan P. Ling & Alexei M. Bygrave & Clayton P. Santiago & Rogger P. Carmen-Orozco & Vickie T. Trinh & Minzhong Yu & Yini Li & Ying Liu & Kyra D. Bowden & Leighton H. Duncan & Jeong Han & Kamil Tane, 2022. "Cell-specific regulation of gene expression using splicing-dependent frameshifting," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    14. Kenneth A. Wilson & Sudipta Bar & Eric B. Dammer & Enrique M. Carrera & Brian A. Hodge & Tyler A. U. Hilsabeck & Joanna Bons & George W. Brownridge & Jennifer N. Beck & Jacob Rose & Melia Granath-Pane, 2024. "OXR1 maintains the retromer to delay brain aging under dietary restriction," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    15. Gaëlle Odelin & Adèle Faucherre & Damien Marchese & Amélie Pinard & Hager Jaouadi & Solena Scouarnec & Raphaël Chiarelli & Younes Achouri & Emilie Faure & Marine Herbane & Alexis Théron & Jean-Françoi, 2023. "Variations in the poly-histidine repeat motif of HOXA1 contribute to bicuspid aortic valve in mouse and zebrafish," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    16. Matthew Tegtmeyer & Jatin Arora & Samira Asgari & Beth A. Cimini & Ajay Nadig & Emily Peirent & Dhara Liyanage & Gregory P. Way & Erin Weisbart & Aparna Nathan & Tiffany Amariuta & Kevin Eggan & Marzi, 2024. "High-dimensional phenotyping to define the genetic basis of cellular morphology," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    17. Olivia Kim-McManus & Joseph G. Gleeson & Laurence Mignon & Amena Smith Fine & Winston Yan & Nicole Nolen & Scott Demarest & Elizabeth Berry-Kravis & Richard Finkel & Stefanie Leonard & Samuel Finlayso, 2024. "A framework for N-of-1 trials of individualized gene-targeted therapies for genetic diseases," Nature Communications, Nature, vol. 15(1), pages 1-5, December.
    18. Erik Schoenmakers & Federica Marelli & Helle F. Jørgensen & W. Edward Visser & Carla Moran & Stefan Groeneweg & Carolina Avalos & Sean J. Jurgens & Nichola Figg & Alison Finigan & Neha Wali & Maura Ag, 2023. "Selenoprotein deficiency disorder predisposes to aortic aneurysm formation," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    19. Sarah E. Garnish & Katherine R. Martin & Maria Kauppi & Victoria E. Jackson & Rebecca Ambrose & Vik Ven Eng & Shene Chiou & Yanxiang Meng & Daniel Frank & Emma C. Tovey Crutchfield & Komal M. Patel & , 2023. "A common human MLKL polymorphism confers resistance to negative regulation by phosphorylation," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    20. Matthew J. O’Neill & Tao Yang & Julie Laudeman & Maria E. Calandranis & M. Lorena Harvey & Joseph F. Solus & Dan M. Roden & Andrew M. Glazer, 2024. "ParSE-seq: a calibrated multiplexed assay to facilitate the clinical classification of putative splice-altering variants," Nature Communications, Nature, vol. 15(1), pages 1-15, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-024-55185-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.