IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v13y2022i1d10.1038_s41467-022-32887-9.html
   My bibliography  Save this article

Systematic identification of intron retention associated variants from massive publicly available transcriptome sequencing data

Author

Listed:
  • Yuichi Shiraishi

    (National Cancer Center Research Institute)

  • Ai Okada

    (National Cancer Center Research Institute)

  • Kenichi Chiba

    (National Cancer Center Research Institute)

  • Asuka Kawachi

    (National Cancer Center Research Institute)

  • Ikuko Omori

    (National Cancer Center Research Institute)

  • Raúl Nicolás Mateos

    (National Cancer Center Research Institute)

  • Naoko Iida

    (National Cancer Center Research Institute)

  • Hirofumi Yamauchi

    (National Cancer Center Research Institute)

  • Kenjiro Kosaki

    (Keio University School of Medicine)

  • Akihide Yoshimi

    (National Cancer Center Research Institute)

Abstract

Many disease-associated genomic variants disrupt gene function through abnormal splicing. With the advancement of genomic medicine, identifying disease-associated splicing associated variants has become more important than ever. Most bioinformatics approaches to detect splicing associated variants require both genome and transcriptomic data. However, there are not many datasets where both of them are available. In this study, we develop a methodology to detect genomic variants that cause splicing changes (more specifically, intron retention), using transcriptome sequencing data alone. After evaluating its sensitivity and precision, we apply it to 230,988 transcriptome sequencing data from the publicly available repository and identified 27,049 intron retention associated variants (IRAVs). In addition, by exploring positional relationships with variants registered in existing disease databases, we extract 3,000 putative disease-associated IRAVs, which range from cancer drivers to variants linked with autosomal recessive disorders. The in-silico screening framework demonstrates the possibility of near-automatically acquiring medical knowledge, making the most of massively accumulated publicly available sequencing data. Collections of IRAVs identified in this study are available through IRAVDB ( https://iravdb.io/ ).

Suggested Citation

  • Yuichi Shiraishi & Ai Okada & Kenichi Chiba & Asuka Kawachi & Ikuko Omori & Raúl Nicolás Mateos & Naoko Iida & Hirofumi Yamauchi & Kenjiro Kosaki & Akihide Yoshimi, 2022. "Systematic identification of intron retention associated variants from massive publicly available transcriptome sequencing data," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
  • Handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-32887-9
    DOI: 10.1038/s41467-022-32887-9
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-022-32887-9
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-022-32887-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Konrad J. Karczewski & Laurent C. Francioli & Grace Tiao & Beryl B. Cummings & Jessica Alföldi & Qingbo Wang & Ryan L. Collins & Kristen M. Laricchia & Andrea Ganna & Daniel P. Birnbaum & Laura D. Gau, 2020. "The mutational constraint spectrum quantified from variation in 141,456 humans," Nature, Nature, vol. 581(7809), pages 434-443, May.
    2. Nazneen Rahman, 2014. "Correction: Corrigendum: Realizing the promise of cancer predisposition genes," Nature, Nature, vol. 510(7503), pages 176-176, June.
    3. Tuuli Lappalainen & Michael Sammeth & Marc R. Friedländer & Peter A. C. ‘t Hoen & Jean Monlong & Manuel A. Rivas & Mar Gonzàlez-Porta & Natalja Kurbatova & Thasso Griebel & Pedro G. Ferreira & Matthia, 2013. "Transcriptome and genome sequencing uncovers functional variation in humans," Nature, Nature, vol. 501(7468), pages 506-511, September.
    4. Song Cao & Daniel Cui Zhou & Clara Oh & Reyka G. Jayasinghe & Yanyan Zhao & Christopher J. Yoon & Matthew A. Wyczalkowski & Matthew H. Bailey & Terrence Tsou & Qingsong Gao & Andrew Malone & Sheila Re, 2020. "Discovery of driver non-coding splice-site-creating mutations in cancer," Nature Communications, Nature, vol. 11(1), pages 1-11, December.
    5. Nazneen Rahman, 2014. "Realizing the promise of cancer predisposition genes," Nature, Nature, vol. 505(7483), pages 302-308, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ulrik Kristoffer Stoltze & Jon Foss-Skiftesvik & Thomas van Overeem Hansen & Simon Rasmussen & Konrad J. Karczewski & Karin A. W. Wadt & Kjeld Schmiegelow, 2024. "The evolutionary impact of childhood cancer on the human gene pool," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    2. Mischan Vali-Pour & Solip Park & Jose Espinosa-Carrasco & Daniel Ortiz-Martínez & Ben Lehner & Fran Supek, 2022. "The impact of rare germline variants on human somatic mutation processes," Nature Communications, Nature, vol. 13(1), pages 1-21, December.
    3. Qingbo S. Wang & Ryuya Edahiro & Ho Namkoong & Takanori Hasegawa & Yuya Shirai & Kyuto Sonehara & Hiromu Tanaka & Ho Lee & Ryunosuke Saiki & Takayoshi Hyugaji & Eigo Shimizu & Kotoe Katayama & Masahir, 2022. "The whole blood transcriptional regulation landscape in 465 COVID-19 infected samples from Japan COVID-19 Task Force," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    4. Yirong Shi & Yiwei Niu & Peng Zhang & Huaxia Luo & Shuai Liu & Sijia Zhang & Jiajia Wang & Yanyan Li & Xinyue Liu & Tingrui Song & Tao Xu & Shunmin He, 2023. "Characterization of genome-wide STR variation in 6487 human genomes," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    5. Vincent Michaud & Eulalie Lasseaux & David J. Green & Dave T. Gerrard & Claudio Plaisant & Tomas Fitzgerald & Ewan Birney & Benoît Arveiler & Graeme C. Black & Panagiotis I. Sergouniotis, 2022. "The contribution of common regulatory and protein-coding TYR variants to the genetic architecture of albinism," Nature Communications, Nature, vol. 13(1), pages 1-8, December.
    6. Chi-Fen Chang & Shu-Pin Huang & Yu-Mei Hsueh & Jiun-Hung Geng & Chao-Yuan Huang & Bo-Ying Bao, 2022. "Genetic Analysis Implicates Dysregulation of SHANK2 in Renal Cell Carcinoma Progression," IJERPH, MDPI, vol. 19(19), pages 1-9, September.
    7. Alexendar R. Perez & Laura Sala & Richard K. Perez & Joana A. Vidigal, 2021. "CSC software corrects off-target mediated gRNA depletion in CRISPR-Cas9 essentiality screens," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    8. Michel S. Naslavsky & Marilia O. Scliar & Guilherme L. Yamamoto & Jaqueline Yu Ting Wang & Stepanka Zverinova & Tatiana Karp & Kelly Nunes & José Ricardo Magliocco Ceroni & Diego Lima Carvalho & Carlo, 2022. "Whole-genome sequencing of 1,171 elderly admixed individuals from Brazil," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    9. Nicole Deflaux & Margaret Sunitha Selvaraj & Henry Robert Condon & Kelsey Mayo & Sara Haidermota & Melissa A. Basford & Chris Lunt & Anthony A. Philippakis & Dan M. Roden & Joshua C. Denny & Anjene Mu, 2023. "Demonstrating paths for unlocking the value of cloud genomics through cross cohort analysis," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    10. Andrea Wilderman & Eva D’haene & Machteld Baetens & Tara N. Yankee & Emma Wentworth Winchester & Nicole Glidden & Ellen Roets & Jo Dorpe & Sandra Janssens & Danny E. Miller & Miranda Galey & Kari M. B, 2024. "A distant global control region is essential for normal expression of anterior HOXA genes during mouse and human craniofacial development," Nature Communications, Nature, vol. 15(1), pages 1-23, December.
    11. Ruoyu Tian & Tian Ge & Hyeokmoon Kweon & Daniel B. Rocha & Max Lam & Jimmy Z. Liu & Kritika Singh & Daniel F. Levey & Joel Gelernter & Murray B. Stein & Ellen A. Tsai & Hailiang Huang & Christopher F., 2024. "Whole-exome sequencing in UK Biobank reveals rare genetic architecture for depression," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    12. Mary-Ellen Lynall & Blagoje Soskic & James Hayhurst & Jeremy Schwartzentruber & Daniel F. Levey & Gita A. Pathak & Renato Polimanti & Joel Gelernter & Murray B. Stein & Gosia Trynka & Menna R. Clatwor, 2022. "Genetic variants associated with psychiatric disorders are enriched at epigenetically active sites in lymphoid cells," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    13. Adrienne Tin & Pascal Schlosser & Pamela R. Matias-Garcia & Chris H. L. Thio & Roby Joehanes & Hongbo Liu & Zhi Yu & Antoine Weihs & Anselm Hoppmann & Franziska Grundner-Culemann & Josine L. Min & Vic, 2021. "Epigenome-wide association study of serum urate reveals insights into urate co-regulation and the SLC2A9 locus," Nature Communications, Nature, vol. 12(1), pages 1-18, December.
    14. Oriol Pich & Iker Reyes-Salazar & Abel Gonzalez-Perez & Nuria Lopez-Bigas, 2022. "Discovering the drivers of clonal hematopoiesis," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    15. Magdalena Zimoń & Yunfeng Huang & Anthi Trasta & Aliaksandr Halavatyi & Jimmy Z. Liu & Chia-Yen Chen & Peter Blattmann & Bernd Klaus & Christopher D. Whelan & David Sexton & Sally John & Wolfgang Hube, 2021. "Pairwise effects between lipid GWAS genes modulate lipid plasma levels and cellular uptake," Nature Communications, Nature, vol. 12(1), pages 1-16, December.
    16. Yangci Liu & Haoming Zhai & Helen Alemayehu & Jérôme Boulanger & Lee J. Hopkins & Alicia C. Borgeaud & Christina Heroven & Jonathan D. Howe & Kendra E. Leigh & Clare E. Bryant & Yorgo Modis, 2023. "Cryo-electron tomography of NLRP3-activated ASC complexes reveals organelle co-localization," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    17. Ping Chun Wu & Yan Quan Lee & Mattias Möller & Jill R. Storry & Martin L. Olsson, 2023. "Elucidation of the low-expressing erythroid CR1 phenotype by bioinformatic mining of the GATA1-driven blood-group regulome," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    18. Jörn Bethune & April Kleppe & Søren Besenbacher, 2022. "A method to build extended sequence context models of point mutations and indels," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    19. Anneke Brümmer & Sven Bergmann, 2024. "Disentangling genetic effects on transcriptional and post-transcriptional gene regulation through integrating exon and intron expression QTLs," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    20. Laia Simó-Riudalbas & Sandra Offner & Evarist Planet & Julien Duc & Laurence Abrami & Sagane Dind & Alexandre Coudray & Mairene Coto-Llerena & Caner Ercan & Salvatore Piscuoglio & Claus Lindbjerg Ande, 2022. "Transposon-activated POU5F1B promotes colorectal cancer growth and metastasis," Nature Communications, Nature, vol. 13(1), pages 1-17, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-32887-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.