IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v13y2022i1d10.1038_s41467-022-30248-0.html
   My bibliography  Save this article

A rare variant analysis framework using public genotype summary counts to prioritize disease-predisposition genes

Author

Listed:
  • Wenan Chen

    (St. Jude Children’s Research Hospital)

  • Shuoguo Wang

    (St. Jude Children’s Research Hospital
    150 Second Street)

  • Saima Sultana Tithi

    (St. Jude Children’s Research Hospital)

  • David W. Ellison

    (St. Jude Children’s Research Hospital)

  • Daniel J. Schaid

    (Mayo Clinic)

  • Gang Wu

    (St. Jude Children’s Research Hospital
    St. Jude Children’s Research Hospital)

Abstract

Sequencing cases without matched healthy controls hinders prioritization of germline disease-predisposition genes. To circumvent this problem, genotype summary counts from public data sets can serve as controls. However, systematic inflation and false positives can arise if confounding factors are not controlled. We propose a framework, consistent summary counts based rare variant burden test (CoCoRV), to address these challenges. CoCoRV implements consistent variant quality control and filtering, ethnicity-stratified rare variant association test, accurate estimation of inflation factors, powerful FDR control, and detection of rare variant pairs in high linkage disequilibrium. When we applied CoCoRV to pediatric cancer cohorts, the top genes identified were cancer-predisposition genes. We also applied CoCoRV to identify disease-predisposition genes in adult brain tumors and amyotrophic lateral sclerosis. Given that potential confounding factors were well controlled after applying the framework, CoCoRV provides a cost-effective solution to prioritizing disease-risk genes enriched with rare pathogenic variants.

Suggested Citation

  • Wenan Chen & Shuoguo Wang & Saima Sultana Tithi & David W. Ellison & Daniel J. Schaid & Gang Wu, 2022. "A rare variant analysis framework using public genotype summary counts to prioritize disease-predisposition genes," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
  • Handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-30248-0
    DOI: 10.1038/s41467-022-30248-0
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-022-30248-0
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-022-30248-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Sebastian M. Waszak & Giles W, Robinson & Brian L. Gudenas & Kyle S. Smith & Antoine Forget & Marija Kojic & Jesus Garcia-Lopez & Jennifer Hadley & Kayla V. Hamilton & Emilie Indersie & Ivo Buchhalter, 2020. "Germline Elongator mutations in Sonic Hedgehog medulloblastoma," Nature, Nature, vol. 580(7803), pages 396-401, April.
    2. Audrey E Hendricks & Stephen C Billups & Hamish N C Pike & I Sadaf Farooqi & Eleftheria Zeggini & Stephanie A Santorico & Inês Barroso & Josée Dupuis, 2018. "ProxECAT: Proxy External Controls Association Test. A new case-control gene region association test using allele frequencies from public controls," PLOS Genetics, Public Library of Science, vol. 14(10), pages 1-14, October.
    3. Konrad J. Karczewski & Laurent C. Francioli & Grace Tiao & Beryl B. Cummings & Jessica Alföldi & Qingbo Wang & Ryan L. Collins & Kristen M. Laricchia & Andrea Ganna & Daniel P. Birnbaum & Laura D. Gau, 2020. "The mutational constraint spectrum quantified from variation in 141,456 humans," Nature, Nature, vol. 581(7809), pages 434-443, May.
    4. Allison A. Regier & Yossi Farjoun & David E. Larson & Olga Krasheninina & Hyun Min Kang & Daniel P. Howrigan & Bo-Juen Chen & Manisha Kher & Eric Banks & Darren C. Ames & Adam C. English & Heng Li & J, 2018. "Functional equivalence of genome sequencing analysis pipelines enables harmonized variant calling across human genetics projects," Nature Communications, Nature, vol. 9(1), pages 1-8, December.
    5. Qingbo Wang & Emma Pierce-Hoffman & Beryl B. Cummings & Jessica Alföldi & Laurent C. Francioli & Laura D. Gauthier & Andrew J. Hill & Anne H. O’Donnell-Luria & Konrad J. Karczewski & Daniel G. MacArth, 2020. "Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes," Nature Communications, Nature, vol. 11(1), pages 1-13, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nazia Pathan & Wei Q. Deng & Matteo Di Scipio & Mohammad Khan & Shihong Mao & Robert W. Morton & Ricky Lali & Marie Pigeyre & Michael R. Chong & Guillaume Paré, 2024. "A method to estimate the contribution of rare coding variants to complex trait heritability," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    2. Ricky Lali & Michael Chong & Arghavan Omidi & Pedrum Mohammadi-Shemirani & Ann Le & Edward Cui & Guillaume Paré, 2021. "Calibrated rare variant genetic risk scores for complex disease prediction using large exome sequence repositories," Nature Communications, Nature, vol. 12(1), pages 1-15, December.
    3. Ulrik Kristoffer Stoltze & Jon Foss-Skiftesvik & Thomas van Overeem Hansen & Simon Rasmussen & Konrad J. Karczewski & Karin A. W. Wadt & Kjeld Schmiegelow, 2024. "The evolutionary impact of childhood cancer on the human gene pool," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    4. Chao Yang & Zhenzhen Ma & Keshan Wang & Xingxiao Dong & Meiyu Huang & Yaqiu Li & Xiagu Zhu & Ju Li & Zhihui Cheng & Changhao Bi & Xueli Zhang, 2023. "HMGN1 enhances CRISPR-directed dual-function A-to-G and C-to-G base editing," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    5. Asmundur Oddsson & Patrick Sulem & Gardar Sveinbjornsson & Gudny A. Arnadottir & Valgerdur Steinthorsdottir & Gisli H. Halldorsson & Bjarni A. Atlason & Gudjon R. Oskarsson & Hannes Helgason & Henriet, 2023. "Deficit of homozygosity among 1.52 million individuals and genetic causes of recessive lethality," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    6. Vincent Michaud & Eulalie Lasseaux & David J. Green & Dave T. Gerrard & Claudio Plaisant & Tomas Fitzgerald & Ewan Birney & Benoît Arveiler & Graeme C. Black & Panagiotis I. Sergouniotis, 2022. "The contribution of common regulatory and protein-coding TYR variants to the genetic architecture of albinism," Nature Communications, Nature, vol. 13(1), pages 1-8, December.
    7. Natalie DeForest & Yuqi Wang & Zhiyi Zhu & Jacqueline S. Dron & Ryan Koesterer & Pradeep Natarajan & Jason Flannick & Tiffany Amariuta & Gina M. Peloso & Amit R. Majithia, 2024. "Genome-wide discovery and integrative genomic characterization of insulin resistance loci using serum triglycerides to HDL-cholesterol ratio as a proxy," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    8. Laura M. Mueller & Abigail Isaacson & Heather Wilson & Anna Salowka & Isabel Tay & Maolian Gong & Nancy Samir Elbarbary & Klemens Raile & Francesca M. Spagnoli, 2024. "Heterozygous missense variant in GLI2 impairs human endocrine pancreas development," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    9. Alexendar R. Perez & Laura Sala & Richard K. Perez & Joana A. Vidigal, 2021. "CSC software corrects off-target mediated gRNA depletion in CRISPR-Cas9 essentiality screens," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    10. Kian Hong Kock & Patrick K. Kimes & Stephen S. Gisselbrecht & Sachi Inukai & Sabrina K. Phanor & James T. Anderson & Gayatri Ramakrishnan & Colin H. Lipper & Dongyuan Song & Jesse V. Kurland & Julia M, 2024. "DNA binding analysis of rare variants in homeodomains reveals homeodomain specificity-determining residues," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    11. Gaëlle Odelin & Adèle Faucherre & Damien Marchese & Amélie Pinard & Hager Jaouadi & Solena Scouarnec & Raphaël Chiarelli & Younes Achouri & Emilie Faure & Marine Herbane & Alexis Théron & Jean-Françoi, 2023. "Variations in the poly-histidine repeat motif of HOXA1 contribute to bicuspid aortic valve in mouse and zebrafish," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    12. Matthew Tegtmeyer & Jatin Arora & Samira Asgari & Beth A. Cimini & Ajay Nadig & Emily Peirent & Dhara Liyanage & Gregory P. Way & Erin Weisbart & Aparna Nathan & Tiffany Amariuta & Kevin Eggan & Marzi, 2024. "High-dimensional phenotyping to define the genetic basis of cellular morphology," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    13. Erik Schoenmakers & Federica Marelli & Helle F. Jørgensen & W. Edward Visser & Carla Moran & Stefan Groeneweg & Carolina Avalos & Sean J. Jurgens & Nichola Figg & Alison Finigan & Neha Wali & Maura Ag, 2023. "Selenoprotein deficiency disorder predisposes to aortic aneurysm formation," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    14. Sarah E. Garnish & Katherine R. Martin & Maria Kauppi & Victoria E. Jackson & Rebecca Ambrose & Vik Ven Eng & Shene Chiou & Yanxiang Meng & Daniel Frank & Emma C. Tovey Crutchfield & Komal M. Patel & , 2023. "A common human MLKL polymorphism confers resistance to negative regulation by phosphorylation," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    15. Matthew J. O’Neill & Tao Yang & Julie Laudeman & Maria E. Calandranis & M. Lorena Harvey & Joseph F. Solus & Dan M. Roden & Andrew M. Glazer, 2024. "ParSE-seq: a calibrated multiplexed assay to facilitate the clinical classification of putative splice-altering variants," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    16. Xiaoyi Raymond Gao & Marion Chiariglione & Alexander J. Arch, 2022. "Whole-exome sequencing study identifies rare variants and genes associated with intraocular pressure and glaucoma," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    17. Shaan Khurshid & Julieta Lazarte & James P. Pirruccello & Lu-Chen Weng & Seung Hoan Choi & Amelia W. Hall & Xin Wang & Samuel F. Friedman & Victor Nauffal & Kiran J. Biddinger & Krishna G. Aragam & Pu, 2023. "Clinical and genetic associations of deep learning-derived cardiac magnetic resonance-based left ventricular mass," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    18. Javier Rodríguez-Ubreva & Anna Arutyunyan & Marc Jan Bonder & Lucía Del Pino-Molina & Stephen J. Clark & Carlos de la Calle-Fabregat & Luz Garcia-Alonso & Louis-François Handfield & Laura Ciudad & Edu, 2022. "Single-cell Atlas of common variable immunodeficiency shows germinal center-associated epigenetic dysregulation in B-cell responses," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    19. Michel S. Naslavsky & Marilia O. Scliar & Guilherme L. Yamamoto & Jaqueline Yu Ting Wang & Stepanka Zverinova & Tatiana Karp & Kelly Nunes & José Ricardo Magliocco Ceroni & Diego Lima Carvalho & Carlo, 2022. "Whole-genome sequencing of 1,171 elderly admixed individuals from Brazil," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    20. Nicole Deflaux & Margaret Sunitha Selvaraj & Henry Robert Condon & Kelsey Mayo & Sara Haidermota & Melissa A. Basford & Chris Lunt & Anthony A. Philippakis & Dan M. Roden & Joshua C. Denny & Anjene Mu, 2023. "Demonstrating paths for unlocking the value of cloud genomics through cross cohort analysis," Nature Communications, Nature, vol. 14(1), pages 1-10, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-30248-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.