Identifying interpretable gene-biomarker associations with functionally informed kernel-based tests in 190,000 exomes

Author

Listed:
  • Remo Monti

    (Digital Health - Machine Learning, Hasso Plattner Institute, University of Potsdam, Digital Engineering Faculty
    Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB))

  • Pia Rautenstrauch

    (Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB)
    Humboldt-Universität zu Berlin, Department of Computer Science)

  • Mahsa Ghanbari

    (Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB))

  • Alva Rani James

    (Digital Health - Machine Learning, Hasso Plattner Institute, University of Potsdam, Digital Engineering Faculty)

  • Matthias Kirchler

    (Digital Health - Machine Learning, Hasso Plattner Institute, University of Potsdam, Digital Engineering Faculty
    TU Kaiserslautern, Department of Computer Science)

  • Uwe Ohler

    (Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB)
    Humboldt-Universität zu Berlin, Department of Biology)

  • Stefan Konigorski

    (Digital Health - Machine Learning, Hasso Plattner Institute, University of Potsdam, Digital Engineering Faculty
    Hasso Plattner Institute for Digital Health, Icahn School of Medicine at Mount Sinai)

  • Christoph Lippert

    (Digital Health - Machine Learning, Hasso Plattner Institute, University of Potsdam, Digital Engineering Faculty
    Hasso Plattner Institute for Digital Health, Icahn School of Medicine at Mount Sinai)

Abstract

Here we present an exome-wide rare genetic variant association study for 30 blood biomarkers in 191,971 individuals in the UK Biobank. We compare gene-based association tests for separate functional variant categories to increase interpretability and identify 193 significant gene-biomarker associations. Genes associated with biomarkers were ~4.5-fold enriched for conferring Mendelian disorders. In addition to performing weighted gene-based variant collapsing tests, we design and apply variant-category-specific kernel-based tests that integrate quantitative functional variant effect predictions for missense variants, splicing and the binding of RNA-binding proteins. For these tests, we present a computationally efficient combination of the likelihood-ratio and score tests that found 36% more associations than the score test alone while also controlling the type-1 error. Kernel-based tests identified 13% more associations than their gene-based collapsing counterparts and had advantages in the presence of gain-of-function missense variants. We introduce local collapsing by amino acid position for missense variants and use it to interpret associations and identify potential novel gain-of-function variants in PIEZO1. Our results show the benefits of investigating different functional mechanisms when performing rare-variant association tests, and demonstrate pervasive rare-variant contribution to biomarker variability.
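
The contrast the abstract draws between collapsing tests and kernel-based tests can be illustrated with a small sketch. This is not the authors' implementation: it assumes a quantitative trait, an intercept-only covariate matrix, and made-up per-variant weights standing in for functional effect predictions. The names `residualize`, `burden_stat`, `kernel_stat`, `permutation_pvalue` and `simulate` are hypothetical, and a permutation p-value is swapped in for the analytic null distributions that score and likelihood-ratio tests normally rely on.

```python
import numpy as np

def residualize(y, X):
    """Residuals of y after least-squares regression on covariates X."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return y - X @ beta

def burden_stat(G, w, r):
    """Collapsing test: squared score of the single weighted burden variable G @ w."""
    return float(((G @ w) @ r) ** 2)

def kernel_stat(G, w, r):
    """Kernel test: Q = r^T K r with weighted linear kernel K = G diag(w^2) G^T."""
    s = (G * w).T @ r  # one weighted score per variant
    return float(s @ s)

def permutation_pvalue(stat_fn, G, w, r, n_perm=999, seed=0):
    """Permutation p-value (an assumption made here for simplicity)."""
    rng = np.random.default_rng(seed)
    observed = stat_fn(G, w, r)
    hits = sum(stat_fn(G, w, rng.permutation(r)) >= observed for _ in range(n_perm))
    return (hits + 1) / (n_perm + 1)

def simulate(n=2000, m=20, seed=1):
    """Toy gene: rare variants whose effects point in opposite directions."""
    rng = np.random.default_rng(seed)
    maf = rng.uniform(0.002, 0.01, size=m)             # hypothetical allele frequencies
    G = rng.binomial(2, maf, size=(n, m)).astype(float)
    effects = np.zeros(m)
    effects[:3], effects[3:6] = 1.5, -1.5              # mixed effect directions
    y = G @ effects + rng.normal(size=n)
    w = rng.uniform(0.2, 1.0, size=m)                  # hypothetical functional scores
    return G, y, w

if __name__ == "__main__":
    G, y, w = simulate()
    r = residualize(y, np.ones((len(y), 1)))           # intercept-only null model
    print("burden p:", permutation_pvalue(burden_stat, G, w, r))
    print("kernel p:", permutation_pvalue(kernel_stat, G, w, r))
```

The burden statistic sums weighted genotypes before scoring, so effects of opposite sign can cancel. The kernel statistic squares each variant's score before summing, which is one reason such tests can retain power when a gene carries both loss-of-function and gain-of-function variants.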

Suggested Citation

  • Remo Monti & Pia Rautenstrauch & Mahsa Ghanbari & Alva Rani James & Matthias Kirchler & Uwe Ohler & Stefan Konigorski & Christoph Lippert, 2022. "Identifying interpretable gene-biomarker associations with functionally informed kernel-based tests in 190,000 exomes," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
  • Handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-32864-2
    DOI: 10.1038/s41467-022-32864-2

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-022-32864-2
    File Function: Abstract
    Download Restriction: no

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marcin Kierczak & Nima Rafati & Julia Höglund & Hadrien Gourlé & Valeria Lo Faro & Daniel Schmitz & Weronica E. Ek & Ulf Gyllensten & Stefan Enroth & Diana Ekman & Björn Nystedt & Torgny Karlsson & Ås, 2022. "Contribution of rare whole-genome sequencing variants to plasma protein levels and the missing heritability," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    2. Matthew Tegtmeyer & Jatin Arora & Samira Asgari & Beth A. Cimini & Ajay Nadig & Emily Peirent & Dhara Liyanage & Gregory P. Way & Erin Weisbart & Aparna Nathan & Tiffany Amariuta & Kevin Eggan & Marzi, 2024. "High-dimensional phenotyping to define the genetic basis of cellular morphology," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    3. Xiaoyi Raymond Gao & Marion Chiariglione & Alexander J. Arch, 2022. "Whole-exome sequencing study identifies rare variants and genes associated with intraocular pressure and glaucoma," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    4. Mihail Halachev & Viktoria-Eleni Gountouna & Alison Meynert & Gannie Tzoneva & Alan R. Shuldiner & Colin A. Semple & James F. Wilson, 2024. "Regionally enriched rare deleterious exonic variants in the UK and Ireland," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    5. Nazia Pathan & Wei Q. Deng & Matteo Di Scipio & Mohammad Khan & Shihong Mao & Robert W. Morton & Ricky Lali & Marie Pigeyre & Michael R. Chong & Guillaume Paré, 2024. "A method to estimate the contribution of rare coding variants to complex trait heritability," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    6. Matthias Wuttke & Eva König & Maria-Alexandra Katsara & Holger Kirsten & Saeed Khomeijani Farahani & Alexander Teumer & Yong Li & Martin Lang & Burulca Göcmen & Cristian Pattaro & Dorothee Günzel & An, 2023. "Imputation-powered whole-exome analysis identifies genes associated with kidney function and disease in the UK Biobank," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    7. Asmundur Oddsson & Patrick Sulem & Gardar Sveinbjornsson & Gudny A. Arnadottir & Valgerdur Steinthorsdottir & Gisli H. Halldorsson & Bjarni A. Atlason & Gudjon R. Oskarsson & Hannes Helgason & Henriet, 2023. "Deficit of homozygosity among 1.52 million individuals and genetic causes of recessive lethality," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    8. Natalie DeForest & Yuqi Wang & Zhiyi Zhu & Jacqueline S. Dron & Ryan Koesterer & Pradeep Natarajan & Jason Flannick & Tiffany Amariuta & Gina M. Peloso & Amit R. Majithia, 2024. "Genome-wide discovery and integrative genomic characterization of insulin resistance loci using serum triglycerides to HDL-cholesterol ratio as a proxy," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    9. Dick Schijven & Sourena Soheili-Nezhad & Simon E. Fisher & Clyde Francks, 2024. "Exome-wide analysis implicates rare protein-altering variants in human handedness," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    10. Erik Schoenmakers & Federica Marelli & Helle F. Jørgensen & W. Edward Visser & Carla Moran & Stefan Groeneweg & Carolina Avalos & Sean J. Jurgens & Nichola Figg & Alison Finigan & Neha Wali & Maura Ag, 2023. "Selenoprotein deficiency disorder predisposes to aortic aneurysm formation," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    11. Ruoyu Tian & Tian Ge & Hyeokmoon Kweon & Daniel B. Rocha & Max Lam & Jimmy Z. Liu & Kritika Singh & Daniel F. Levey & Joel Gelernter & Murray B. Stein & Ellen A. Tsai & Hailiang Huang & Christopher F., 2024. "Whole-exome sequencing in UK Biobank reveals rare genetic architecture for depression," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    12. Naman S. Shetty & Mokshad Gaonkar & Nirav Patel & Akhil Pampana & Nehal Vekariya & Peng Li & Garima Arora & Pankaj Arora, 2024. "Determinants of transthyretin levels and their association with adverse clinical outcomes among UK Biobank participants," Nature Communications, Nature, vol. 15(1), pages 1-7, December.
    13. Chang Lu & Jan Zaucha & Rihab Gam & Hai Fang & Smithers & Matt E. Oates & Miguel Bernabe-Rubio & James Williams & Natalie Zelenka & Arun Prasad Pandurangan & Himani Tandon & Hashem Shihab & Raju Kalai, 2023. "Hypothesis-free phenotype prediction within a genetics-first framework," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    14. Aimee M. Deaton & Aditi Dubey & Lucas D. Ward & Peter Dornbos & Jason Flannick & Elaine Yee & Simina Ticau & Leila Noetzli & Margaret M. Parker & Rachel A. Hoffing & Carissa Willis & Mollie E. Plekan , 2022. "Rare loss of function variants in the hepatokine gene INHBE protect from abdominal obesity," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    15. Young Jin Kim & Sanghoon Moon & Mi Yeong Hwang & Sohee Han & Hye-Mi Jang & Jinhwa Kong & Dong Mun Shin & Kyungheon Yoon & Sung Min Kim & Jong-Eun Lee & Anubha Mahajan & Hyun-Young Park & Mark I. McCar, 2022. "The contribution of common and rare genetic variants to variation in metabolic traits in 288,137 East Asians," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    16. Juan Lorenzo Rodriguez-Flores & Shareef Khalid & Neelroop Parikshak & Asif Rasheed & Bin Ye & Manav Kapoor & Joshua Backman & Farshid Sepehrband & Silvio Alessandro Di Gioia & Sahar Gelfman & Tanima D, 2024. "NOTCH3 p.Arg1231Cys is markedly enriched in South Asians and associated with stroke," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    17. Gareth Hawkes & Robin N. Beaumont & Zilin Li & Ravi Mandla & Xihao Li & Christine M. Albert & Donna K. Arnett & Allison E. Ashley-Koch & Aneel A. Ashrani & Kathleen C. Barnes & Eric Boerwinkle & Jenni, 2024. "Whole-genome sequencing in 333,100 individuals reveals rare non-coding single variant and aggregate associations with height," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    18. Hong Zhang & Zheyang Wu, 2023. "The generalized Fisher's combination and accurate p‐value calculation under dependence," Biometrics, The International Biometric Society, vol. 79(2), pages 1159-1172, June.
    19. Zhuoran Xu & Quan Li & Luigi Marchionni & Kai Wang, 2023. "PhenoSV: interpretable phenotype-aware model for the prioritization of genes affected by structural variants," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    20. Xiao-Yu He & Bang-Sheng Wu & Liu Yang & Yu Guo & Yue-Ting Deng & Ze-Yu Li & Chen-Jie Fei & Wei-Shi Liu & Yi-Jun Ge & Jujiao Kang & Jianfeng Feng & Wei Cheng & Qiang Dong & Jin-Tai Yu, 2024. "Genetic associations of protein-coding variants in venous thromboembolism," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
