IDEAS home Printed from https://ideas.repec.org/a/plo/pgen00/1000534.html
   My bibliography  Save this article

Identifying Relationships among Genomic Disease Regions: Predicting Genes at Pathogenic SNP Associations and Rare Deletions

Author

Listed:
  • Soumya Raychaudhuri
  • Robert M Plenge
  • Elizabeth J Rossin
  • Aylwin C Y Ng
  • International Schizophrenia Consortium
  • Shaun M Purcell
  • Pamela Sklar
  • Edward M Scolnick
  • Ramnik J Xavier
  • David Altshuler
  • Mark J Daly

Abstract

Translating a set of disease regions into insight about pathogenic mechanisms requires not only the ability to identify the key disease genes within them, but also the biological relationships among those key genes. Here we describe a statistical method, Gene Relationships Among Implicated Loci (GRAIL), that takes a list of disease regions and automatically assesses the degree of relatedness of implicated genes using 250,000 PubMed abstracts. We first evaluated GRAIL by assessing its ability to identify subsets of highly related genes in common pathways from validated lipid and height SNP associations from recent genome-wide studies. We then tested GRAIL, by assessing its ability to separate true disease regions from many false positive disease regions in two separate practical applications in human genetics. First, we took 74 nominally associated Crohn's disease SNPs and applied GRAIL to identify a subset of 13 SNPs with highly related genes. Of these, ten convincingly validated in follow-up genotyping; genotyping results for the remaining three were inconclusive. Next, we applied GRAIL to 165 rare deletion events seen in schizophrenia cases (less than one-third of which are contributing to disease risk). We demonstrate that GRAIL is able to identify a subset of 16 deletions containing highly related genes; many of these genes are expressed in the central nervous system and play a role in neuronal synapses. GRAIL offers a statistically robust approach to identifying functionally related genes from across multiple disease regions—that likely represent key disease pathways. An online version of this method is available for public use (http://www.broad.mit.edu/mpg/grail/).Author Summary: Modern genetic studies, including genome-wide surveys for disease-associated loci and copy number variation, provide a list of critical genomic regions that play an important role in predisposition to disease. Using these regions to understand disease pathogenesis requires the ability to first distinguish causal genes from other nearby genes spuriously contained within these regions. To do this we must identify the key pathways suggested by those causal genes. In this manuscript we describe a statistical approach, Gene Relationships Across Implicated Loci (GRAIL), to achieve this task. It starts with genomic regions and identifies related subsets of genes involved in similar biological processes—these genes highlight the likely causal genes and the key pathways. GRAIL uses abstracts from the entirety of the published scientific literature about the genes to look for potential relationships between genes. We apply GRAIL to four very different phenotypes. In each case we identify a subset of highly related genes; in cases where false positive regions are present, GRAIL is able to separate out likely true positives. GRAIL therefore offers the potential to translate disease genomic regions from unbiased genomic surveys into the key processes that may be critical to the disease.

Suggested Citation

  • Soumya Raychaudhuri & Robert M Plenge & Elizabeth J Rossin & Aylwin C Y Ng & International Schizophrenia Consortium & Shaun M Purcell & Pamela Sklar & Edward M Scolnick & Ramnik J Xavier & David Altsh, 2009. "Identifying Relationships among Genomic Disease Regions: Predicting Genes at Pathogenic SNP Associations and Rare Deletions," PLOS Genetics, Public Library of Science, vol. 5(6), pages 1-15, June.
  • Handle: RePEc:plo:pgen00:1000534
    DOI: 10.1371/journal.pgen.1000534
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1000534
    Download Restriction: no

    File URL: https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1000534&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pgen.1000534?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Hui Pan & Bo-Shiun Yan & Mauricio Rojas & Yuriy V. Shebzukhov & Hongwei Zhou & Lester Kobzik & Darren E. Higgins & Mark J. Daly & Barry R. Bloom & Igor Kramnik, 2005. "Ipr1 gene mediates innate immunity to tuberculosis," Nature, Nature, vol. 434(7034), pages 767-772, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Michael A Nalls & David J Couper & Toshiko Tanaka & Frank J A van Rooij & Ming-Huei Chen & Albert V Smith & Daniela Toniolo & Neil A Zakai & Qiong Yang & Andreas Greinacher & Andrew R Wood & Melissa G, 2011. "Multiple Loci Are Associated with White Blood Cell Phenotypes," PLOS Genetics, Public Library of Science, vol. 7(6), pages 1-16, June.
    2. Gavin Band & Quang Si Le & Luke Jostins & Matti Pirinen & Katja Kivinen & Muminatou Jallow & Fatoumatta Sisay-Joof & Kalifa Bojang & Margaret Pinder & Giorgio Sirugo & David J Conway & Vysaul Nyirongo, 2013. "Imputation-Based Meta-Analysis of Severe Malaria in Three African Populations," PLOS Genetics, Public Library of Science, vol. 9(5), pages 1-13, May.
    3. Alexandra Zhernakova & Eli A Stahl & Gosia Trynka & Soumya Raychaudhuri & Eleanora A Festen & Lude Franke & Harm-Jan Westra & Rudolf S N Fehrmann & Fina A S Kurreeman & Brian Thomson & Namrata Gupta &, 2011. "Meta-Analysis of Genome-Wide Association Studies in Celiac Disease and Rheumatoid Arthritis Identifies Fourteen Non-HLA Shared Loci," PLOS Genetics, Public Library of Science, vol. 7(2), pages 1-13, February.
    4. Xiaofeng Dai & Wenwen Guo & Chunjun Zhan & Xiuxia Liu & Zhonghu Bai & Yankun Yang, 2015. "WDR5 Expression Is Prognostic of Breast Cancer Outcome," PLOS ONE, Public Library of Science, vol. 10(9), pages 1-15, September.
    5. Benjamin Lehne & Cathryn M Lewis & Thomas Schlitt, 2011. "From SNPs to Genes: Disease Association at the Gene Level," PLOS ONE, Public Library of Science, vol. 6(6), pages 1-10, June.
    6. Mohd Khanapi Abd Ghani & Nasir G. Noma & Mazin Abed Mohammed & Karrar Hameed Abdulkareem & Begonya Garcia-Zapirain & Mashael S. Maashi & Salama A. Mostafa, 2021. "Innovative Artificial Intelligence Approach for Hearing-Loss Symptoms Identification Model Using Machine Learning Techniques," Sustainability, MDPI, vol. 13(10), pages 1-30, May.
    7. Kerstin Becker & Sabine Siegert & Mohammad Reza Toliat & Juanjiangmeng Du & Ramona Casper & Guido H Dolmans & Paul M Werker & Sigrid Tinschert & Andre Franke & Christian Gieger & Konstantin Strauch & , 2016. "Meta-Analysis of Genome-Wide Association Studies and Network Analysis-Based Integration with Gene Expression Data Identify New Suggestive Loci and Unravel a Wnt-Centric Network Associated with Dupuytr," PLOS ONE, Public Library of Science, vol. 11(7), pages 1-18, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gregory J Fox & Dinh Ngoc Sy & Nguyen Viet Nhung & Bing Yu & Magda K Ellis & Nguyen Van Hung & Nguyen Kim Cuong & Luu Thi Lien & Guy B Marks & Bernadette M Saunders & Warwick J Britton, 2014. "Polymorphisms of SP110 Are Associated with both Pulmonary and Extra-Pulmonary Tuberculosis among the Vietnamese," PLOS ONE, Public Library of Science, vol. 9(7), pages 1-9, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgen00:1000534. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosgenetics (email available below). General contact details of provider: https://journals.plos.org/plosgenetics/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.