IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v8y2017i1d10.1038_s41467-017-00141-2.html
   My bibliography  Save this article

Annotating pathogenic non-coding variants in genic regions

Author

Listed:
  • Sahar Gelfman

    (Columbia University Medical Center
    Columbia University Medical Center)

  • Quanli Wang

    (Columbia University Medical Center
    Columbia University Medical Center)

  • K. Melodi McSweeney

    (Columbia University Medical Center
    Columbia University Medical Center)

  • Zhong Ren

    (Columbia University Medical Center
    Columbia University Medical Center)

  • Francesca La Carpia

    (Columbia University Medical Center)

  • Matt Halvorsen

    (Columbia University Medical Center
    Columbia University Medical Center)

  • Kelly Schoch

    (Duke University Health System)

  • Fanni Ratzon

    (Lenox Hill Hospital)

  • Erin L. Heinzen

    (Columbia University Medical Center
    Columbia University Medical Center)

  • Michael J. Boland

    (Columbia University Medical Center
    Columbia University)

  • Slavé Petrovski

    (Columbia University Medical Center
    University of Melbourne)

  • David B. Goldstein

    (Columbia University Medical Center
    Columbia University Medical Center)

Abstract

Identifying the underlying causes of disease requires accurate interpretation of genetic variants. Current methods ineffectively capture pathogenic non-coding variants in genic regions, resulting in overlooking synonymous and intronic variants when searching for disease risk. Here we present the Transcript-inferred Pathogenicity (TraP) score, which uses sequence context alterations to reliably identify non-coding variation that causes disease. High TraP scores single out extremely rare variants with lower minor allele frequencies than missense variants. TraP accurately distinguishes known pathogenic and benign variants in synonymous (AUC = 0.88) and intronic (AUC = 0.83) public datasets, dismissing benign variants with exceptionally high specificity. TraP analysis of 843 exomes from epilepsy family trios identifies synonymous variants in known epilepsy genes, thus pinpointing risk factors of disease from non-coding sequence data. TraP outperforms leading methods in identifying non-coding variants that are pathogenic and is therefore a valuable tool for use in gene discovery and the interpretation of personal genomes.

Suggested Citation

  • Sahar Gelfman & Quanli Wang & K. Melodi McSweeney & Zhong Ren & Francesca La Carpia & Matt Halvorsen & Kelly Schoch & Fanni Ratzon & Erin L. Heinzen & Michael J. Boland & Slavé Petrovski & David B. Go, 2017. "Annotating pathogenic non-coding variants in genic regions," Nature Communications, Nature, vol. 8(1), pages 1-11, December.
  • Handle: RePEc:nat:natcom:v:8:y:2017:i:1:d:10.1038_s41467-017-00141-2
    DOI: 10.1038/s41467-017-00141-2
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-017-00141-2
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-017-00141-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Stephanie O. Erjavec & Sahar Gelfman & Alexa R. Abdelaziz & Eunice Y. Lee & Isha Monga & Anna Alkelai & Iuliana Ionita-Laza & Lynn Petukhova & Angela M. Christiano, 2022. "Whole exome sequencing in Alopecia Areata identifies rare variants in KRT82," Nature Communications, Nature, vol. 13(1), pages 1-13, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:8:y:2017:i:1:d:10.1038_s41467-017-00141-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.