IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1006722.html
   My bibliography  Save this article

Identifying individual risk rare variants using protein structure guided local tests (POINT)

Author

Listed:
  • Rachel Marceau West
  • Wenbin Lu
  • Daniel M Rotroff
  • Melaine A Kuenemann
  • Sheng-Mao Chang
  • Michael C Wu
  • Michael J Wagner
  • John B Buse
  • Alison A Motsinger-Reif
  • Denis Fourches
  • Jung-Ying Tzeng

Abstract

Rare variants are of increasing interest to genetic association studies because of their etiological contributions to human complex diseases. Due to the rarity of the mutant events, rare variants are routinely analyzed on an aggregate level. While aggregation analyses improve the detection of global-level signal, they are not able to pinpoint causal variants within a variant set. To perform inference on a localized level, additional information, e.g., biological annotation, is often needed to boost the information content of a rare variant. Following the observation that important variants are likely to cluster together on functional domains, we propose a protein structure guided local test (POINT) to provide variant-specific association information using structure-guided aggregation of signal. Constructed under a kernel machine framework, POINT performs local association testing by borrowing information from neighboring variants in the 3-dimensional protein space in a data-adaptive fashion. Besides merely providing a list of promising variants, POINT assigns each variant a p-value to permit variant ranking and prioritization. We assess the selection performance of POINT using simulations and illustrate how it can be used to prioritize individual rare variants in PCSK9, ANGPTL4 and CETP in the Action to Control Cardiovascular Risk in Diabetes (ACCORD) clinical trial data.Author summary: While it is known that rare variants play an important role in understanding associations between genotype and complex diseases, pinpointing individual rare variants likely to be responsible for association is still a daunting task. Due to their low frequency in the population and reduced signal, localizing causal rare variants often requires additional information, such as type of DNA change or location of variant along the sequence, to be incorporated in a biologically meaningful fashion that does not overpower the genotype data. In this paper, we use the observation that important variants tend to cluster together on functional domains to propose a new approach for prioritizing rare variants: the protein structure guided local test (POINT). POINT uses a gene’s 3-dimensional protein folding structure to guide aggregation of information from neighboring variants in the protein in a robust manner. We show how POINT improves selection performance over existing methods. We further illustrate how it can be used to prioritize individual rare variants using the Action to Control Cardiovascular Risk in Diabetes (ACCORD) clinical trial data, finding promising variants within genes in association with lipoprotein-related outcomes.

Suggested Citation

  • Rachel Marceau West & Wenbin Lu & Daniel M Rotroff & Melaine A Kuenemann & Sheng-Mao Chang & Michael C Wu & Michael J Wagner & John B Buse & Alison A Motsinger-Reif & Denis Fourches & Jung-Ying Tzeng, 2019. "Identifying individual risk rare variants using protein structure guided local tests (POINT)," PLOS Computational Biology, Public Library of Science, vol. 15(2), pages 1-24, February.
  • Handle: RePEc:plo:pcbi00:1006722
    DOI: 10.1371/journal.pcbi.1006722
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1006722
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1006722&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1006722?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jung-Ying Tzeng & Daowen Zhang & Sheng-Mao Chang & Duncan C. Thomas & Marie Davidian, 2009. "Gene-Trait Similarity Regression for Multimarker-Based Association Analysis," Biometrics, The International Biometric Society, vol. 65(3), pages 822-832, September.
    2. Nanye Long & Samuel P Dickson & Jessica M Maia & Hee Shin Kim & Qianqian Zhu & Andrew S Allen, 2013. "Leveraging Prior Information to Detect Causal Variants via Multi-Variant Regression," PLOS Computational Biology, Public Library of Science, vol. 9(6), pages 1-11, June.
    3. Xinge Jessie Jeng & Zhongyin John Daye & Wenbin Lu & Jung-Ying Tzeng, 2016. "Rare Variants Association Analysis in Large-Scale Sequencing Studies at the Single Locus Level," PLOS Computational Biology, Public Library of Science, vol. 12(6), pages 1-23, June.
    4. Daniel D Kinnamon & Ray E Hershberger & Eden R Martin, 2012. "Reconsidering Association Testing Methods Using Single-Variant Test Statistics as Alternatives to Pooling Tests for Sequence Data with Rare Variants," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-15, February.
    5. Nengjun Yi & Nianjun Liu & Degui Zhi & Jun Li, 2011. "Hierarchical Generalized Linear Models for Multiple Groups of Rare and Common Variants: Jointly Estimating Group and Individual-Variant Effects," PLOS Genetics, Public Library of Science, vol. 7(12), pages 1-15, December.
    6. Thomas J Hoffmann & Nicholas J Marini & John S Witte, 2010. "Comprehensive Approach to Analyzing Rare Genetic Variants," PLOS ONE, Public Library of Science, vol. 5(11), pages 1-9, November.
    7. Wan-Yu Lin, 2014. "Association Testing of Clustered Rare Causal Variants in Case-Control Studies," PLOS ONE, Public Library of Science, vol. 9(4), pages 1-6, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nanye Long & Samuel P Dickson & Jessica M Maia & Hee Shin Kim & Qianqian Zhu & Andrew S Allen, 2013. "Leveraging Prior Information to Detect Causal Variants via Multi-Variant Regression," PLOS Computational Biology, Public Library of Science, vol. 9(6), pages 1-11, June.
    2. Xinge Jessie Jeng & Zhongyin John Daye & Wenbin Lu & Jung-Ying Tzeng, 2016. "Rare Variants Association Analysis in Large-Scale Sequencing Studies at the Single Locus Level," PLOS Computational Biology, Public Library of Science, vol. 12(6), pages 1-23, June.
    3. Faming Liang & Momiao Xiong, 2013. "Bayesian Detection of Causal Rare Variants under Posterior Consistency," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-16, July.
    4. Elodie Persyn & Richard Redon & Lise Bellanger & Christian Dina, 2018. "The impact of a fine-scale population stratification on rare variant association test results," PLOS ONE, Public Library of Science, vol. 13(12), pages 1-17, December.
    5. Rajesh Talluri & Sanjay Shete, 2013. "A Linkage Disequilibrium–Based Approach to Selecting Disease-Associated Rare Variants," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-6, July.
    6. Zihuai He & Min Zhang & Seunggeun Lee & Jennifer A. Smith & Xiuqing Guo & Walter Palmas & Sharon L. R. Kardia & Ana V. Diez Roux & Bhramar Mukherjee, 2015. "Set‐based tests for genetic association in longitudinal studies," Biometrics, The International Biometric Society, vol. 71(3), pages 606-615, September.
    7. Sunyoung Shin & Sündüz Keleş, 2017. "Annotation Regression for Genome-Wide Association Studies with an Application to Psychiatric Genomic Consortium Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(1), pages 50-72, June.
    8. Zihuai He & Min Zhang & Xiaowei Zhan & Qing Lu, 2014. "Modeling and testing for joint association using a genetic random field model," Biometrics, The International Biometric Society, vol. 70(3), pages 471-479, September.
    9. Gourab De & Wai-Ki Yip & Iuliana Ionita-Laza & Nan Laird, 2013. "Rare Variant Analysis for Family-Based Design," PLOS ONE, Public Library of Science, vol. 8(1), pages 1-9, January.
    10. Timothy D O’Connor & Adam Kiezun & Michael Bamshad & Stephen S Rich & Joshua D Smith & Emily Turner & NHLBIGO Exome Sequencing Project & ESP Population Genetics, Statistical Analysis Working Group & S, 2013. "Fine-Scale Patterns of Population Stratification Confound Rare Variant Association Tests," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-10, July.
    11. Daniel D Kinnamon & Ray E Hershberger & Eden R Martin, 2012. "Reconsidering Association Testing Methods Using Single-Variant Test Statistics as Alternatives to Pooling Tests for Sequence Data with Rare Variants," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-15, February.
    12. Yi Nengjun & Xu Shizhong & Lou Xiang-Yang & Mallick Himel, 2014. "Multiple comparisons in genetic association studies: a hierarchical modeling approach," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 13(1), pages 35-48, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1006722. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.