IDEAS home Printed from https://ideas.repec.org/a/plo/pgen00/1002625.html
   My bibliography  Save this article

Improved Statistics for Genome-Wide Interaction Analysis

Author

Listed:
  • Masao Ueki
  • Heather J Cordell

Abstract

Recently, Wu and colleagues [1] proposed two novel statistics for genome-wide interaction analysis using case/control or case-only data. In computer simulations, their proposed case/control statistic outperformed competing approaches, including the fast-epistasis option in PLINK and logistic regression analysis under the correct model; however, reasons for its superior performance were not fully explored. Here we investigate the theoretical properties and performance of Wu et al.'s proposed statistics and explain why, in some circumstances, they outperform competing approaches. Unfortunately, we find minor errors in the formulae for their statistics, resulting in tests that have higher than nominal type 1 error. We also find minor errors in PLINK's fast-epistasis and case-only statistics, although theory and simulations suggest that these errors have only negligible effect on type 1 error. We propose adjusted versions of all four statistics that, both theoretically and in computer simulations, maintain correct type 1 error rates under the null hypothesis. We also investigate statistics based on correlation coefficients that maintain similar control of type 1 error. Although designed to test specifically for interaction, we show that some of these previously-proposed statistics can, in fact, be sensitive to main effects at one or both loci, particularly in the presence of linkage disequilibrium. We propose two new “joint effects” statistics that, provided the disease is rare, are sensitive only to genuine interaction effects. In computer simulations we find, in most situations considered, that highest power is achieved by analysis under the correct genetic model. Such an analysis is unachievable in practice, as we do not know this model. However, generally high power over a wide range of scenarios is exhibited by our joint effects and adjusted Wu statistics. We recommend use of these alternative or adjusted statistics and urge caution when using Wu et al.'s originally-proposed statistics, on account of the inflated error rate that can result. Author Summary: Gene–gene interactions are a topic of great interest to geneticists carrying out studies of how genetic factors influence the development of common, complex diseases. Genes that interact may not only make important biological contributions to underlying disease processes, but also be more difficult to detect when using standard statistical methods in which we examine the effects of genetic factors one at a time. Recently a method was proposed by Wu and colleagues [1] for detecting pairwise interactions when carrying out genome-wide association studies (in which a large number of genetic variants across the genome are examined). Wu and colleagues carried out theoretical work and computer simulations that suggested their method outperformed other previously proposed approaches for detecting interactions. Here we show that, in fact, the method proposed by Wu and colleagues can result in an over-preponderence of false postive findings. We propose an adjusted version of their method that reduces the false positive rate while maintaining high power. We also propose a new method for detecting pairs of genetic effects that shows similarly high power but has some conceptual advantages over both Wu's method and also other previously proposed approaches.

Suggested Citation

  • Masao Ueki & Heather J Cordell, 2012. "Improved Statistics for Genome-Wide Interaction Analysis," PLOS Genetics, Public Library of Science, vol. 8(4), pages 1-19, April.
  • Handle: RePEc:plo:pgen00:1002625
    DOI: 10.1371/journal.pgen.1002625
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1002625
    Download Restriction: no

    File URL: https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1002625&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pgen.1002625?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Xuesen Wu & Hua Dong & Li Luo & Yun Zhu & Gang Peng & John D Reveille & Momiao Xiong, 2010. "A Novel Statistic for Genome-Wide Interaction Analysis," PLOS Genetics, Public Library of Science, vol. 6(9), pages 1-15, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Dominic Russ & John A Williams & Victor Roth Cardoso & Laura Bravo-Merodio & Samantha C Pendleton & Furqan Aziz & Animesh Acharjee & Georgios V Gkoutos, 2022. "Evaluating the detection ability of a range of epistasis detection methods on simulated data for pure and impure epistatic models," PLOS ONE, Public Library of Science, vol. 17(2), pages 1-19, February.
    2. Emily Mathieu, 2016. "AGGrEGATOr: A Gene-based GEne-Gene interActTiOn test for case-control association studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 15(2), pages 151-171, April.
    3. Mattias Frånberg & Rona J Strawbridge & Anders Hamsten & PROCARDIS consortium & Ulf de Faire & Jens Lagergren & Bengt Sennblad, 2017. "Fast and general tests of genetic interaction for genome-wide association studies," PLOS Computational Biology, Public Library of Science, vol. 13(6), pages 1-29, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qingrun Zhang & Quan Long & Jurg Ott, 2014. "AprioriGWAS, a New Pattern Mining Strategy for Detecting Genetic Variants Associated with Disease through Interaction Effects," PLOS Computational Biology, Public Library of Science, vol. 10(6), pages 1-14, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgen00:1002625. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosgenetics (email available below). General contact details of provider: https://journals.plos.org/plosgenetics/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.