
MAGMA: Generalized Gene-Set Analysis of GWAS Data

Author

Listed:
  • Christiaan A de Leeuw
  • Joris M Mooij
  • Tom Heskes
  • Danielle Posthuma

Abstract

By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn’s Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn’s Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn’s Disease data was found to be considerably faster as well.

Author Summary: Gene and gene-set analysis are statistical methods for analysing multiple genetic markers simultaneously to determine their joint effect. These methods can be used when the effects of individual markers are too weak to detect, which is a common problem when studying polygenic traits. Moreover, gene-set analysis can provide additional insight into functional and biological mechanisms underlying the genetic component of a trait. Although a number of methods for gene and gene-set analysis are available, they generally suffer from various statistical issues and can be very time-consuming to run. We have therefore developed a new method called MAGMA to address these issues, and have compared it to a number of existing tools. Our results show that MAGMA detects more associated genes and gene sets than other methods, and is also considerably faster. The way the method is set up also makes it highly flexible. This makes it suitable as a basis for more general statistical analyses aimed at investigating more complex research questions.
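
The two-layer design described in the abstract (a multiple regression per gene, then a regression across genes for the gene set) can be illustrated with a minimal sketch. This is not the MAGMA implementation: the function names `gene_f_statistic` and `gene_set_t_statistic`, the simulated genotypes, and the simulated gene Z-scores are all assumptions made for illustration. The step that turns gene p-values into Z-scores is left out, and nothing is done about strong linkage disequilibrium beyond using the rank of the design matrix for the degrees of freedom.

```python
import numpy as np


def gene_f_statistic(genotypes, phenotype):
    """Joint F statistic for all SNPs in one gene (hypothetical helper).

    Fits phenotype ~ intercept + all SNPs by ordinary least squares and
    compares it with the intercept-only model.
    """
    n = genotypes.shape[0]
    X = np.column_stack([np.ones(n), genotypes])
    beta, *_ = np.linalg.lstsq(X, phenotype, rcond=None)
    resid = phenotype - X @ beta
    rss_full = resid @ resid
    rss_null = np.sum((phenotype - phenotype.mean()) ** 2)
    # Degrees of freedom from the rank, so collinear SNPs are not double counted.
    df1 = np.linalg.matrix_rank(X) - 1
    df2 = n - df1 - 1
    f_stat = ((rss_null - rss_full) / df1) / (rss_full / df2)
    return f_stat, df1, df2


def gene_set_t_statistic(gene_z, in_set):
    """Slope and t statistic for gene_z ~ intercept + set indicator.

    gene_z is assumed to hold one association Z-score per gene (already
    derived from gene p-values); in_set marks gene-set membership with 0/1.
    A positive slope means genes in the set are more associated on average.
    """
    n = gene_z.shape[0]
    X = np.column_stack([np.ones(n), in_set.astype(float)])
    beta, *_ = np.linalg.lstsq(X, gene_z, rcond=None)
    resid = gene_z - X @ beta
    sigma2 = resid @ resid / (n - 2)
    cov = sigma2 * np.linalg.inv(X.T @ X)
    return beta[1], beta[1] / np.sqrt(cov[1, 1])


# Simulated data, for illustration only.
rng = np.random.default_rng(0)

# Layer 1: one gene with 5 SNPs, the first of which affects the phenotype.
G = rng.binomial(2, 0.3, size=(500, 5)).astype(float)
y = 1.0 * G[:, 0] + rng.normal(size=500)
f_stat, df1, df2 = gene_f_statistic(G, y)
print(f"gene F = {f_stat:.2f} on ({df1}, {df2}) degrees of freedom")

# Layer 2: 200 gene Z-scores, with the first 20 genes forming an enriched set.
z = rng.normal(size=200)
members = np.zeros(200)
members[:20] = 1
z[:20] += 2.0
slope, t_stat = gene_set_t_statistic(z, members)
print(f"gene-set slope = {slope:.2f}, t = {t_stat:.2f}")
```

Because the second layer is an ordinary regression on gene-level scores, adding further columns to its design matrix (a continuous gene property, or a second gene set) is what would extend the sketch toward the generalized analyses the abstract mentions.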

Suggested Citation

  • Christiaan A de Leeuw & Joris M Mooij & Tom Heskes & Danielle Posthuma, 2015. "MAGMA: Generalized Gene-Set Analysis of GWAS Data," PLOS Computational Biology, Public Library of Science, vol. 11(4), pages 1-19, April.
  • Handle: RePEc:plo:pcbi00:1004219
    DOI: 10.1371/journal.pcbi.1004219

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1004219
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1004219&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1004219?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Junjiao Feng & Liang Zhang & Chunhui Chen & Jintao Sheng & Zhifang Ye & Kanyin Feng & Jing Liu & Ying Cai & Bi Zhu & Zhaoxia Yu & Chuansheng Chen & Qi Dong & Gui Xue, 2022. "A cognitive neurogenetic approach to uncovering the structure of executive functions," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    2. David Lamparter & Daniel Marbach & Rico Rueedi & Zoltán Kutalik & Sven Bergmann, 2016. "Fast and Rigorous Computation of Gene and Pathway Scores from SNP-Based Summary Statistics," PLOS Computational Biology, Public Library of Science, vol. 12(1), pages 1-20, January.
    3. Olga A Vsevolozhskaya & Min Shi & Fengjiao Hu & Dmitri V Zaykin, 2020. "DOT: Gene-set analysis by combining decorrelated association statistics," PLOS Computational Biology, Public Library of Science, vol. 16(4), pages 1-25, April.
    4. Niina Sandholm & Rany M Salem & Amy Jayne McKnight & Eoin P Brennan & Carol Forsblom & Tamara Isakova & Gareth J McKay & Winfred W Williams & Denise M Sadlier & Ville-Petteri Mäkinen & Elizabeth J Swa, 2012. "New Susceptibility Loci Associated with Kidney Disease in Type 1 Diabetes," PLOS Genetics, Public Library of Science, vol. 8(9), pages 1-13, September.
    5. Kristina M. Garske & Asha Kar & Caroline Comenho & Brunilda Balliu & David Z. Pan & Yash V. Bhagat & Gregory Rosenberg & Amogha Koka & Sankha Subhra Das & Zong Miao & Janet S. Sinsheimer & Jaakko Kapr, 2023. "Increased body mass index is linked to systemic inflammation through altered chromatin co-accessibility in human preadipocytes," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    6. Joshua C Randall & Thomas W Winkler & Zoltán Kutalik & Sonja I Berndt & Anne U Jackson & Keri L Monda & Tuomas O Kilpeläinen & Tõnu Esko & Reedik Mägi & Shengxu Li & Tsegaselassie Workalemahu & Mary F, 2013. "Sex-stratified Genome-wide Association Studies Including 270,000 Individuals Show Sexual Dimorphism in Genetic Loci for Anthropometric Traits," PLOS Genetics, Public Library of Science, vol. 9(6), pages 1-19, June.
    7. Wimmer, Thomas & Geyer-Klingeberg, Jerome & Hütter, Marie & Schmid, Florian & Rathgeber, Andreas, 2021. "The impact of speculation on commodity prices: A Meta-Granger analysis," Journal of Commodity Markets, Elsevier, vol. 22(C).
    8. Benjamin Lehne & Cathryn M Lewis & Thomas Schlitt, 2011. "From SNPs to Genes: Disease Association at the Gene Level," PLOS ONE, Public Library of Science, vol. 6(6), pages 1-10, June.
    9. Xiaqiong Wang & Yalu Wen, 2020. "A Systematic Comparison of Methods Designed for Association Analysis with Multi-Omics Data," Biostatistics and Biometrics Open Access Journal, Juniper Publishers Inc., vol. 10(2), pages 30-40, August.
    10. Hong Zhang & Zheyang Wu, 2023. "The generalized Fisher's combination and accurate p‐value calculation under dependence," Biometrics, The International Biometric Society, vol. 79(2), pages 1159-1172, June.
    11. Nadja Knoll & Ivonne Jarick & Anna-Lena Volckmar & Martin Klingenspor & Thomas Illig & Harald Grallert & Christian Gieger & Heinz-Erich Wichmann & Annette Peters & Johannes Hebebrand & André Scherag &, 2013. "Gene Set of Nuclear-Encoded Mitochondrial Regulators Is Enriched for Common Inherited Variation in Obesity," PLOS ONE, Public Library of Science, vol. 8(2), pages 1-10, February.
    12. Lina Cai & Tomas Gonzales & Eleanor Wheeler & Nicola D. Kerrison & Felix R. Day & Claudia Langenberg & John R. B. Perry & Soren Brage & Nicholas J. Wareham, 2023. "Causal associations between cardiorespiratory fitness and type 2 diabetes," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    13. Ville-Petteri Mäkinen & Mete Civelek & Qingying Meng & Bin Zhang & Jun Zhu & Candace Levian & Tianxiao Huan & Ayellet V Segrè & Sujoy Ghosh & Juan Vivar & Majid Nikpay & Alexandre F R Stewart & Christ, 2014. "Integrative Genomics Reveals Novel Molecular Pathways and Gene Networks for Coronary Artery Disease," PLOS Genetics, Public Library of Science, vol. 10(7), pages 1-14, July.
