IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0036662.html
   My bibliography  Save this article

A Two-Stage Random Forest-Based Pathway Analysis Method

Author

Listed:
  • Ren-Hua Chung
  • Ying-Erh Chen

Abstract

Pathway analysis provides a powerful approach for identifying the joint effect of genes grouped into biologically-based pathways on disease. Pathway analysis is also an attractive approach for a secondary analysis of genome-wide association study (GWAS) data that may still yield new results from these valuable datasets. Most of the current pathway analysis methods focused on testing the cumulative main effects of genes in a pathway. However, for complex diseases, gene-gene interactions are expected to play a critical role in disease etiology. We extended a random forest-based method for pathway analysis by incorporating a two-stage design. We used simulations to verify that the proposed method has the correct type I error rates. We also used simulations to show that the method is more powerful than the original random forest-based pathway approach and the set-based test implemented in PLINK in the presence of gene-gene interactions. Finally, we applied the method to a breast cancer GWAS dataset and a lung cancer GWAS dataset and interesting pathways were identified that have implications for breast and lung cancers.

Suggested Citation

  • Ren-Hua Chung & Ying-Erh Chen, 2012. "A Two-Stage Random Forest-Based Pathway Analysis Method," PLOS ONE, Public Library of Science, vol. 7(5), pages 1-6, May.
  • Handle: RePEc:plo:pone00:0036662
    DOI: 10.1371/journal.pone.0036662
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0036662
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0036662&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0036662?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Hariklia Eleftherohorinou & Victoria Wright & Clive Hoggart & Anna-Liisa Hartikainen & Marjo-Riitta Jarvelin & David Balding & Lachlan Coin & Michael Levin, 2009. "Pathway Analysis of GWAS Provides New Insights into Genetic Susceptibility to 3 Inflammatory Diseases," PLOS ONE, Public Library of Science, vol. 4(11), pages 1-11, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ohad Manor & Eran Segal, 2013. "Predicting Disease Risk Using Bootstrap Ranking and Classification Algorithms," PLOS Computational Biology, Public Library of Science, vol. 9(8), pages 1-10, August.
    2. Silver Matt & Montana Giovanni & Alzheimer's Disease Neuroimaging Initiative, 2012. "Fast Identification of Biological Pathways Associated with a Quantitative Trait Using Group Lasso with Overlaps," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(1), pages 1-43, January.
    3. Yvonne J K Edwards & Gary W Beecham & William K Scott & Sawsan Khuri & Guney Bademci & Demet Tekin & Eden R Martin & Zhijie Jiang & Deborah C Mash & Jarlath ffrench-Mullen & Margaret A Pericak-Vance &, 2011. "Identifying Consensus Disease Pathways in Parkinson's Disease Using an Integrative Systems Biology Approach," PLOS ONE, Public Library of Science, vol. 6(2), pages 1-11, February.
    4. Zhang Tian-Xiao & Beaty Terri H. & Ruczinski Ingo, 2012. "Candidate Pathway Based Analysis for Cleft Lip with or without Cleft Palate," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(2), pages 1-21, January.
    5. Ayellet V Segrè & DIAGRAM Consortium & MAGIC investigators & Leif Groop & Vamsi K Mootha & Mark J Daly & David Altshuler, 2010. "Common Inherited Variation in Mitochondrial Genes Is Not Enriched for Associations with Type 2 Diabetes or Related Glycemic Traits," PLOS Genetics, Public Library of Science, vol. 6(8), pages 1-19, August.
    6. Nerea Bartolomé & Sergi Segarra & Marta Artieda & Olga Francino & Elisenda Sánchez & Magdalena Szczypiorska & Joaquim Casellas & Diego Tejedor & Joaquín Cerdeira & Antonio Martínez & Alfonso Velasco &, 2015. "A Genetic Predictive Model for Canine Hip Dysplasia: Integration of Genome Wide Association Study (GWAS) and Candidate Gene Approaches," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-13, April.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0036662. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.