IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1000641.html
   My bibliography  Save this article

Associating Genes and Protein Complexes with Disease via Network Propagation

Author

Listed:
  • Oron Vanunu
  • Oded Magger
  • Eytan Ruppin
  • Tomer Shlomi
  • Roded Sharan

Abstract

A fundamental challenge in human health is the identification of disease-causing genes. Recently, several studies have tackled this challenge via a network-based approach, motivated by the observation that genes causing the same or similar diseases tend to lie close to one another in a network of protein-protein or functional interactions. However, most of these approaches use only local network information in the inference process and are restricted to inferring single gene associations. Here, we provide a global, network-based method for prioritizing disease genes and inferring protein complex associations, which we call PRINCE. The method is based on formulating constraints on the prioritization function that relate to its smoothness over the network and usage of prior information. We exploit this function to predict not only genes but also protein complex associations with a disease of interest. We test our method on gene-disease association data, evaluating both the prioritization achieved and the protein complexes inferred. We show that our method outperforms extant approaches in both tasks. Using data on 1,369 diseases from the OMIM knowledgebase, our method is able (in a cross validation setting) to rank the true causal gene first for 34% of the diseases, and infer 139 disease-related complexes that are highly coherent in terms of the function, expression and conservation of their member proteins. Importantly, we apply our method to study three multi-factorial diseases for which some causal genes have been found already: prostate cancer, alzheimer and type 2 diabetes mellitus. PRINCE's predictions for these diseases highly match the known literature, suggesting several novel causal genes and protein complexes for further investigation.Author Summary: Understanding the genetic background of diseases is crucial to medical research, with implications in diagnosis, treatment and drug development. As molecular approaches to this challenge are time consuming and costly, computational approaches offer an efficient alternative. Such approaches aim at prioritizing genes in a genomic interval of interest according to their predicted strength-of-association with a given disease. State-of-the-art prioritization problems are based on the observation that genes causing similar diseases tend to lie close to one another in a network of protein-protein interactions. Here we develop a novel prioritization approach that uses the network data in a global manner and can tie not only single genes but also whole protein machineries with a given disease. Our method, PRINCE, is shown to outperform previous methods in both the gene prioritization task and the protein complex task. Applying PRINCE to prostate cancer, alzheimer's disease and type 2 diabetes, we are able to infer new causal genes and related protein complexes with high confidence.

Suggested Citation

  • Oron Vanunu & Oded Magger & Eytan Ruppin & Tomer Shlomi & Roded Sharan, 2010. "Associating Genes and Protein Complexes with Disease via Network Propagation," PLOS Computational Biology, Public Library of Science, vol. 6(1), pages 1-9, January.
  • Handle: RePEc:plo:pcbi00:1000641
    DOI: 10.1371/journal.pcbi.1000641
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1000641
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1000641&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1000641?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ke Hu & Ju Xiang & Yun-Xia Yu & Liang Tang & Qin Xiang & Jian-Ming Li & Yong-Hong Tang & Yong-Jun Chen & Yan Zhang, 2020. "Significance-based multi-scale method for network community detection and its application in disease-gene prediction," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-24, March.
    2. T M Murali & Matthew D Dyer & David Badger & Brett M Tyler & Michael G Katze, 2011. "Network-Based Prediction and Analysis of HIV Dependency Factors," PLOS Computational Biology, Public Library of Science, vol. 7(9), pages 1-15, September.
    3. Deborah Chasman & Brandi Gancarz & Linhui Hao & Michael Ferris & Paul Ahlquist & Mark Craven, 2014. "Inferring Host Gene Subnetworks Involved in Viral Replication," PLOS Computational Biology, Public Library of Science, vol. 10(5), pages 1-22, May.
    4. Elisa Salviato & Vera Djordjilović & Monica Chiogna & Chiara Romualdi, 2019. "SourceSet: A graphical model approach to identify primary genes in perturbed biological pathways," PLOS Computational Biology, Public Library of Science, vol. 15(10), pages 1-28, October.
    5. Xing Chen & Jun Yin & Jia Qu & Li Huang, 2018. "MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction," PLOS Computational Biology, Public Library of Science, vol. 14(8), pages 1-24, August.
    6. Daniel E Carlin & Barry Demchak & Dexter Pratt & Eric Sage & Trey Ideker, 2017. "Network propagation in the cytoscape cyberinfrastructure," PLOS Computational Biology, Public Library of Science, vol. 13(10), pages 1-9, October.
    7. Cui, Ying & Cai, Meng & Stanley, H. Eugene, 2018. "Discovering disease-associated genes in weighted protein–protein interaction networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 496(C), pages 53-61.
    8. Li-Chen Hung & Pei-Tseng Kung & Chi-Hsuan Lung & Ming-Hsui Tsai & Shih-An Liu & Li-Ting Chiu & Kuang-Hua Huang & Wen-Chen Tsai, 2020. "Assessment of the Risk of Oral Cancer Incidence in A High-Risk Population and Establishment of A Predictive Model for Oral Cancer Incidence Using A Population-Based Cohort in Taiwan," IJERPH, MDPI, vol. 17(2), pages 1-15, January.
    9. Juan J Cáceres & Alberto Paccanaro, 2019. "Disease gene prediction for molecularly uncharacterized diseases," PLOS Computational Biology, Public Library of Science, vol. 15(7), pages 1-14, July.
    10. Joana P Gonçalves & Alexandre P Francisco & Yves Moreau & Sara C Madeira, 2012. "Interactogeneous: Disease Gene Prioritization Using Heterogeneous Networks and Full Topology Scores," PLOS ONE, Public Library of Science, vol. 7(11), pages 1-13, November.
    11. Mengyun Yang & Huimin Luo & Yaohang Li & Fang-Xiang Wu & Jianxin Wang, 2019. "Overlap matrix completion for predicting drug-associated indications," PLOS Computational Biology, Public Library of Science, vol. 15(12), pages 1-21, December.
    12. Abby Hill & Scott Gleim & Florian Kiefer & Frederic Sigoillot & Joseph Loureiro & Jeremy Jenkins & Melody K Morris, 2019. "Benchmarking network algorithms for contextualizing genes of interest," PLOS Computational Biology, Public Library of Science, vol. 15(12), pages 1-14, December.
    13. Jianhua Li & Xiaoyan Lin & Yueyang Teng & Shouliang Qi & Dayu Xiao & Jianying Zhang & Yan Kang, 2016. "A Comprehensive Evaluation of Disease Phenotype Networks for Gene Prioritization," PLOS ONE, Public Library of Science, vol. 11(7), pages 1-18, July.
    14. U Martin Singh-Blom & Nagarajan Natarajan & Ambuj Tewari & John O Woods & Inderjit S Dhillon & Edward M Marcotte, 2013. "Prediction and Validation of Gene-Disease Associations Using Methods Inspired by Social Network Analyses," PLOS ONE, Public Library of Science, vol. 8(5), pages 1-17, May.
    15. Konstantina Charmpi & Manopriya Chokkalingam & Ronja Johnen & Andreas Beyer, 2021. "Optimizing network propagation for multi-omics data integration," PLOS Computational Biology, Public Library of Science, vol. 17(11), pages 1-26, November.
    16. Le Ou-Yang & Dao-Qing Dai & Xiao-Fei Zhang, 2013. "Protein Complex Detection via Weighted Ensemble Clustering Based on Bayesian Nonnegative Matrix Factorization," PLOS ONE, Public Library of Science, vol. 8(5), pages 1-18, May.
    17. Florin Ratajczak & Mitchell Joblin & Marcel Hildebrandt & Martin Ringsquandl & Pascal Falter-Braun & Matthias Heinig, 2023. "Speos: an ensemble graph representation learning framework to predict core gene candidates for complex diseases," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    18. MaoQiang Xie & YingJie Xu & YaoGong Zhang & TaeHyun Hwang & Rui Kuang, 2015. "Network-based Phenome-Genome Association Prediction by Bi-Random Walk," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-18, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1000641. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.