IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1006418.html
   My bibliography  Save this article

MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction

Author

Listed:
  • Xing Chen
  • Jun Yin
  • Jia Qu
  • Li Huang

Abstract

Recently, a growing number of biological research and scientific experiments have demonstrated that microRNA (miRNA) affects the development of human complex diseases. Discovering miRNA-disease associations plays an increasingly vital role in devising diagnostic and therapeutic tools for diseases. However, since uncovering associations via experimental methods is expensive and time-consuming, novel and effective computational methods for association prediction are in demand. In this study, we developed a computational model of Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction (MDHGI) to discover new miRNA-disease associations by integrating the predicted association probability obtained from matrix decomposition through sparse learning method, the miRNA functional similarity, the disease semantic similarity, and the Gaussian interaction profile kernel similarity for diseases and miRNAs into a heterogeneous network. Compared with previous computational models based on heterogeneous networks, our model took full advantage of matrix decomposition before the construction of heterogeneous network, thereby improving the prediction accuracy. MDHGI obtained AUCs of 0.8945 and 0.8240 in the global and the local leave-one-out cross validation, respectively. Moreover, the AUC of 0.8794+/-0.0021 in 5-fold cross validation confirmed its stability of predictive performance. In addition, to further evaluate the model's accuracy, we applied MDHGI to four important human cancers in three different kinds of case studies. In the first type, 98% (Esophageal Neoplasms) and 98% (Lymphoma) of top 50 predicted miRNAs have been confirmed by at least one of the two databases (dbDEMC and miR2Disease) or at least one experimental literature in PubMed. In the second type of case study, what made a difference was that we removed all known associations between the miRNAs and Lung Neoplasms before implementing MDHGI on Lung Neoplasms. As a result, 100% (Lung Neoplasms) of top 50 related miRNAs have been indexed by at least one of the three databases (dbDEMC, miR2Disease and HMDD V2.0) or at least one experimental literature in PubMed. Furthermore, we also tested our prediction method on the HMDD V1.0 database to prove the applicability of MDHGI to different datasets. The results showed that 50 out of top 50 miRNAs related with the breast neoplasms were validated by at least one of the three databases (HMDD V2.0, dbDEMC, and miR2Disease) or at least one experimental literature.Author summary: Identifying potential miRNA-disease associations enhances the understanding towards molecular mechanisms and pathogenesis of diseases, which is beneficial for the development of diagnostic/treatment tools for diseases. Compared with traditional experiment methods, computational models can help experimenters reduce the cost of money and time. In order to computationally predict potential miRNA-disease associations, we developed MDHGI by combining the sparse learning method with the heterogeneous graph inference method. We performed MDHGI on different database and the experiment results indicated that MDHGI had significant advantages over previous methods both in leave-one-out cross validation and 5-fold cross validation. Besides, we also carried out three different kinds of case studies on four important human complex diseases to further demonstrate the prediction accuracy of MDHGI. In consequence, 98%, 98%, 100% and 100% out of the top 50 candidate miRNAs for the four diseases were confirmed by different databases or experimental literatures in PubMed, respectively. Thus, it could be concluded that MDHGI could make reliable predictions and should serve as an effective tool for predicting potential miRNA-disease associations.

Suggested Citation

  • Xing Chen & Jun Yin & Jia Qu & Li Huang, 2018. "MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction," PLOS Computational Biology, Public Library of Science, vol. 14(8), pages 1-24, August.
  • Handle: RePEc:plo:pcbi00:1006418
    DOI: 10.1371/journal.pcbi.1006418
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1006418
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1006418&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1006418?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Fan Meng & Xiaomei Yang & Chenghu Zhou, 2014. "The Augmented Lagrange Multipliers Method for Matrix Completion from Corrupted Samplings with Application to Mixed Gaussian-Impulse Noise Removal," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-11, September.
    2. Gunter Meister & Thomas Tuschl, 2004. "Mechanisms of gene silencing by double-stranded RNA," Nature, Nature, vol. 431(7006), pages 343-349, September.
    3. Oron Vanunu & Oded Magger & Eytan Ruppin & Tomer Shlomi & Roded Sharan, 2010. "Associating Genes and Protein Complexes with Disease via Network Propagation," PLOS Computational Biology, Public Library of Science, vol. 6(1), pages 1-9, January.
    4. Victor Ambros, 2004. "The functions of animal microRNAs," Nature, Nature, vol. 431(7006), pages 350-355, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Xie, Guobo & Wu, Lifeng & Lin, Zhiyi & Cui, Ji, 2020. "WLDAP: A computational model of weighted lncRNA-disease associations prediction," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 558(C).
    2. Ang Li & Yingwei Deng & Yan Tan & Min Chen, 2021. "A novel miRNA-disease association prediction model using dual random walk with restart and space projection federated method," PLOS ONE, Public Library of Science, vol. 16(6), pages 1-17, June.
    3. Cheng Liang & Shengpeng Yu & Jiawei Luo, 2019. "Adaptive multi-view multi-label learning for identifying disease-associated candidate miRNAs," PLOS Computational Biology, Public Library of Science, vol. 15(4), pages 1-18, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xing Chen & Li Huang, 2017. "LRSSLMDA: Laplacian Regularized Sparse Subspace Learning for MiRNA-Disease Association prediction," PLOS Computational Biology, Public Library of Science, vol. 13(12), pages 1-28, December.
    2. Thuc Duy Le & Junpeng Zhang & Lin Liu & Jiuyong Li, 2015. "Ensemble Methods for MiRNA Target Prediction from Expression Data," PLOS ONE, Public Library of Science, vol. 10(6), pages 1-19, June.
    3. Zhen Shen & You-Hua Zhang & Kyungsook Han & Asoke K. Nandi & Barry Honig & De-Shuang Huang, 2017. "miRNA-Disease Association Prediction with Collaborative Matrix Factorization," Complexity, Hindawi, vol. 2017, pages 1-9, September.
    4. Ke Hu & Ju Xiang & Yun-Xia Yu & Liang Tang & Qin Xiang & Jian-Ming Li & Yong-Hong Tang & Yong-Jun Chen & Yan Zhang, 2020. "Significance-based multi-scale method for network community detection and its application in disease-gene prediction," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-24, March.
    5. T M Murali & Matthew D Dyer & David Badger & Brett M Tyler & Michael G Katze, 2011. "Network-Based Prediction and Analysis of HIV Dependency Factors," PLOS Computational Biology, Public Library of Science, vol. 7(9), pages 1-15, September.
    6. José María Galván-Román & Ángel Lancho-Sánchez & Sergio Luquero-Bueno & Lorena Vega-Piris & Jose Curbelo & Marcos Manzaneque-Pradales & Manuel Gómez & Hortensia de la Fuente & Mara Ortega-Gómez & Javi, 2020. "Usefulness of circulating microRNAs miR-146a and miR-16-5p as prognostic biomarkers in community-acquired pneumonia," PLOS ONE, Public Library of Science, vol. 15(10), pages 1-13, October.
    7. Kshitij Srivastava & Anvesha Srivastava, 2012. "Comprehensive Review of Genetic Association Studies and Meta-Analyses on miRNA Polymorphisms and Cancer Risk," PLOS ONE, Public Library of Science, vol. 7(11), pages 1-1, November.
    8. Deborah Chasman & Brandi Gancarz & Linhui Hao & Michael Ferris & Paul Ahlquist & Mark Craven, 2014. "Inferring Host Gene Subnetworks Involved in Viral Replication," PLOS Computational Biology, Public Library of Science, vol. 10(5), pages 1-22, May.
    9. Yanyan Wang & Yujie Zhang & Chi Pan & Feixia Ma & Suzhan Zhang, 2015. "Prediction of Poor Prognosis in Breast Cancer Patients Based on MicroRNA-21 Expression: A Meta-Analysis," PLOS ONE, Public Library of Science, vol. 10(2), pages 1-13, February.
    10. Li-Chen Hung & Pei-Tseng Kung & Chi-Hsuan Lung & Ming-Hsui Tsai & Shih-An Liu & Li-Ting Chiu & Kuang-Hua Huang & Wen-Chen Tsai, 2020. "Assessment of the Risk of Oral Cancer Incidence in A High-Risk Population and Establishment of A Predictive Model for Oral Cancer Incidence Using A Population-Based Cohort in Taiwan," IJERPH, MDPI, vol. 17(2), pages 1-15, January.
    11. Jun Meng & Lin Shi & Yushi Luan, 2014. "Plant microRNA-Target Interaction Identification Model Based on the Integration of Prediction Tools and Support Vector Machine," PLOS ONE, Public Library of Science, vol. 9(7), pages 1-12, July.
    12. Thierry Chekouo & Francesco C. Stingo & James D. Doecke & Kim-Anh Do, 2015. "miRNA–target gene regulatory networks: A Bayesian integrative approach to biomarker selection with application to kidney cancer," Biometrics, The International Biometric Society, vol. 71(2), pages 428-438, June.
    13. Ang Li & Yingwei Deng & Yan Tan & Min Chen, 2021. "A novel miRNA-disease association prediction model using dual random walk with restart and space projection federated method," PLOS ONE, Public Library of Science, vol. 16(6), pages 1-17, June.
    14. J. Fulneček, 2007. "Isolation and detection of small RNA molecules," Plant, Soil and Environment, Czech Academy of Agricultural Sciences, vol. 53(10), pages 451-455.
    15. Charlotte Glinge & Sebastian Clauss & Kim Boddum & Reza Jabbari & Javad Jabbari & Bjarke Risgaard & Philipp Tomsits & Bianca Hildebrand & Stefan Kääb & Reza Wakili & Thomas Jespersen & Jacob Tfelt-Han, 2017. "Stability of Circulating Blood-Based MicroRNAs – Pre-Analytic Methodological Considerations," PLOS ONE, Public Library of Science, vol. 12(2), pages 1-16, February.
    16. Jianhua Li & Xiaoyan Lin & Yueyang Teng & Shouliang Qi & Dayu Xiao & Jianying Zhang & Yan Kang, 2016. "A Comprehensive Evaluation of Disease Phenotype Networks for Gene Prioritization," PLOS ONE, Public Library of Science, vol. 11(7), pages 1-18, July.
    17. Alexander Link & Verena Becker & Ajay Goel & Thomas Wex & Peter Malfertheiner, 2012. "Feasibility of Fecal MicroRNAs as Novel Biomarkers for Pancreatic Cancer," PLOS ONE, Public Library of Science, vol. 7(8), pages 1-9, August.
    18. Hossain Ahmed & Beyene Joseph, 2013. "Estimation of weighted log partial area under the ROC curve and its application to MicroRNA expression data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 12(6), pages 743-755, December.
    19. Emilie Estrabaud & Kevin Appourchaux & Ivan Bièche & Fabrice Carrat & Martine Lapalus & Olivier Lada & Michelle Martinot-Peignoux & Nathalie Boyer & Patrick Marcellin & Michel Vidaud & Tarik Asselah, 2015. "IFI35, mir-99a and HCV Genotype to Predict Sustained Virological Response to Pegylated-Interferon Plus Ribavirin in Chronic Hepatitis C," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-19, April.
    20. Le Ou-Yang & Dao-Qing Dai & Xiao-Fei Zhang, 2013. "Protein Complex Detection via Weighted Ensemble Clustering Based on Bayesian Nonnegative Matrix Factorization," PLOS ONE, Public Library of Science, vol. 8(5), pages 1-18, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1006418. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.