IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0217027.html
   My bibliography  Save this article

Gene shaving using a sensitivity analysis of kernel based machine learning approach, with applications to cancer data

Author

Listed:
  • Md Ashad Alam
  • Mohammd Shahjaman
  • Md Ferdush Rahman
  • Fokhrul Hossain
  • Hong-Wen Deng

Abstract

Background: Gene shaving (GS) is an essential and challenging tools for biomedical researchers due to the large number of genes in human genome and the complex nature of biological networks. Most GS methods are not applicable to non-linear and multi-view data sets. While the kernel based methods can overcome these problems, a well-founded positive definite kernel based GS method has yet to be proposed for biomedical data analysis. Methods and findings: Since the kernel based methods on genomic information can improve the prediction of diseases, here we proposed a noble method, “kernel based gene shaving” which is based on the influence function of kernel canonical correlation analysis. To investigate the performance of the proposed method in comparison to state-of-the-art-method in gene saving, we analyzed extensive simulated and real microarray gene expression data set. The performance metrics including true positive rate, true negative rate, false positive rate, false negative rate, misclassification error rate, the false discovery rate and area under curves were computed for each methods. In colon cancer data analysis, the proposed method identified a significant subsets of 210 genes out of 2000 genes and suggestive superior performance compared with other methods. The proposed method can be applied to the study of other disease process where two view data is a common task. Conclusions: We addressed the challenge of finding unique kernel based GS methods by using the influence function of kernel canonical correlation analysis. The proposed method has shown to have better performance than state-of-the-art-methods in gene saving and has identified many more significant gene interactions, suggesting that genes function in a concerted effort in colon cancer. In similar biomedical data analysis, kernel based methods could be applied to select a potential subset of genes. The positive definite kernel based methods can overcome the non-linearity problem and improve the prediction process.

Suggested Citation

  • Md Ashad Alam & Mohammd Shahjaman & Md Ferdush Rahman & Fokhrul Hossain & Hong-Wen Deng, 2019. "Gene shaving using a sensitivity analysis of kernel based machine learning approach, with applications to cancer data," PLOS ONE, Public Library of Science, vol. 14(5), pages 1-17, May.
  • Handle: RePEc:plo:pone00:0217027
    DOI: 10.1371/journal.pone.0217027
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0217027
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0217027&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0217027?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Lingyan Ruan & Ming Yuan, 2011. "An Empirical Bayes' Approach to Joint Analysis of Multiple Microarray Gene Expression Studies," Biometrics, The International Biometric Society, vol. 67(4), pages 1617-1626, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mohammad Manir Hossain Mollah & Rahman Jamal & Norfilza Mohd Mokhtar & Roslan Harun & Md Nurul Haque Mollah, 2015. "A Hybrid One-Way ANOVA Approach for the Robust and Efficient Estimation of Differential Gene Expression with Multiple Patterns," PLOS ONE, Public Library of Science, vol. 10(9), pages 1-26, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0217027. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.