IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1007129.html
   My bibliography  Save this article

DeepConv-DTI: Prediction of drug-target interactions via deep learning with convolution on protein sequences

Author

Listed:
  • Ingoo Lee
  • Jongsoo Keum
  • Hojung Nam

Abstract

Identification of drug-target interactions (DTIs) plays a key role in drug discovery. The high cost and labor-intensive nature of in vitro and in vivo experiments have highlighted the importance of in silico-based DTI prediction approaches. In several computational models, conventional protein descriptors have been shown to not be sufficiently informative to predict accurate DTIs. Thus, in this study, we propose a deep learning based DTI prediction model capturing local residue patterns of proteins participating in DTIs. When we employ a convolutional neural network (CNN) on raw protein sequences, we perform convolution on various lengths of amino acids subsequences to capture local residue patterns of generalized protein classes. We train our model with large-scale DTI information and demonstrate the performance of the proposed model using an independent dataset that is not seen during the training phase. As a result, our model performs better than previous protein descriptor-based models. Also, our model performs better than the recently developed deep learning models for massive prediction of DTIs. By examining pooled convolution results, we confirmed that our model can detect binding sites of proteins for DTIs. In conclusion, our prediction model for detecting local residue patterns of target proteins successfully enriches the protein features of a raw protein sequence, yielding better prediction results than previous approaches. Our code is available at https://github.com/GIST-CSBL/DeepConv-DTI.Author summary: Drugs work by interacting with target proteins to activate or inhibit a target’s biological process. Therefore, identification of DTIs is a crucial step in drug discovery. However, identifying drug candidates via biological assays is very time and cost consuming, which introduces the need for a computational prediction approach for the identification of DTIs. In this work, we constructed a novel DTI prediction model to extract local residue patterns of target protein sequences using a CNN-based deep learning approach. As a result, the detected local features of protein sequences perform better than other protein descriptors for DTI prediction and previous models for predicting PubChem independent test datasets. That is, our approach of capturing local residue patterns with CNN successfully enriches protein features from a raw sequence.

Suggested Citation

  • Ingoo Lee & Jongsoo Keum & Hojung Nam, 2019. "DeepConv-DTI: Prediction of drug-target interactions via deep learning with convolution on protein sequences," PLOS Computational Biology, Public Library of Science, vol. 15(6), pages 1-21, June.
  • Handle: RePEc:plo:pcbi00:1007129
    DOI: 10.1371/journal.pcbi.1007129
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1007129
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1007129&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1007129?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Zhisong He & Jian Zhang & Xiao-He Shi & Le-Le Hu & Xiangyin Kong & Yu-Dong Cai & Kuo-Chen Chou, 2010. "Predicting Drug-Target Interaction Networks Based on Functional Groups and Biological Features," PLOS ONE, Public Library of Science, vol. 5(3), pages 1-8, March.
    2. Zhu-Hong You & Keith C C Chan & Pengwei Hu, 2015. "Predicting Protein-Protein Interactions from Primary Protein Sequences Using a Novel Multi-Scale Local Feature Representation Scheme and the Random Forest," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-19, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yuxuan Wang & Ying Xia & Junchi Yan & Ye Yuan & Hong-Bin Shen & Xiaoyong Pan, 2023. "ZeroBind: a protein-specific zero-shot predictor with subgraph matching for drug-target interactions," Nature Communications, Nature, vol. 14(1), pages 1-14, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jianjun He & Hong Gu & Wenqi Liu, 2012. "Imbalanced Multi-Modal Multi-Label Learning for Subcellular Localization Prediction of Human Proteins with Both Single and Multiple Sites," PLOS ONE, Public Library of Science, vol. 7(6), pages 1-10, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1007129. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.