IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1000397.html
   My bibliography  Save this article

Integrating Statistical Predictions and Experimental Verifications for Enhancing Protein-Chemical Interaction Predictions in Virtual Screening

Author

Listed:
  • Nobuyoshi Nagamine
  • Takayuki Shirakawa
  • Yusuke Minato
  • Kentaro Torii
  • Hiroki Kobayashi
  • Masaya Imoto
  • Yasubumi Sakakibara

Abstract

Predictions of interactions between target proteins and potential leads are of great benefit in the drug discovery process. We present a comprehensively applicable statistical prediction method for interactions between any proteins and chemical compounds, which requires only protein sequence data and chemical structure data and utilizes the statistical learning method of support vector machines. In order to realize reasonable comprehensive predictions which can involve many false positives, we propose two approaches for reduction of false positives: (i) efficient use of multiple statistical prediction models in the framework of two-layer SVM and (ii) reasonable design of the negative data to construct statistical prediction models. In two-layer SVM, outputs produced by the first-layer SVM models, which are constructed with different negative samples and reflect different aspects of classifications, are utilized as inputs to the second-layer SVM. In order to design negative data which produce fewer false positive predictions, we iteratively construct SVM models or classification boundaries from positive and tentative negative samples and select additional negative sample candidates according to pre-determined rules. Moreover, in order to fully utilize the advantages of statistical learning methods, we propose a strategy to effectively feedback experimental results to computational predictions with consideration of biological effects of interest. We show the usefulness of our approach in predicting potential ligands binding to human androgen receptors from more than 19 million chemical compounds and verifying these predictions by in vitro binding. Moreover, we utilize this experimental validation as feedback to enhance subsequent computational predictions, and experimentally validate these predictions again. This efficient procedure of the iteration of the in silico prediction and in vitro or in vivo experimental verifications with the sufficient feedback enabled us to identify novel ligand candidates which were distant from known ligands in the chemical space.Author Summary: This work describes a statistical method that identifies chemical compounds binding to a target protein given the sequence of the target or distinguishes proteins to which a small molecule binds given the chemical structure of the molecule. As our method can be utilized for virtual screening that seeks for lead compounds in drug discovery, we showed the usefulness of our method in its application to the comprehensive prediction of ligands binding to human androgen receptors and in vitro experimental verification of its predictions. In contrast to most previous virtual screening studies which predict chemical compounds of interest mainly with 3D structure-based methods and experimentally verify them, we proposed a strategy to effectively feedback experimental results for subsequent predictions and applied the strategy to the second predictions followed by the second experimental verification. This feedback strategy makes full use of statistical learning methods and, in practical terms, gave a ligand candidate of interest that structurally differs from known drugs. We hope that this paper will encourage reevaluation of statistical learning methods in virtual screening and that the utilization of statistical methods with efficient feedback strategies will contribute to the acceleration of drug discovery.

Suggested Citation

  • Nobuyoshi Nagamine & Takayuki Shirakawa & Yusuke Minato & Kentaro Torii & Hiroki Kobayashi & Masaya Imoto & Yasubumi Sakakibara, 2009. "Integrating Statistical Predictions and Experimental Verifications for Enhancing Protein-Chemical Interaction Predictions in Virtual Screening," PLOS Computational Biology, Public Library of Science, vol. 5(6), pages 1-11, June.
  • Handle: RePEc:plo:pcbi00:1000397
    DOI: 10.1371/journal.pcbi.1000397
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1000397
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1000397&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1000397?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Søren G. F. Rasmussen & Hee-Jung Choi & Daniel M. Rosenbaum & Tong Sun Kobilka & Foon Sun Thian & Patricia C. Edwards & Manfred Burghammer & Venkata R. P. Ratnala & Ruslan Sanishvili & Robert F. Fisch, 2007. "Crystal structure of the human β2 adrenergic G-protein-coupled receptor," Nature, Nature, vol. 450(7168), pages 383-387, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhisong He & Jian Zhang & Xiao-He Shi & Le-Le Hu & Xiangyin Kong & Yu-Dong Cai & Kuo-Chen Chou, 2010. "Predicting Drug-Target Interaction Networks Based on Functional Groups and Biological Features," PLOS ONE, Public Library of Science, vol. 5(3), pages 1-8, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Valérie Capra & Marta Busnelli & Alessandro Perenna & Manuela Ambrosio & Maria Rosa Accomazzo & Celine Galés & Bice Chini & G Enrico Rovati, 2013. "Full and Partial Agonists of Thromboxane Prostanoid Receptor Unveil Fine Tuning of Receptor Superactive Conformation and G Protein Activation," PLOS ONE, Public Library of Science, vol. 8(3), pages 1-12, March.
    2. Valérie Boivin-Jahns & Kerstin Uhland & Hans-Peter Holthoff & Niklas Beyersdorf & Vladimir Kocoski & Thomas Kerkau & Götz Münch & Martin J Lohse & Martin Ungerer & Roland Jahns, 2018. "Cyclopeptide COR-1 to treat beta1-adrenergic receptor antibody-induced heart failure," PLOS ONE, Public Library of Science, vol. 13(8), pages 1-23, August.
    3. Holly J Atkinson & John H Morris & Thomas E Ferrin & Patricia C Babbitt, 2009. "Using Sequence Similarity Networks for Visualization of Relationships Across Diverse Protein Superfamilies," PLOS ONE, Public Library of Science, vol. 4(2), pages 1-14, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1000397. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.