IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1002101.html
   My bibliography  Save this article

Prediction of Cell Penetrating Peptides by Support Vector Machines

Author

Listed:
  • William S Sanders
  • C Ian Johnston
  • Susan M Bridges
  • Shane C Burgess
  • Kenneth O Willeford

Abstract

Cell penetrating peptides (CPPs) are those peptides that can transverse cell membranes to enter cells. Once inside the cell, different CPPs can localize to different cellular components and perform different roles. Some generate pore-forming complexes resulting in the destruction of cells while others localize to various organelles. Use of machine learning methods to predict potential new CPPs will enable more rapid screening for applications such as drug delivery. We have investigated the influence of the composition of training datasets on the ability to classify peptides as cell penetrating using support vector machines (SVMs). We identified 111 known CPPs and 34 known non-penetrating peptides from the literature and commercial vendors and used several approaches to build training data sets for the classifiers. Features were calculated from the datasets using a set of basic biochemical properties combined with features from the literature determined to be relevant in the prediction of CPPs. Our results using different training datasets confirm the importance of a balanced training set with approximately equal number of positive and negative examples. The SVM based classifiers have greater classification accuracy than previously reported methods for the prediction of CPPs, and because they use primary biochemical properties of the peptides as features, these classifiers provide insight into the properties needed for cell-penetration. To confirm our SVM classifications, a subset of peptides classified as either penetrating or non-penetrating was selected for synthesis and experimental validation. Of the synthesized peptides predicted to be CPPs, 100% of these peptides were shown to be penetrating. Author Summary: Cell penetrating peptides (CPPs) are peptides that can potentially transport other functional molecules across cellular membranes and therefore serve a role as drug delivery vehicles. The properties of a given peptide that make it cell penetrating are unclear, and the rapid screening of potential CPPs aids researchers by allowing focus on those peptides most likely to be utilized in a therapeutic capacity. This paper shows that basic features representing primary biochemical properties of these peptides can be used to train a classifier that can accurately predict cell penetrating potential of peptides and provide insight into the biochemical properties associated with cell penetration.

Suggested Citation

  • William S Sanders & C Ian Johnston & Susan M Bridges & Shane C Burgess & Kenneth O Willeford, 2011. "Prediction of Cell Penetrating Peptides by Support Vector Machines," PLOS Computational Biology, Public Library of Science, vol. 7(7), pages 1-12, July.
  • Handle: RePEc:plo:pcbi00:1002101
    DOI: 10.1371/journal.pcbi.1002101
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002101
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1002101&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1002101?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Garci'a Lopez, Felix & Garci'a Torres, Miguel & Melian Batista, Belen & Moreno Perez, Jose A. & Moreno-Vega, J. Marcos, 2006. "Solving feature subset selection problem by a Parallel Scatter Search," European Journal of Operational Research, Elsevier, vol. 169(2), pages 477-489, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Georgia Melagraki & Evangelos Ntougkos & Vagelis Rinotas & Christos Papaneophytou & Georgios Leonis & Thomas Mavromoustakos & George Kontopidis & Eleni Douni & Antreas Afantitis & George Kollias, 2017. "Cheminformatics-aided discovery of small-molecule Protein-Protein Interaction (PPI) dual inhibitors of Tumor Necrosis Factor (TNF) and Receptor Activator of NF-κB Ligand (RANKL)," PLOS Computational Biology, Public Library of Science, vol. 13(4), pages 1-27, April.
    2. Giuseppe Maccari & Mariagrazia Di Luca & Riccardo Nifosí & Francesco Cardarelli & Giovanni Signore & Claudia Boccardi & Angelo Bifone, 2013. "Antimicrobial Peptides Design by Evolutionary Multiobjective Optimization," PLOS Computational Biology, Public Library of Science, vol. 9(9), pages 1-12, September.
    3. Sofie Stalmans & Evelien Wynendaele & Nathalie Bracke & Bert Gevaert & Matthias D’Hondt & Kathelijne Peremans & Christian Burvenich & Bart De Spiegeleer, 2013. "Chemical-Functional Diversity in Cell-Penetrating Peptides," PLOS ONE, Public Library of Science, vol. 8(8), pages 1-1, August.
    4. Alex D Herbert & Antony M Carr & Eva Hoffmann, 2014. "FindFoci: A Focus Detection Algorithm with Automated Parameter Training That Closely Matches Human Assignments, Reduces Human Inconsistencies and Increases Speed of Analysis," PLOS ONE, Public Library of Science, vol. 9(12), pages 1-33, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Muhammad Arif & Ahmed Kattan, 2015. "Physical Activities Monitoring Using Wearable Acceleration Sensors Attached to the Body," PLOS ONE, Public Library of Science, vol. 10(7), pages 1-16, July.
    2. Panagopoulos, Orestis P. & Pappu, Vijay & Xanthopoulos, Petros & Pardalos, Panos M., 2016. "Constrained subspace classifier for high dimensional datasets," Omega, Elsevier, vol. 59(PA), pages 40-46.
    3. Kerkhove, L.-P. & Vanhoucke, M., 2017. "A parallel multi-objective scatter search for optimising incentive contract design in projects," European Journal of Operational Research, Elsevier, vol. 261(3), pages 1066-1084.
    4. Pablo Venegas & Francisco Calderon & Daniel Riofrío & Diego Benítez & Giovani Ramón & Diego Cisneros-Heredia & Miguel Coimbra & José Luis Rojo-Álvarez & Noel Pérez, 2021. "Automatic ladybird beetle detection using deep-learning models," PLOS ONE, Public Library of Science, vol. 16(6), pages 1-21, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1002101. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.