IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0122294.html
   My bibliography  Save this article

A Grammar Inference Approach for Predicting Kinase Specific Phosphorylation Sites

Author

Listed:
  • Sutapa Datta
  • Subhasis Mukhopadhyay

Abstract

Kinase mediated phosphorylation site detection is the key mechanism of post translational mechanism that plays an important role in regulating various cellular processes and phenotypes. Many diseases, like cancer are related with the signaling defects which are associated with protein phosphorylation. Characterizing the protein kinases and their substrates enhances our ability to understand the mechanism of protein phosphorylation and extends our knowledge of signaling network; thereby helping us to treat such diseases. Experimental methods for predicting phosphorylation sites are labour intensive and expensive. Also, manifold increase of protein sequences in the databanks over the years necessitates the improvement of high speed and accurate computational methods for predicting phosphorylation sites in protein sequences. Till date, a number of computational methods have been proposed by various researchers in predicting phosphorylation sites, but there remains much scope of improvement. In this communication, we present a simple and novel method based on Grammatical Inference (GI) approach to automate the prediction of kinase specific phosphorylation sites. In this regard, we have used a popular GI algorithm Alergia to infer Deterministic Stochastic Finite State Automata (DSFA) which equally represents the regular grammar corresponding to the phosphorylation sites. Extensive experiments on several datasets generated by us reveal that, our inferred grammar successfully predicts phosphorylation sites in a kinase specific manner. It performs significantly better when compared with the other existing phosphorylation site prediction methods. We have also compared our inferred DSFA with two other GI inference algorithms. The DSFA generated by our method performs superior which indicates that our method is robust and has a potential for predicting the phosphorylation sites in a kinase specific manner.

Suggested Citation

  • Sutapa Datta & Subhasis Mukhopadhyay, 2015. "A Grammar Inference Approach for Predicting Kinase Specific Phosphorylation Sites," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-19, April.
  • Handle: RePEc:plo:pone00:0122294
    DOI: 10.1371/journal.pone.0122294
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0122294
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0122294&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0122294?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Christopher Loose & Kyle Jensen & Isidore Rigoutsos & Gregory Stephanopoulos, 2006. "A linguistic model for the rational design of antimicrobial peptides," Nature, Nature, vol. 443(7113), pages 867-869, October.
    2. Sutapa Datta & Subhasis Mukhopadhyay, 2013. "A Composite Method Based on Formal Grammar and DNA Structural Features in Detecting Human Polymerase II Promoter Region," PLOS ONE, Public Library of Science, vol. 8(2), pages 1-11, February.
    3. David B. Searls, 2002. "The language of genes," Nature, Nature, vol. 420(6912), pages 211-217, November.
    4. Yi-Cheng Chen & Kripamoy Aguan & Chu-Wen Yang & Yao-Tsung Wang & Nikhil R Pal & I-Fang Chung, 2011. "Discovery of Protein Phosphorylation Motifs through Exploratory Data Analysis," PLOS ONE, Public Library of Science, vol. 6(5), pages 1-15, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sutapa Datta & Subhasis Mukhopadhyay, 2013. "A Composite Method Based on Formal Grammar and DNA Structural Features in Detecting Human Polymerase II Promoter Region," PLOS ONE, Public Library of Science, vol. 8(2), pages 1-11, February.
    2. Moghaddasi, Hanieh & Rezaei, Soghra & Darooneh, Amir Hossein & Heshmati, Emran & Khalifeh, Khosrow, 2020. "A comparative analysis of dipeptides distribution in eukaryotes and prokaryotes by statistical mechanics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 555(C).
    3. William F Porto & Állan S Pires & Octavio L Franco, 2012. "CS-AMPPred: An Updated SVM Model for Antimicrobial Activity Prediction in Cysteine-Stabilized Peptides," PLOS ONE, Public Library of Science, vol. 7(12), pages 1-7, December.
    4. Deepesh Nagarajan & Tushar Nagarajan & Neha Nanajkar & Nagasuma Chandra, 2019. "A Uniform In Vitro Efficacy Dataset to Guide Antimicrobial Peptide Design," Data, MDPI, vol. 4(1), pages 1-13, February.
    5. Ben Jacob, Eshel & Shapira, Yoash & Tauber, Alfred I., 2006. "Seeking the foundations of cognition in bacteria: From Schrödinger's negative entropy to latent information," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 359(C), pages 495-524.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0122294. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.