IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1008681.html
   My bibliography  Save this article

KinPred: A unified and sustainable approach for harnessing proteome-level human kinase-substrate predictions

Author

Listed:
  • Bingjie Xue
  • Benjamin Jordan
  • Saqib Rizvi
  • Kristen M Naegle

Abstract

Tyrosine and serine/threonine kinases are essential regulators of cell processes and are important targets for human therapies. Unfortunately, very little is known about specific kinase-substrate relationships, making it difficult to infer meaning from dysregulated phosphoproteomic datasets or for researchers to identify possible kinases that regulate specific or novel phosphorylation sites. The last two decades have seen an explosion in algorithms to extrapolate from what little is known into the larger unknown—predicting kinase relationships with site-specific substrates using a variety of approaches that include the sequence-specificity of kinase catalytic domains and various other factors, such as evolutionary relationships, co-expression, and protein-protein interaction networks. Unfortunately, a number of limitations prevent researchers from easily harnessing these resources, such as loss of resource accessibility, limited information in publishing that results in a poor mapping to a human reference, and not being updated to match the growth of the human phosphoproteome. Here, we propose a methodological framework for publishing predictions in a unified way, which entails ensuring predictions have been run on a current reference proteome, mapping the same substrates and kinases across resources to a common reference, filtering for the human phosphoproteome, and providing methods for updating the resource easily in the future. We applied this framework on three currently available resources, published in the last decade, which provide kinase-specific predictions in the human proteome. Using the unified datasets, we then explore the role of study bias, the emergent network properties of these predictive algorithms, and comparisons within and between predictive algorithms. The combination of the code for unification and analysis, as well as the unified predictions are available under the resource we named KinPred. We believe this resource will be useful for a wide range of applications and establishes best practices for long-term usability and sustainability for new and existing predictive algorithms.Author summary: Kinases are the enzymes responsible for controlling a large number of cell biological processes. They can regulate protein interactions, protein function, protein localization, and protein turnover by adding negatively charged phosphate groups to three amino acid types that make up proteins. Despite a large growth in knowledge about what sites on what proteins kinases can target, we still do not fully understand the exact relationship between every one of the hundreds of human protein kinases and the hundreds of thousands of possible amino acid targets within proteins. Instead, we can use algorithms to predict relationships and this has been an intense area of research and many such algorithms, based on different underlying features and mathematical approaches, have been published. Here, we create a method for making it easy to harness and compare multiple algorithms and keep these updated as new phosphorylation sites are observed. The outcome of this work is a computational pipeline, a current resource of predictions for three prediction algorithms for all know phosphorylation sites in the human proteome, and the results of comparing how different algorithms compare to each other and their inherent properties.

Suggested Citation

  • Bingjie Xue & Benjamin Jordan & Saqib Rizvi & Kristen M Naegle, 2021. "KinPred: A unified and sustainable approach for harnessing proteome-level human kinase-substrate predictions," PLOS Computational Biology, Public Library of Science, vol. 17(2), pages 1-20, February.
  • Handle: RePEc:plo:pcbi00:1008681
    DOI: 10.1371/journal.pcbi.1008681
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1008681
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1008681&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1008681?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sam Crowl & Ben T. Jordan & Hamza Ahmed & Cynthia X. Ma & Kristen M. Naegle, 2022. "KSTAR: An algorithm to predict patient-specific kinase activities from phosphoproteomic data," Nature Communications, Nature, vol. 13(1), pages 1-16, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1008681. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.