IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0155290.html
   My bibliography  Save this article

SVM-Prot 2016: A Web-Server for Machine Learning Prediction of Protein Functional Families from Sequence Irrespective of Similarity

Author

Listed:
  • Ying Hong Li
  • Jing Yu Xu
  • Lin Tao
  • Xiao Feng Li
  • Shuang Li
  • Xian Zeng
  • Shang Ying Chen
  • Peng Zhang
  • Chu Qin
  • Cheng Zhang
  • Zhe Chen
  • Feng Zhu
  • Yu Zong Chen

Abstract

Knowledge of protein function is important for biological, medical and therapeutic studies, but many proteins are still unknown in function. There is a need for more improved functional prediction methods. Our SVM-Prot web-server employed a machine learning method for predicting protein functional families from protein sequences irrespective of similarity, which complemented those similarity-based and other methods in predicting diverse classes of proteins including the distantly-related proteins and homologous proteins of different functions. Since its publication in 2003, we made major improvements to SVM-Prot with (1) expanded coverage from 54 to 192 functional families, (2) more diverse protein descriptors protein representation, (3) improved predictive performances due to the use of more enriched training datasets and more variety of protein descriptors, (4) newly integrated BLAST analysis option for assessing proteins in the SVM-Prot predicted functional families that were similar in sequence to a query protein, and (5) newly added batch submission option for supporting the classification of multiple proteins. Moreover, 2 more machine learning approaches, K nearest neighbor and probabilistic neural networks, were added for facilitating collective assessment of protein functions by multiple methods. SVM-Prot can be accessed at http://bidd2.nus.edu.sg/cgi-bin/svmprot/svmprot.cgi.

Suggested Citation

  • Ying Hong Li & Jing Yu Xu & Lin Tao & Xiao Feng Li & Shuang Li & Xian Zeng & Shang Ying Chen & Peng Zhang & Chu Qin & Cheng Zhang & Zhe Chen & Feng Zhu & Yu Zong Chen, 2016. "SVM-Prot 2016: A Web-Server for Machine Learning Prediction of Protein Functional Families from Sequence Irrespective of Similarity," PLOS ONE, Public Library of Science, vol. 11(8), pages 1-14, August.
  • Handle: RePEc:plo:pone00:0155290
    DOI: 10.1371/journal.pone.0155290
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0155290
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0155290&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0155290?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Sheila M Reynolds & Lukas Käll & Michael E Riffle & Jeff A Bilmes & William Stafford Noble, 2008. "Transmembrane Topology and Signal Peptide Prediction Using Dynamic Bayesian Networks," PLOS Computational Biology, Public Library of Science, vol. 4(11), pages 1-14, November.
    2. Chen Lin & Ying Zou & Ji Qin & Xiangrong Liu & Yi Jiang & Caihuan Ke & Quan Zou, 2013. "Hierarchical Classification of Protein Folds Using a Novel Ensemble Classifier," PLOS ONE, Public Library of Science, vol. 8(2), pages 1-11, February.
    3. Stephen P. Jackson & Jiri Bartek, 2009. "The DNA-damage response in human biology and disease," Nature, Nature, vol. 461(7267), pages 1071-1078, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Solomon Shiferaw Beyene & Tianyi Ling & Blagoj Ristevski & Ming Chen, 2020. "A novel riboswitch classification based on imbalanced sequences achieved by machine learning," PLOS Computational Biology, Public Library of Science, vol. 16(7), pages 1-23, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zita Gál & Stavroula Boukoura & Kezia Catharina Oxe & Sara Badawi & Blanca Nieto & Lea Milling Korsholm & Sille Blangstrup Geisler & Ekaterina Dulina & Anna Vestergaard Rasmussen & Christina Dahl & We, 2024. "Hyper-recombination in ribosomal DNA is driven by long-range resection-independent RAD51 accumulation," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    2. Ilaria Rosso & Corey Jones-Weinert & Francesca Rossiello & Matteo Cabrini & Silvia Brambillasca & Leonel Munoz-Sagredo & Zeno Lavagnino & Emanuele Martini & Enzo Tedone & Massimiliano Garre’ & Julio A, 2023. "Alternative lengthening of telomeres (ALT) cells viability is dependent on C-rich telomeric RNAs," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    3. Bi-Qing Li & Le-Le Hu & Lei Chen & Kai-Yan Feng & Yu-Dong Cai & Kuo-Chen Chou, 2012. "Prediction of Protein Domain with mRMR Feature Selection and Analysis," PLOS ONE, Public Library of Science, vol. 7(6), pages 1-14, June.
    4. Halh Al-Serori & Franziska Ferk & Michael Kundi & Andrea Bileck & Christopher Gerner & Miroslav Mišík & Armen Nersesyan & Monika Waldherr & Manuel Murbach & Tamara T Lah & Christel Herold-Mende & Andr, 2018. "Mobile phone specific electromagnetic fields induce transient DNA damage and nucleotide excision repair in serum-deprived human glioblastoma cells," PLOS ONE, Public Library of Science, vol. 13(4), pages 1-17, April.
    5. Yuxin Che & Ying Ju & Ping Xuan & Ren Long & Fei Xing, 2016. "Identification of Multi-Functional Enzyme with Multi-Label Classifier," PLOS ONE, Public Library of Science, vol. 11(4), pages 1-13, April.
    6. Daipayan Banerjee & Kurt Langberg & Salar Abbas & Eric Odermatt & Praveen Yerramothu & Martin Volaric & Matthew A. Reidenbach & Kathy J. Krentz & C. Dustin Rubinstein & David L. Brautigan & Tarek Abba, 2021. "A non-canonical, interferon-independent signaling activity of cGAMP triggers DNA damage response signaling," Nature Communications, Nature, vol. 12(1), pages 1-24, December.
    7. Jenny Kaur Singh & Rebecca Smith & Magdalena B. Rother & Anton J. L. Groot & Wouter W. Wiegant & Kees Vreeken & Ostiane D’Augustin & Robbert Q. Kim & Haibin Qian & Przemek M. Krawczyk & Román González, 2021. "Zinc finger protein ZNF384 is an adaptor of Ku to DNA during classical non-homologous end-joining," Nature Communications, Nature, vol. 12(1), pages 1-21, December.
    8. Miho M. Suzuki & Kenta Iijima & Koichi Ogami & Keiko Shinjo & Yoshiteru Murofushi & Jingqi Xie & Xuebing Wang & Yotaro Kitano & Akira Mamiya & Yuji Kibe & Tatsunori Nishimura & Fumiharu Ohka & Ryuta S, 2023. "TUG1-mediated R-loop resolution at microsatellite loci as a prerequisite for cancer cell proliferation," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
    9. Ye Cai & Huifen Cao & Fang Wang & Yufei Zhang & Philipp Kapranov, 2022. "Complex genomic patterns of abasic sites in mammalian DNA revealed by a high-resolution SSiNGLe-AP method," Nature Communications, Nature, vol. 13(1), pages 1-21, December.
    10. Sidrah Shah & Alison Cheung & Mikolaj Kutka & Matin Sheriff & Stergios Boussios, 2022. "Epithelial Ovarian Cancer: Providing Evidence of Predisposition Genes," IJERPH, MDPI, vol. 19(13), pages 1-14, July.
    11. Jérémy Sandoz & Max Cigrang & Amélie Zachayus & Philippe Catez & Lise-Marie Donnio & Clèmence Elly & Jadwiga Nieminuszczy & Pietro Berico & Cathy Braun & Sergey Alekseev & Jean-Marc Egly & Wojciech Ni, 2023. "Active mRNA degradation by EXD2 nuclease elicits recovery of transcription after genotoxic stress," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    12. Jessica D. Tischler & Hiroshi Tsuchida & Rosevalentine Bosire & Tommy T. Oda & Ana Park & Richard O. Adeyemi, 2024. "FLIP(C1orf112)-FIGNL1 complex regulates RAD51 chromatin association to promote viability after replication stress," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    13. Lucas Serra Moncadas & Cyrill Hofer & Paul-Adrian Bulzu & Jakob Pernthaler & Adrian-Stefan Andrei, 2024. "Freshwater genome-reduced bacteria exhibit pervasive episodes of adaptive stasis," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    14. Lin-Lin Zhou & Tao Zhang & Yun Xue & Chuan Yue & Yihui Pan & Pengyu Wang & Teng Yang & Meixia Li & Hu Zhou & Kan Ding & Jianhua Gan & Hongbin Ji & Cai-Guang Yang, 2023. "Selective activator of human ClpP triggers cell cycle arrest to inhibit lung squamous cell carcinoma," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    15. Daniel Gómez-Cabello & George Pappas & Diana Aguilar-Morante & Christoffel Dinant & Jiri Bartek, 2022. "CtIP-dependent nascent RNA expression flanking DNA breaks guides the choice of DNA repair pathway," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    16. Pedro Weickert & Hao-Yi Li & Maximilian J. Götz & Sophie Dürauer & Denitsa Yaneva & Shubo Zhao & Jacqueline Cordes & Aleida C. Acampora & Ignasi Forne & Axel Imhof & Julian Stingele, 2023. "SPRTN patient variants cause global-genome DNA-protein crosslink repair defects," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    17. Pradeep Ramalingam & Michael C. Gutkin & Michael G. Poulos & Taylor Tillery & Chelsea Doughty & Agatha Winiarski & Ana G. Freire & Shahin Rafii & David Redmond & Jason M. Butler, 2023. "Restoring bone marrow niche function rejuvenates aged hematopoietic stem cells by reactivating the DNA Damage Response," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
    18. Ross J. Hill & Nazareno Bona & Job Smink & Hannah K. Webb & Alastair Crisp & Juan I. Garaycoechea & Gerry P. Crossan, 2024. "p53 regulates diverse tissue-specific outcomes to endogenous DNA damage in mice," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    19. Andrea M. Kaminski & Kishore K. Chiruvella & Dale A. Ramsden & Katarzyna Bebenek & Thomas A. Kunkel & Lars C. Pedersen, 2022. "Analysis of diverse double-strand break synapsis with Polλ reveals basis for unique substrate specificity in nonhomologous end-joining," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    20. Muhammad Naveed Iqbal Qureshi & Beomjun Min & Hang Joon Jo & Boreom Lee, 2016. "Multiclass Classification for the Differential Diagnosis on the ADHD Subtypes Using Recursive Feature Elimination and Hierarchical Extreme Learning Machine: Structural MRI Study," PLOS ONE, Public Library of Science, vol. 11(8), pages 1-20, August.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0155290. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.