IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1000213.html
   My bibliography  Save this article

Transmembrane Topology and Signal Peptide Prediction Using Dynamic Bayesian Networks

Author

Listed:
  • Sheila M Reynolds
  • Lukas Käll
  • Michael E Riffle
  • Jeff A Bilmes
  • William Stafford Noble

Abstract

Hidden Markov models (HMMs) have been successfully applied to the tasks of transmembrane protein topology prediction and signal peptide prediction. In this paper we expand upon this work by making use of the more powerful class of dynamic Bayesian networks (DBNs). Our model, Philius, is inspired by a previously published HMM, Phobius, and combines a signal peptide submodel with a transmembrane submodel. We introduce a two-stage DBN decoder that combines the power of posterior decoding with the grammar constraints of Viterbi-style decoding. Philius also provides protein type, segment, and topology confidence metrics to aid in the interpretation of the predictions. We report a relative improvement of 13% over Phobius in full-topology prediction accuracy on transmembrane proteins, and a sensitivity and specificity of 0.96 in detecting signal peptides. We also show that our confidence metrics correlate well with the observed precision. In addition, we have made predictions on all 6.3 million proteins in the Yeast Resource Center (YRC) database. This large-scale study provides an overall picture of the relative numbers of proteins that include a signal-peptide and/or one or more transmembrane segments as well as a valuable resource for the scientific community. All DBNs are implemented using the Graphical Models Toolkit. Source code for the models described here is available at http://noble.gs.washington.edu/proj/philius. A Philius Web server is available at http://www.yeastrc.org/philius, and the predictions on the YRC database are available at http://www.yeastrc.org/pdr.Author Summary: Transmembrane proteins control the flow of information and substances into and out of the cell and are involved in a broad range of biological processes. Their interfacing role makes them rewarding drug targets, and it is estimated that more than 50% of recently launched drugs target membrane proteins. However, experimentally determining the three-dimensional structure of a transmembrane protein is still a difficult task, and few of the currently known tertiary structures are of transmembrane proteins despite the fact that as many as one quarter of the proteins in a given organism are transmembrane proteins. Computational methods for predicting the basic topology of a transmembrane protein are therefore of great interest, and these methods must be able to distinguish between mature, membrane-spanning proteins and proteins that, when first synthesized, contain an N-terminal membrane-spanning signal peptide. In this work, we present Philius, a new computational approach that outperforms previous methods in simultaneously detecting signal peptides and correctly predicting the topology of transmembrane proteins. Philius also supplies a set of confidence scores with each prediction. A Philius Web server is available to the public as well as precomputed predictions for over six million proteins in the Yeast Resource Center database.

Suggested Citation

  • Sheila M Reynolds & Lukas Käll & Michael E Riffle & Jeff A Bilmes & William Stafford Noble, 2008. "Transmembrane Topology and Signal Peptide Prediction Using Dynamic Bayesian Networks," PLOS Computational Biology, Public Library of Science, vol. 4(11), pages 1-14, November.
  • Handle: RePEc:plo:pcbi00:1000213
    DOI: 10.1371/journal.pcbi.1000213
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1000213
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1000213&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1000213?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Bi-Qing Li & Le-Le Hu & Lei Chen & Kai-Yan Feng & Yu-Dong Cai & Kuo-Chen Chou, 2012. "Prediction of Protein Domain with mRMR Feature Selection and Analysis," PLOS ONE, Public Library of Science, vol. 7(6), pages 1-14, June.
    2. Yuanwei Gou & Dongfang Li & Minghui Zhao & Mengxin Li & Jiaojiao Zhang & Yilian Zhou & Feng Xiao & Gaofei Liu & Haote Ding & Chenfan Sun & Cuifang Ye & Chang Dong & Jucan Gao & Di Gao & Zehua Bao & Le, 2024. "Intein-mediated temperature control for complete biosynthesis of sanguinarine and its halogenated derivatives in yeast," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    3. Lei Wang & Jiangguo Zhang & Dali Wang & Chen Song, 2022. "Membrane contact probability: An essential and predictive character for the structural and functional studies of membrane proteins," PLOS Computational Biology, Public Library of Science, vol. 18(3), pages 1-27, March.
    4. Ying Hong Li & Jing Yu Xu & Lin Tao & Xiao Feng Li & Shuang Li & Xian Zeng & Shang Ying Chen & Peng Zhang & Chu Qin & Cheng Zhang & Zhe Chen & Feng Zhu & Yu Zong Chen, 2016. "SVM-Prot 2016: A Web-Server for Machine Learning Prediction of Protein Functional Families from Sequence Irrespective of Similarity," PLOS ONE, Public Library of Science, vol. 11(8), pages 1-14, August.
    5. Lucas Serra Moncadas & Cyrill Hofer & Paul-Adrian Bulzu & Jakob Pernthaler & Adrian-Stefan Andrei, 2024. "Freshwater genome-reduced bacteria exhibit pervasive episodes of adaptive stasis," Nature Communications, Nature, vol. 15(1), pages 1-15, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1000213. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.