IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/0030140.html
   My bibliography  Save this article

Natively Unstructured Loops Differ from Other Loops

Author

Listed:
  • Avner Schlessinger
  • Jinfeng Liu
  • Burkhard Rost

Abstract

Natively unstructured or disordered protein regions may increase the functional complexity of an organism; they are particularly abundant in eukaryotes and often evade structure determination. Many computational methods predict unstructured regions by training on outliers in otherwise well-ordered structures. Here, we introduce an approach that uses a neural network in a very different and novel way. We hypothesize that very long contiguous segments with nonregular secondary structure (NORS regions) differ significantly from regular, well-structured loops, and that a method detecting such features could predict natively unstructured regions. Training our new method, NORSnet, on predicted information rather than on experimental data yielded three major advantages: it removed the overlap between testing and training, it systematically covered entire proteomes, and it explicitly focused on one particular aspect of unstructured regions with a simple structural interpretation, namely that they are loops. Our hypothesis was correct: well-structured and unstructured loops differ so substantially that NORSnet succeeded in their distinction. Benchmarks on previously used and new experimental data of unstructured regions revealed that NORSnet performed very well. Although it was not the best single prediction method, NORSnet was sufficiently accurate to flag unstructured regions in proteins that were previously not annotated. In one application, NORSnet revealed previously undetected unstructured regions in putative targets for structural genomics and may thereby contribute to increasing structural coverage of large eukaryotic families. NORSnet found unstructured regions more often in domain boundaries than expected at random. In another application, we estimated that 50%–70% of all worm proteins observed to have more than seven protein–protein interaction partners have unstructured regions. The comparative analysis between NORSnet and DISOPRED2 suggested that long unstructured loops are a major part of unstructured regions in molecular networks.: The details of protein structures are important for function. Regions that do not adopt any regular structure in isolation (natively unstructured or disordered regions) initially appeared as a curious exception to this structure–function paradigm. It has become increasingly clear that unstructured regions are fundamental to many roles and that they are particularly important for multicellular organisms. Structural biology is just beginning to apprehend the stunning diversity of these roles. Here, we focused on unstructured regions dominated by a particular type of loop, namely the natively unstructured one. We developed a method that succeeded in the distinction between well-structured and natively unstructured loops. For the development, we did not use any experimental data for unstructured regions; when tested on experimental data, the method performed surprisingly well. Due to its different premises, the method captured very different aspects of unstructured regions than other methods that we tested. We applied the new method to two different problems. The first was the identification of proteins that may be difficult targets for structure determination. The second was the identification of worm proteins that have many interaction partners (more than seven) and unstructured regions. Surprisingly, we found unstructured regions of the loopy type in more than 50% of all the promiscuous worm proteins.

Suggested Citation

  • Avner Schlessinger & Jinfeng Liu & Burkhard Rost, 2007. "Natively Unstructured Loops Differ from Other Loops," PLOS Computational Biology, Public Library of Science, vol. 3(7), pages 1-12, July.
  • Handle: RePEc:plo:pcbi00:0030140
    DOI: 10.1371/journal.pcbi.0030140
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.0030140
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.0030140&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.0030140?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Masato Enari & Hideki Sakahira & Hideki Yokoyama & Katsuya Okawa & Akihiro Iwamatsu & Shigekazu Nagata, 1998. "A caspase-activated DNase that degrades DNA during apoptosis, and its inhibitor ICAD," Nature, Nature, vol. 391(6662), pages 43-50, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Esmeralda Vicedo & Avner Schlessinger & Burkhard Rost, 2015. "Environmental Pressure May Change the Composition Protein Disorder in Prokaryotes," PLOS ONE, Public Library of Science, vol. 10(8), pages 1-21, August.
    2. Avner Schlessinger & Marco Punta & Guy Yachdav & Laszlo Kajan & Burkhard Rost, 2009. "Improved Disorder Prediction by Combination of Orthogonal Approaches," PLOS ONE, Public Library of Science, vol. 4(2), pages 1-10, February.
    3. Jiangning Song & Hao Tan & Andrew J Perry & Tatsuya Akutsu & Geoffrey I Webb & James C Whisstock & Robert N Pike, 2012. "PROSPER: An Integrated Feature-Based Tool for Predicting Protease Substrate Cleavage Sites," PLOS ONE, Public Library of Science, vol. 7(11), pages 1-23, November.
    4. Jiangning Song & Hao Tan & Mingjun Wang & Geoffrey I Webb & Tatsuya Akutsu, 2012. "TANGLE: Two-Level Support Vector Regression Approach for Protein Backbone Torsion Angle Prediction from Primary Sequences," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-16, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Leire Valcárcel-Ocete & Gorka Alkorta-Aranburu & Mikel Iriondo & Asier Fullaondo & María García-Barcina & José Manuel Fernández-García & Elena Lezcano-García & José María Losada-Domingo & Javier Ruiz-, 2015. "Exploring Genetic Factors Involved in Huntington Disease Age of Onset: E2F2 as a New Potential Modifier Gene," PLOS ONE, Public Library of Science, vol. 10(7), pages 1-14, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:0030140. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.