IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1000945.html
   My bibliography  Save this article

Finding the “Dark Matter” in Human and Yeast Protein Network Prediction and Modelling

Author

Listed:
  • Juan A G Ranea
  • Ian Morilla
  • Jon G Lees
  • Adam J Reid
  • Corin Yeats
  • Andrew B Clegg
  • Francisca Sanchez-Jimenez
  • Christine Orengo

Abstract

Accurate modelling of biological systems requires a deeper and more complete knowledge about the molecular components and their functional associations than we currently have. Traditionally, new knowledge on protein associations generated by experiments has played a central role in systems modelling, in contrast to generally less trusted bio-computational predictions. However, we will not achieve realistic modelling of complex molecular systems if the current experimental designs lead to biased screenings of real protein networks and leave large, functionally important areas poorly characterised. To assess the likelihood of this, we have built comprehensive network models of the yeast and human proteomes by using a meta-statistical integration of diverse computationally predicted protein association datasets. We have compared these predicted networks against combined experimental datasets from seven biological resources at different level of statistical significance. These eukaryotic predicted networks resemble all the topological and noise features of the experimentally inferred networks in both species, and we also show that this observation is not due to random behaviour. In addition, the topology of the predicted networks contains information on true protein associations, beyond the constitutive first order binary predictions. We also observe that most of the reliable predicted protein associations are experimentally uncharacterised in our models, constituting the hidden or “dark matter” of networks by analogy to astronomical systems. Some of this dark matter shows enrichment of particular functions and contains key functional elements of protein networks, such as hubs associated with important functional areas like the regulation of Ras protein signal transduction in human cells. Thus, characterising this large and functionally important dark matter, elusive to established experimental designs, may be crucial for modelling biological systems. In any case, these predictions provide a valuable guide to these experimentally elusive regions.Author Summary: To model accurate protein networks we need to extend our knowledge of protein associations in molecular systems much further. Biologists believe that high-throughput experiments will fill the gaps in our knowledge. However, if these approaches perform biased screenings, leaving important areas poorly characterized, success in modelling protein networks will require additional approaches to explore these ‘dark’ areas. We assess the value of integrating bio-computational approaches to build accurate and comprehensive network models for human and yeast proteomes and compare these models with models derived by combining multiple experimental datasets. We show that the predicted networks resemble the topological and error features of the experimental networks, and contain information on true protein associations within and beyond their constitutive first order binary predictions. We suggest that the majority of predicted network space is dark matter containing important functional areas, elusive to current experimental designs. Until novel experimental designs emerge as effective tools to screen these hidden regions, computational predictions will be a valuable approach for exploring them.

Suggested Citation

  • Juan A G Ranea & Ian Morilla & Jon G Lees & Adam J Reid & Corin Yeats & Andrew B Clegg & Francisca Sanchez-Jimenez & Christine Orengo, 2010. "Finding the “Dark Matter” in Human and Yeast Protein Network Prediction and Modelling," PLOS Computational Biology, Public Library of Science, vol. 6(9), pages 1-14, September.
  • Handle: RePEc:plo:pcbi00:1000945
    DOI: 10.1371/journal.pcbi.1000945
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1000945
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1000945&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1000945?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Richard Massey & Jason Rhodes & Richard Ellis & Nick Scoville & Alexie Leauthaud & Alexis Finoguenov & Peter Capak & David Bacon & Hervé Aussel & Jean-Paul Kneib & Anton Koekemoer & Henry McCracken & , 2007. "Dark matter maps reveal cosmic scaffolding," Nature, Nature, vol. 445(7125), pages 286-290, January.
    2. Silpa Suthram & Taylor Sittler & Trey Ideker, 2005. "The Plasmodium protein network diverges from those of other eukaryotes," Nature, Nature, vol. 438(7064), pages 108-112, November.
    3. Asa Ben-Hur & Cheng Soon Ong & Sören Sonnenburg & Bernhard Schölkopf & Gunnar Rätsch, 2008. "Support Vector Machines and Kernels for Computational Biology," PLOS Computational Biology, Public Library of Science, vol. 4(10), pages 1-10, October.
    4. Chris J Needham & James R Bradford & Andrew J Bulpitt & David R Westhead, 2007. "A Primer on Learning in Bayesian Networks for Computational Biology," PLOS Computational Biology, Public Library of Science, vol. 3(8), pages 1-8, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ana M Rojas & Anna Santamaria & Rainer Malik & Thomas Skøt Jensen & Roman Körner & Ian Morilla & David de Juan & Martin Krallinger & Daniel Aaen Hansen & Robert Hoffmann & Jonathan Lees & Adam Reid & , 2012. "Uncovering the Molecular Machinery of the Human Spindle—An Integration of Wet and Dry Systems Biology," PLOS ONE, Public Library of Science, vol. 7(3), pages 1-16, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alaa Tharwat & Aboul Ella Hassanien, 2019. "Quantum-Behaved Particle Swarm Optimization for Parameter Optimization of Support Vector Machine," Journal of Classification, Springer;The Classification Society, vol. 36(3), pages 576-598, October.
    2. Benjamin-Fink, Nicole & Reilly, Brian K., 2017. "A road map for developing and applying object-oriented bayesian networks to “WICKED” problems," Ecological Modelling, Elsevier, vol. 360(C), pages 27-44.
    3. Emily S W Wong & Margaret C Hardy & David Wood & Timothy Bailey & Glenn F King, 2013. "SVM-Based Prediction of Propeptide Cleavage Sites in Spider Toxins Identifies Toxin Innovation in an Australian Tarantula," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-11, July.
    4. Alessandro Ambrosi & Claudia Cattoglio & Clelia Di Serio, 2008. "Retroviral Integration Process in the Human Genome: Is It Really Non-Random? A New Statistical Approach," PLOS Computational Biology, Public Library of Science, vol. 4(8), pages 1-6, August.
    5. Lior Shamir & John D Delaney & Nikita Orlov & D Mark Eckley & Ilya G Goldberg, 2010. "Pattern Recognition Software and Techniques for Biological Image Analysis," PLOS Computational Biology, Public Library of Science, vol. 6(11), pages 1-10, November.
    6. Kay H Brodersen & Thomas M Schofield & Alexander P Leff & Cheng Soon Ong & Ekaterina I Lomakina & Joachim M Buhmann & Klaas E Stephan, 2011. "Generative Embedding for Model-Based Classification of fMRI Data," PLOS Computational Biology, Public Library of Science, vol. 7(6), pages 1-19, June.
    7. Shweta Bhandare & Debra S Goldberg & Robin Dowell, 2017. "Discriminating between HuR and TTP binding sites using the k-spectrum kernel method," PLOS ONE, Public Library of Science, vol. 12(3), pages 1-14, March.
    8. J. H. Smid & A. N. Swart & A. H. Havelaar & A. Pielaat, 2011. "A Practical Framework for the Construction of a Biotracing Model: Application to Salmonella in the Pork Slaughter Chain," Risk Analysis, John Wiley & Sons, vol. 31(9), pages 1434-1450, September.
    9. B J Morrison McKay & Clare Sansom, 2009. "Webb Miller and Trey Ideker To Receive Top International Bioinformatics Awards for 2009 from the International Society for Computational Biology," PLOS Computational Biology, Public Library of Science, vol. 5(4), pages 1-4, April.
    10. Wei Shui & Yiyi Zhang & Xinggui Wang & Yuanmeng Liu & Qianfeng Wang & Fei Duan & Chaowei Wu & Wanyu Shui, 2022. "Does Tibetan Household Livelihood Capital Enhance Tourism Participation Sustainability? Evidence from China’s Jiaju Tibetan Village," IJERPH, MDPI, vol. 19(15), pages 1-15, July.
    11. Marina M -C Vidovic & Nico Görnitz & Klaus-Robert Müller & Gunnar Rätsch & Marius Kloft, 2015. "SVM2Motif—Reconstructing Overlapping DNA Sequence Motifs by Mimicking an SVM Predictor," PLOS ONE, Public Library of Science, vol. 10(12), pages 1-23, December.
    12. Emili Balaguer-Ballester & Christopher C Lapish & Jeremy K Seamans & Daniel Durstewitz, 2011. "Attracting Dynamics of Frontal Cortex Ensembles during Memory-Guided Decision-Making," PLOS Computational Biology, Public Library of Science, vol. 7(5), pages 1-19, May.
    13. A Ivanenko & P Watkins & M A J van Gerven & K Hammerschmidt & B Englitz, 2020. "Classifying sex and strain from mouse ultrasonic vocalizations using deep learning," PLOS Computational Biology, Public Library of Science, vol. 16(6), pages 1-27, June.
    14. Yue Deng & Yanyu Zhao & Yebin Liu & Qionghai Dai, 2013. "Differences Help Recognition: A Probabilistic Interpretation," PLOS ONE, Public Library of Science, vol. 8(6), pages 1-10, June.
    15. Charlotte Soneson & Sarah Gerster & Mauro Delorenzi, 2014. "Batch Effect Confounding Leads to Strong Bias in Performance Estimates Obtained by Cross-Validation," PLOS ONE, Public Library of Science, vol. 9(6), pages 1-13, June.
    16. Igor O Korolev & Laura L Symonds & Andrea C Bozoki & Alzheimer's Disease Neuroimaging Initiative, 2016. "Predicting Progression from Mild Cognitive Impairment to Alzheimer's Dementia Using Clinical, MRI, and Plasma Biomarkers via Probabilistic Pattern Classification," PLOS ONE, Public Library of Science, vol. 11(2), pages 1-25, February.
    17. Stephen J Gilmore, 2018. "Automated decision support in melanocytic lesion management," PLOS ONE, Public Library of Science, vol. 13(9), pages 1-15, September.
    18. Paula Laccourreye & Concha Bielza & Pedro Larrañaga, 2022. "Explainable Machine Learning for Longitudinal Multi-Omic Microbiome," Mathematics, MDPI, vol. 10(12), pages 1-23, June.
    19. S. Camelo & M. González-Lima & A. Quiroz, 2015. "Nearest neighbors methods for support vector machines," Annals of Operations Research, Springer, vol. 235(1), pages 85-101, December.
    20. Jumeniyaz Seydehmet & Guang Hui Lv & Ilyas Nurmemet & Tayierjiang Aishan & Abdulla Abliz & Mamat Sawut & Abdugheni Abliz & Mamattursun Eziz, 2018. "Model Prediction of Secondary Soil Salinization in the Keriya Oasis, Northwest China," Sustainability, MDPI, vol. 10(3), pages 1-22, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1000945. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.