IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1008990.html
   My bibliography  Save this article

Finding recurrent RNA structural networks with fast maximal common subgraphs of edge-colored graphs

Author

Listed:
  • Antoine Soulé
  • Vladimir Reinharz
  • Roman Sarrazin-Gendron
  • Alain Denise
  • Jérôme Waldispühl

Abstract

RNA tertiary structure is crucial to its many non-coding molecular functions. RNA architecture is shaped by its secondary structure composed of stems, stacked canonical base pairs, enclosing loops. While stems are precisely captured by free-energy models, loops composed of non-canonical base pairs are not. Nor are distant interactions linking together those secondary structure elements (SSEs). Databases of conserved 3D geometries (a.k.a. modules) not captured by energetic models are leveraged for structure prediction and design, but the computational complexity has limited their study to local elements, loops. Representing the RNA structure as a graph has recently allowed to expend this work to pairs of SSEs, uncovering a hierarchical organization of these 3D modules, at great computational cost. Systematically capturing recurrent patterns on a large scale is a main challenge in the study of RNA structures. In this paper, we present an efficient algorithm to compute maximal isomorphisms in edge colored graphs. We extend this algorithm to a framework well suited to identify RNA modules, and fast enough to considerably generalize previous approaches. To exhibit the versatility of our framework, we first reproduce results identifying all common modules spanning more than 2 SSEs, in a few hours instead of weeks. The efficiency of our new algorithm is demonstrated by computing the maximal modules between any pair of entire RNA in the non-redundant corpus of known RNA 3D structures. We observe that the biggest modules our method uncovers compose large shared sub-structure spanning hundreds of nucleotides and base pairs between the ribosomes of Thermus thermophilus, Escherichia Coli, and Pseudomonas aeruginosa.Author summary: Ribonucleic Acids (RNAs) are performing a broad range of essential molecular functions in cells, many of which rely on intricate folding properties of the molecule. Watson-Crick and Wobble base pairs form early, stack onto each other to create stems connected by loops, which are themselves stabilized by more sophisticated base interaction patterns. These networks are essential to shape RNA 3D structures but unfortunately still poorly understood. Here, we undertake the task to build a catalog of base interaction networks occurring in multiple structures. However, a pairwise comparison of all RNA structures is computationally heavy. Therefore, we devise an algorithm leveraging intrinsic properties of RNA base interaction networks that enables us to quickly mine full databases of 3D structures. Compared to previous methods, our techniques bring the total running time of the analysis from months to hours while performing more general searches. The data collected though this work will benefit molecular evolution studies and serve in structure prediction tools.

Suggested Citation

  • Antoine Soulé & Vladimir Reinharz & Roman Sarrazin-Gendron & Alain Denise & Jérôme Waldispühl, 2021. "Finding recurrent RNA structural networks with fast maximal common subgraphs of edge-colored graphs," PLOS Computational Biology, Public Library of Science, vol. 17(5), pages 1-28, May.
  • Handle: RePEc:plo:pcbi00:1008990
    DOI: 10.1371/journal.pcbi.1008990
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1008990
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1008990&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1008990?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Konstantin Bokov & Sergey V. Steinberg, 2009. "A hierarchical model for evolution of 23S ribosomal RNA," Nature, Nature, vol. 457(7232), pages 977-980, February.
    2. Marc Parisien & François Major, 2008. "The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data," Nature, Nature, vol. 452(7183), pages 51-55, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liang Liu & Shi-Jie Chen, 2012. "Coarse-Grained Prediction of RNA Loop Structures," PLOS ONE, Public Library of Science, vol. 7(11), pages 1-15, November.
    2. Jun Li & Wei Zhu & Jun Wang & Wenfei Li & Sheng Gong & Jian Zhang & Wei Wang, 2018. "RNA3DCNN: Local and global quality assessments of RNA 3D structures using 3D deep convolutional neural networks," PLOS Computational Biology, Public Library of Science, vol. 14(11), pages 1-18, November.
    3. Ge Han & Yi Xue, 2022. "Rational design of hairpin RNA excited states reveals multi-step transitions," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    4. Michael F Sloma & David H Mathews, 2017. "Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs," PLOS Computational Biology, Public Library of Science, vol. 13(11), pages 1-23, November.
    5. Jun Li & Jian Zhang & Jun Wang & Wenfei Li & Wei Wang, 2016. "Structure Prediction of RNA Loops with a Probabilistic Approach," PLOS Computational Biology, Public Library of Science, vol. 12(8), pages 1-17, August.
    6. Jes Frellsen & Ida Moltke & Martin Thiim & Kanti V Mardia & Jesper Ferkinghoff-Borg & Thomas Hamelryck, 2009. "A Probabilistic Model of RNA Conformational Space," PLOS Computational Biology, Public Library of Science, vol. 5(6), pages 1-11, June.
    7. Mélanie Boudard & Julie Bernauer & Dominique Barth & Johanne Cohen & Alain Denise, 2015. "GARN: Sampling RNA 3D Structure Space with Game Theory and Knowledge-Based Scoring Strategies," PLOS ONE, Public Library of Science, vol. 10(8), pages 1-21, August.
    8. Wenkai Wang & Chenjie Feng & Renmin Han & Ziyi Wang & Lisha Ye & Zongyang Du & Hong Wei & Fa Zhang & Zhenling Peng & Jianyi Yang, 2023. "trRosettaRNA: automated prediction of RNA 3D structure with transformer network," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    9. Carla L. Esposito & Ida Autiero & Annamaria Sandomenico & H. Li & Mahmoud A. Bassal & Maria L. Ibba & Dongfang Wang & Lucrezia Rinaldi & Simone Ummarino & Giulia Gaggi & Marta Borchiellini & Piotr Swi, 2023. "Targeted systematic evolution of an RNA platform neutralizing DNMT1 function and controlling DNA methylation," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    10. Kai Sheng & Ning Li & Jessica N. Rabuck-Gibbons & Xiyu Dong & Dmitry Lyumkis & James R. Williamson, 2023. "Assembly landscape for the bacterial large ribosomal subunit," Nature Communications, Nature, vol. 14(1), pages 1-10, December.
    11. Jiaqiang Zhu & Wei Huang & Jing Zhao & Loc Huynh & Derek J. Taylor & Michael E. Harris, 2022. "Structural and mechanistic basis for recognition of alternative tRNA precursor substrates by bacterial ribonuclease P," Nature Communications, Nature, vol. 13(1), pages 1-13, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1008990. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.