IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0234978.html
   My bibliography  Save this article

Data-driven network alignment

Author

Listed:
  • Shawn Gu
  • Tijana Milenković

Abstract

In this study, we deal with the problem of biological network alignment (NA), which aims to find a node mapping between species’ molecular networks that uncovers similar network regions, thus allowing for the transfer of functional knowledge between the aligned nodes. We provide evidence that current NA methods, which assume that topologically similar nodes (i.e., nodes whose network neighborhoods are isomorphic-like) have high functional relatedness, do not actually end up aligning functionally related nodes. That is, we show that the current topological similarity assumption does not hold well. Consequently, we argue that a paradigm shift is needed with how the NA problem is approached. So, we redefine NA as a data-driven framework, called TARA (data-driven NA), which attempts to learn the relationship between topological relatedness and functional relatedness without assuming that topological relatedness corresponds to topological similarity. TARA makes no assumptions about what nodes should be aligned, distinguishing it from existing NA methods. Specifically, TARA trains a classifier to predict whether two nodes from different networks are functionally related based on their network topological patterns (features). We find that TARA is able to make accurate predictions. TARA then takes each pair of nodes that are predicted as related to be part of an alignment. Like traditional NA methods, TARA uses this alignment for the across-species transfer of functional knowledge. TARA as currently implemented uses topological but not protein sequence information for functional knowledge transfer. In this context, we find that TARA outperforms existing state-of-the-art NA methods that also use topological information, WAVE and SANA, and even outperforms or complements a state-of-the-art NA method that uses both topological and sequence information, PrimAlign. Hence, adding sequence information to TARA, which is our future work, is likely to further improve its performance. The software and data are available at http://www.nd.edu/~cone/TARA/.

Suggested Citation

  • Shawn Gu & Tijana Milenković, 2020. "Data-driven network alignment," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-30, July.
  • Handle: RePEc:plo:pone00:0234978
    DOI: 10.1371/journal.pone.0234978
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0234978
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0234978&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0234978?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Giovanni Ciriello & Marco Mina & Pietro H Guzzi & Mario Cannataro & Concettina Guerra, 2012. "AlignNemo: A Local Network Alignment Method to Integrate Homology and Topology," PLOS ONE, Public Library of Science, vol. 7(6), pages 1-14, June.
    2. Sayed Mohammad Ebrahim Sahraeian & Byung-Jun Yoon, 2013. "SMETANA: Accurate and Scalable Algorithm for Probabilistic Alignment of Large-Scale Biological Networks," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-12, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hyun-Myung Woo & Hyundoo Jeong & Byung-Jun Yoon, 2020. "NAPAbench 2: A network synthesis algorithm for generating realistic protein-protein interaction (PPI) network families," PLOS ONE, Public Library of Science, vol. 15(1), pages 1-20, January.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0234978. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.