IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0136577.html
   My bibliography  Save this article

Two Dimensional Yau-Hausdorff Distance with Applications on Comparison of DNA and Protein Sequences

Author

Listed:
  • Kun Tian
  • Xiaoqian Yang
  • Qin Kong
  • Changchuan Yin
  • Rong L He
  • Stephen S-T Yau

Abstract

Comparing DNA or protein sequences plays an important role in the functional analysis of genomes. Despite many methods available for sequences comparison, few methods retain the information content of sequences. We propose a new approach, the Yau-Hausdorff method, which considers all translations and rotations when seeking the best match of graphical curves of DNA or protein sequences. The complexity of this method is lower than that of any other two dimensional minimum Hausdorff algorithm. The Yau-Hausdorff method can be used for measuring the similarity of DNA sequences based on two important tools: the Yau-Hausdorff distance and graphical representation of DNA sequences. The graphical representations of DNA sequences conserve all sequence information and the Yau-Hausdorff distance is mathematically proved as a true metric. Therefore, the proposed distance can preciously measure the similarity of DNA sequences. The phylogenetic analyses of DNA sequences by the Yau-Hausdorff distance show the accuracy and stability of our approach in similarity comparison of DNA or protein sequences. This study demonstrates that Yau-Hausdorff distance is a natural metric for DNA and protein sequences with high level of stability. The approach can be also applied to similarity analysis of protein sequences by graphic representations, as well as general two dimensional shape matching.

Suggested Citation

  • Kun Tian & Xiaoqian Yang & Qin Kong & Changchuan Yin & Rong L He & Stephen S-T Yau, 2015. "Two Dimensional Yau-Hausdorff Distance with Applications on Comparison of DNA and Protein Sequences," PLOS ONE, Public Library of Science, vol. 10(9), pages 1-19, September.
  • Handle: RePEc:plo:pone00:0136577
    DOI: 10.1371/journal.pone.0136577
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0136577
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0136577&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0136577?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zixuan Cang & Lin Mu & Guo-Wei Wei, 2018. "Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening," PLOS Computational Biology, Public Library of Science, vol. 14(1), pages 1-44, January.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0136577. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.