IDEAS home Printed from https://ideas.repec.org/a/bpj/sagmbi/v14y2015i4p375-389n5.html
   My bibliography  Save this article

Synonymous and nonsynonymous distances help untangle convergent evolution and recombination

Author

Listed:
  • Chi Peter B.

    (Department of Statistics, California Polytechnic State University, San Luis Obispo, CA, 93407, USA)

  • Chattopadhyay Sujay

    (Department of Microbiology, University of Washington, Seattle, WA, 98195, USA)

  • Lemey Philippe

    (Rega Institute, Department of Microbiology and Immunology, KU Leuven – University of Leuven, B-3000 Leuven, Belgium)

  • Sokurenko Evgeni V.

    (Department of Microbiology, University of Washington, Seattle, WA, 98195, USA)

  • Minin Vladimir N.

    (Departments of Statistics and Biology, University of Washington, Seattle, WA, 98195, USA)

Abstract

When estimating a phylogeny from a multiple sequence alignment, researchers often assume the absence of recombination. However, if recombination is present, then tree estimation and all downstream analyses will be impacted, because different segments of the sequence alignment support different phylogenies. Similarly, convergent selective pressures at the molecular level can also lead to phylogenetic tree incongruence across the sequence alignment. Current methods for detection of phylogenetic incongruence are not equipped to distinguish between these two different mechanisms and assume that the incongruence is a result of recombination or other horizontal transfer of genetic information. We propose a new recombination detection method that can make this distinction, based on synonymous codon substitution distances. Although some power is lost by discarding the information contained in the nonsynonymous substitutions, our new method has lower false positive probabilities than the comparable recombination detection method when the phylogenetic incongruence signal is due to convergent evolution. We apply our method to three empirical examples, where we analyze: (1) sequences from a transmission network of the human immunodeficiency virus, (2) tlpB gene sequences from a geographically diverse set of 38 Helicobacter pylori strains, and (3) hepatitis C virus sequences sampled longitudinally from one patient.

Suggested Citation

  • Chi Peter B. & Chattopadhyay Sujay & Lemey Philippe & Sokurenko Evgeni V. & Minin Vladimir N., 2015. "Synonymous and nonsynonymous distances help untangle convergent evolution and recombination," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 14(4), pages 375-389, August.
  • Handle: RePEc:bpj:sagmbi:v:14:y:2015:i:4:p:375-389:n:5
    DOI: 10.1515/sagmb-2014-0078
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/sagmb-2014-0078
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/sagmb-2014-0078?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Suchard M.A. & Weiss R.E. & Dorman K.S. & Sinsheimer J.S., 2003. "Inferring Spatial Phylogenetic Variation Along Nucleotide Sequences: A Multiple Changepoint Model," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 427-437, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Leonardo Oliveira Martins & Hirohisa Kishino, 2010. "Distribution of distances between topologies and its effect on detection of phylogenetic recombination," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 62(1), pages 145-159, February.
    2. Yu Chuan Tai & Mark N. Kvale & John S. Witte, 2010. "Segmentation and Estimation for SNP Microarrays: A Bayesian Multiple Change-Point Approach," Biometrics, The International Biometric Society, vol. 66(3), pages 675-683, September.
    3. Husmeier Dirk & Mantzaris Alexander V., 2008. "Addressing the Shortcomings of Three Recent Bayesian Methods for Detecting Interspecific Recombination in DNA Sequence Alignments," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 7(1), pages 1-41, November.
    4. Wolfgang P. Lehrach & Dirk Husmeier, 2009. "Segmenting bacterial and viral DNA sequence alignments with a trans‐dimensional phylogenetic factorial hidden Markov model," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 58(3), pages 307-327, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:sagmbi:v:14:y:2015:i:4:p:375-389:n:5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.