IDEAS home Printed from https://ideas.repec.org/a/nat/nature/v431y2004i7011d10.1038_nature02917.html
   My bibliography  Save this article

Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous

Author

Listed:
  • Bryan Kolaczkowski

    (Department of Computer and Information Science)

  • Joseph W. Thornton

    (Center for Ecology and Evolutionary Biology, University of Oregon)

Abstract

All inferences in comparative biology depend on accurate estimates of evolutionary relationships. Recent phylogenetic analyses have turned away from maximum parsimony towards the probabilistic techniques of maximum likelihood and bayesian Markov chain Monte Carlo (BMCMC). These probabilistic techniques represent a parametric approach to statistical phylogenetics, because their criterion for evaluating a topology—the probability of the data, given the tree—is calculated with reference to an explicit evolutionary model from which the data are assumed to be identically distributed. Maximum parsimony can be considered nonparametric, because trees are evaluated on the basis of a general metric—the minimum number of character state changes required to generate the data on a given tree—without assuming a specific distribution1. The shift to parametric methods was spurred, in large part, by studies showing that although both approaches perform well most of the time2, maximum parsimony is strongly biased towards recovering an incorrect tree under certain combinations of branch lengths, whereas maximum likelihood is not3,4,5,6. All these evaluations simulated sequences by a largely homogeneous evolutionary process in which data are identically distributed. There is ample evidence, however, that real-world gene sequences evolve heterogeneously and are not identically distributed7,8,9,10,11,12,13,14,15,16. Here we show that maximum likelihood and BMCMC can become strongly biased and statistically inconsistent when the rates at which sequence sites evolve change non-identically over time. Maximum parsimony performs substantially better than current parametric methods over a wide range of conditions tested, including moderate heterogeneity and phylogenetic problems not normally considered difficult.

Suggested Citation

  • Bryan Kolaczkowski & Joseph W. Thornton, 2004. "Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous," Nature, Nature, vol. 431(7011), pages 980-984, October.
  • Handle: RePEc:nat:nature:v:431:y:2004:i:7011:d:10.1038_nature02917
    DOI: 10.1038/nature02917
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/nature02917
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/nature02917?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jussi Määttä & Teemu Roos, 2016. "Maximum Parsimony and the Skewness Test: A Simulation Study of the Limits of Applicability," PLOS ONE, Public Library of Science, vol. 11(4), pages 1-21, April.
    2. Johannes Bergsten & Kelly B Miller, 2007. "Phylogeny of Diving Beetles Reveals a Coevolutionary Arms Race between the Sexes," PLOS ONE, Public Library of Science, vol. 2(6), pages 1-6, June.
    3. April M Wright & David M Hillis, 2014. "Bayesian Analysis Using a Simple Likelihood Model Outperforms Parsimony for Estimation of Phylogeny from Discrete Morphological Data," PLOS ONE, Public Library of Science, vol. 9(10), pages 1-6, October.
    4. Hung D Nguyen & Maki Yoshihama & Naoya Kenmochi, 2005. "New Maximum Likelihood Estimators for Eukaryotic Intron Evolution," PLOS Computational Biology, Public Library of Science, vol. 1(7), pages 1-8, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:431:y:2004:i:7011:d:10.1038_nature02917. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.