IDEAS home Printed from https://ideas.repec.org/a/eee/thpobi/v110y2016icp12-24.html
   My bibliography  Save this article

Consistency and inconsistency of consensus methods for inferring species trees from gene trees in the presence of ancestral population structure

Author

Listed:
  • DeGiorgio, Michael
  • Rosenberg, Noah A.

Abstract

In the last few years, several statistically consistent consensus methods for species tree inference have been devised that are robust to the gene tree discordance caused by incomplete lineage sorting in unstructured ancestral populations. One source of gene tree discordance that has only recently been identified as a potential obstacle for phylogenetic inference is ancestral population structure. In this article, we describe a general model of ancestral population structure, and by relying on a single carefully constructed example scenario, we show that the consensus methods Democratic Vote, STEAC, STAR, R∗ Consensus, Rooted Triple Consensus, Minimize Deep Coalescences, and Majority-Rule Consensus are statistically inconsistent under the model. We find that among the consensus methods evaluated, the only method that is statistically consistent in the presence of ancestral population structure is GLASS/Maximum Tree. We use simulations to evaluate the behavior of the various consensus methods in a model with ancestral population structure, showing that as the number of gene trees increases, estimates on the basis of GLASS/Maximum Tree approach the true species tree topology irrespective of the level of population structure, whereas estimates based on the remaining methods only approach the true species tree topology if the level of structure is low. However, through simulations using species trees both with and without ancestral population structure, we show that GLASS/Maximum Tree performs unusually poorly on gene trees inferred from alignments with little information. This practical limitation of GLASS/Maximum Tree together with the inconsistency of other methods prompts the need for both further testing of additional existing methods and development of novel methods under conditions that incorporate ancestral population structure.

Suggested Citation

  • DeGiorgio, Michael & Rosenberg, Noah A., 2016. "Consistency and inconsistency of consensus methods for inferring species trees from gene trees in the presence of ancestral population structure," Theoretical Population Biology, Elsevier, vol. 110(C), pages 12-24.
  • Handle: RePEc:eee:thpobi:v:110:y:2016:i:c:p:12-24
    DOI: 10.1016/j.tpb.2016.02.002
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S004058091630003X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.tpb.2016.02.002?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Nick Patterson & Daniel J. Richter & Sante Gnerre & Eric S. Lander & David Reich, 2006. "Genetic evidence for complex speciation of humans and chimpanzees," Nature, Nature, vol. 441(7097), pages 1103-1108, June.
    2. Michael Steel, 1992. "The complexity of reconstructing trees from qualitative characters and subtrees," Journal of Classification, Springer;The Classification Society, vol. 9(1), pages 91-116, January.
    3. Alexander Shapiro & Jos Berge, 2002. "Statistical inference of minimum rank factor analysis," Psychometrika, Springer;The Psychometric Society, vol. 67(1), pages 79-94, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Anastasiou, Andreas, 2017. "Bounds for the normal approximation of the maximum likelihood estimator from m-dependent random variables," Statistics & Probability Letters, Elsevier, vol. 129(C), pages 171-181.
    2. Denter, Philipp & Sisak, Dana, 2015. "Do polls create momentum in political competition?," Journal of Public Economics, Elsevier, vol. 130(C), pages 1-14.
    3. Salgado Alfredo, 2018. "Incomplete Information and Costly Signaling in College Admissions," Working Papers 2018-23, Banco de México.
    4. Albrecht, James & Anderson, Axel & Vroman, Susan, 2010. "Search by committee," Journal of Economic Theory, Elsevier, vol. 145(4), pages 1386-1407, July.
    5. Blier-Wong, Christopher & Cossette, Hélène & Marceau, Etienne, 2023. "Risk aggregation with FGM copulas," Insurance: Mathematics and Economics, Elsevier, vol. 111(C), pages 102-120.
    6. Paul Bastide & Mahendra Mariadassou & Stéphane Robin, 2017. "Detection of adaptive shifts on phylogenies by using shifted stochastic processes on a tree," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(4), pages 1067-1093, September.
    7. Simon Bruhn & Thomas Grebel & Lionel Nesta, 2023. "The fallacy in productivity decomposition," Journal of Evolutionary Economics, Springer, vol. 33(3), pages 797-835, July.
    8. Wim J. van der Linden, 2019. "Lord’s Equity Theorem Revisited," Journal of Educational and Behavioral Statistics, , vol. 44(4), pages 415-430, August.
    9. Simar, Léopold & Wilson, Paul, 2022. "Modern Tools for Evaluating the Performance of Health-Care Providers," LIDAM Discussion Papers ISBA 2022006, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    10. Baey, Charlotte & Didier, Anne & Lemaire, Sébastien & Maupas, Fabienne & Cournède, Paul-Henry, 2013. "Modelling the interindividual variability of organogenesis in sugar beet populations using a hierarchical segmented model," Ecological Modelling, Elsevier, vol. 263(C), pages 56-63.
    11. Tasche, Dirk, 2013. "Bayesian estimation of probabilities of default for low default portfolios," Journal of Risk Management in Financial Institutions, Henry Stewart Publications, vol. 6(3), pages 302-326, July.
    12. Diers, Dorothea & Linde, Marc & Hahn, Lukas, 2016. "Addendum to ‘The multi-year non-life insurance risk in the additive reserving model’ [Insurance Math. Econom. 52(3) (2013) 590–598]: Quantification of multi-year non-life insurance risk in chain ladde," Insurance: Mathematics and Economics, Elsevier, vol. 67(C), pages 187-199.
    13. Anastasiou, Andreas, 2017. "Bounds for the normal approximation of the maximum likelihood estimator from m -dependent random variables," LSE Research Online Documents on Economics 83635, London School of Economics and Political Science, LSE Library.
    14. Hirschberg, Joe & Lye, Jenny, 2017. "Inverting the indirect—The ellipse and the boomerang: Visualizing the confidence intervals of the structural coefficient from two-stage least squares," Journal of Econometrics, Elsevier, vol. 199(2), pages 173-183.
    15. Serguei Kaniovski & Alexander Zaigraev, 2018. "The probability of majority inversion in a two-stage voting system with three states," Theory and Decision, Springer, vol. 84(4), pages 525-546, June.
    16. Packham, Natalie & Woebbeking, Fabian, 2021. "Correlation scenarios and correlation stress testing," IRTG 1792 Discussion Papers 2021-012, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    17. Guillermo Martínez-Flórez & Roger Tovar-Falón & Carlos Barrera-Causil, 2022. "Inflated Unit-Birnbaum-Saunders Distribution," Mathematics, MDPI, vol. 10(4), pages 1-14, February.
    18. Xyngis, Georgios, 2017. "Business-cycle variation in macroeconomic uncertainty and the cross-section of expected returns: Evidence for scale-dependent risks," Journal of Empirical Finance, Elsevier, vol. 44(C), pages 43-65.
    19. Nair, Gopalan M. & Turlach, Berwin A., 2012. "The stochastic h-index," Journal of Informetrics, Elsevier, vol. 6(1), pages 80-87.
    20. Riko Kelter, 2021. "Analysis of type I and II error rates of Bayesian and frequentist parametric and nonparametric two-sample hypothesis tests under preliminary assessment of normality," Computational Statistics, Springer, vol. 36(2), pages 1263-1288, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:thpobi:v:110:y:2016:i:c:p:12-24. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/intelligence .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.