IDEAS home Printed from https://ideas.repec.org/a/eee/thpobi/v93y2014icp14-29.html
   My bibliography  Save this article

Theory and applications of a deterministic approximation to the coalescent model

Author

Listed:
  • Jewett, Ethan M.
  • Rosenberg, Noah A.

Abstract

Under the coalescent model, the random number nt of lineages ancestral to a sample is nearly deterministic as a function of time when nt is moderate to large in value, and it is well approximated by its expectation E[nt]. In turn, this expectation is well approximated by simple deterministic functions that are easy to compute. Such deterministic functions have been applied to estimate allele age, effective population size, and genetic diversity, and they have been used to study properties of models of infectious disease dynamics. Although a number of simple approximations of E[nt] have been derived and applied to problems of population-genetic inference, the theoretical accuracy of the resulting approximate formulas and the inferences obtained using these approximations is not known, and the range of problems to which they can be applied is not well understood. Here, we demonstrate general procedures by which the approximation nt≈E[nt] can be used to reduce the computational complexity of coalescent formulas, and we show that the resulting approximations converge to their true values under simple assumptions. Such approximations provide alternatives to exact formulas that are computationally intractable or numerically unstable when the number of sampled lineages is moderate or large. We also extend an existing class of approximations of E[nt] to the case of multiple populations of time-varying size with migration among them. Our results facilitate the use of the deterministic approximation nt≈E[nt] for deriving functionally simple, computationally efficient, and numerically stable approximations of coalescent formulas under complicated demographic scenarios.

Suggested Citation

  • Jewett, Ethan M. & Rosenberg, Noah A., 2014. "Theory and applications of a deterministic approximation to the coalescent model," Theoretical Population Biology, Elsevier, vol. 93(C), pages 14-29.
  • Handle: RePEc:eee:thpobi:v:93:y:2014:i:c:p:14-29
    DOI: 10.1016/j.tpb.2013.12.007
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0040580913001482
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.tpb.2013.12.007?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Eligius M.T. Hendrix & Boglárka G.-Tóth, 2010. "Introduction to Nonlinear and Global Optimization," Springer Optimization and Its Applications, Springer, number 978-0-387-88670-1, December.
    2. Efromovich Sam & Salter Kubatko Laura, 2008. "Coalescent Time Distributions in Trees of Arbitrary Size," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 7(1), pages 1-30, January.
    3. Huang, Lucy & Buzbas, Erkan O. & Rosenberg, Noah A., 2013. "Genotype imputation in a coalescent model with infinitely-many-sites mutation," Theoretical Population Biology, Elsevier, vol. 87(C), pages 62-74.
    4. Vaart,A. W. van der, 2000. "Asymptotic Statistics," Cambridge Books, Cambridge University Press, number 9780521784504, November.
    5. Alexander Shapiro & Jos Berge, 2002. "Statistical inference of minimum rank factor analysis," Psychometrika, Springer;The Psychometric Society, vol. 67(1), pages 79-94, March.
    6. Davison, D. & Pritchard, J.K. & Coop, G., 2009. "An approximate likelihood for genetic data under a model with recombination and population splitting," Theoretical Population Biology, Elsevier, vol. 75(4), pages 331-345.
    7. Heng Li & Richard Durbin, 2011. "Inference of human population history from individual whole-genome sequences," Nature, Nature, vol. 475(7357), pages 493-496, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Reppell, M. & Zöllner, S., 2018. "An efficient algorithm for generating the internal branches of a Kingman coalescent," Theoretical Population Biology, Elsevier, vol. 122(C), pages 57-66.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Steinrücken, Matthias & Paul, Joshua S. & Song, Yun S., 2013. "A sequentially Markov conditional sampling distribution for structured populations with migration and recombination," Theoretical Population Biology, Elsevier, vol. 87(C), pages 51-61.
    2. Kasy, Maximilian, 2011. "A nonparametric test for path dependence in discrete panel data," Economics Letters, Elsevier, vol. 113(2), pages 172-175.
    3. Atı̇la Abdulkadı̇roğlu & Joshua D. Angrist & Yusuke Narita & Parag Pathak, 2022. "Breaking Ties: Regression Discontinuity Design Meets Market Design," Econometrica, Econometric Society, vol. 90(1), pages 117-151, January.
    4. Anastasiou, Andreas, 2017. "Bounds for the normal approximation of the maximum likelihood estimator from m-dependent random variables," Statistics & Probability Letters, Elsevier, vol. 129(C), pages 171-181.
    5. Denter, Philipp & Sisak, Dana, 2015. "Do polls create momentum in political competition?," Journal of Public Economics, Elsevier, vol. 130(C), pages 1-14.
    6. Salgado Alfredo, 2018. "Incomplete Information and Costly Signaling in College Admissions," Working Papers 2018-23, Banco de México.
    7. Albrecht, James & Anderson, Axel & Vroman, Susan, 2010. "Search by committee," Journal of Economic Theory, Elsevier, vol. 145(4), pages 1386-1407, July.
    8. Yoici Arai & Taisuke Otsu & Mengshan Xu, 2022. "GLS under monotone heteroskedasticity," STICERD - Econometrics Paper Series 625, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
    9. Ashesh Rambachan & Jonathan Roth, 2020. "Design-Based Uncertainty for Quasi-Experiments," Papers 2008.00602, arXiv.org, revised Oct 2024.
    10. Blier-Wong, Christopher & Cossette, Hélène & Marceau, Etienne, 2023. "Risk aggregation with FGM copulas," Insurance: Mathematics and Economics, Elsevier, vol. 111(C), pages 102-120.
    11. van Dijk, Diana & Hendrix, Eligius M.T. & Haijema, Rene & Groeneveld, Rolf A. & van Ierland, Ekko C., 2014. "On solving a bi-level stochastic dynamic programming model for analyzing fisheries policies: Fishermen behavior and optimal fish quota," Ecological Modelling, Elsevier, vol. 272(C), pages 68-75.
    12. Debashis Ghosh, 2004. "Semiparametric methods for the binormal model with multiple biomarkers," The University of Michigan Department of Biostatistics Working Paper Series 1046, Berkeley Electronic Press.
    13. Brian D. Williamson & Peter B. Gilbert & Marco Carone & Noah Simon, 2021. "Nonparametric variable importance assessment using machine learning techniques," Biometrics, The International Biometric Society, vol. 77(1), pages 9-22, March.
    14. Arie Beresteanu & Francesca Molinari, 2008. "Asymptotic Properties for a Class of Partially Identified Models," Econometrica, Econometric Society, vol. 76(4), pages 763-814, July.
    15. Laurent Davezies & Xavier D'Haultfoeuille & Yannick Guyonvarch, 2018. "Asymptotic results under multiway clustering," Papers 1807.07925, arXiv.org, revised Aug 2018.
    16. Dominic Edelmann & Tobias Terzer & Donald Richards, 2021. "A Basic Treatment of the Distance Covariance," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 12-25, May.
    17. A Stefano Caria & Grant Gordon & Maximilian Kasy & Simon Quinn & Soha Osman Shami & Alexander Teytelboym, 2024. "An Adaptive Targeted Field Experiment: Job Search Assistance for Refugees in Jordan," Journal of the European Economic Association, European Economic Association, vol. 22(2), pages 781-836.
    18. Clément de Chaisemartin & Xavier D'Haultfœuille, 2020. "Two-Way Fixed Effects Estimators with Heterogeneous Treatment Effects," American Economic Review, American Economic Association, vol. 110(9), pages 2964-2996, September.
    19. Abe, Toshihiro & Miyata, Yoichi & Shiohama, Takayuki, 2023. "Bayesian estimation for mode and anti-mode preserving circular distributions," Econometrics and Statistics, Elsevier, vol. 27(C), pages 136-160.
    20. Bryan S. Graham, 2017. "An econometric model of network formation with degree heterogeneity," CeMMAP working papers 08/17, Institute for Fiscal Studies.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:thpobi:v:93:y:2014:i:c:p:14-29. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/intelligence .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.