IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1003919.html
   My bibliography  Save this article

Bayesian Inference of Sampled Ancestor Trees for Epidemiology and Fossil Calibration

Author

Listed:
  • Alexandra Gavryushkina
  • David Welch
  • Tanja Stadler
  • Alexei J Drummond

Abstract

Phylogenetic analyses which include fossils or molecular sequences that are sampled through time require models that allow one sample to be a direct ancestor of another sample. As previously available phylogenetic inference tools assume that all samples are tips, they do not allow for this possibility. We have developed and implemented a Bayesian Markov Chain Monte Carlo (MCMC) algorithm to infer what we call sampled ancestor trees, that is, trees in which sampled individuals can be direct ancestors of other sampled individuals. We use a family of birth-death models where individuals may remain in the tree process after sampling, in particular we extend the birth-death skyline model [Stadler et al., 2013] to sampled ancestor trees. This method allows the detection of sampled ancestors as well as estimation of the probability that an individual will be removed from the process when it is sampled. We show that even if sampled ancestors are not of specific interest in an analysis, failing to account for them leads to significant bias in parameter estimates. We also show that sampled ancestor birth-death models where every sample comes from a different time point are non-identifiable and thus require one parameter to be known in order to infer other parameters. We apply our phylogenetic inference accounting for sampled ancestors to epidemiological data, where the possibility of sampled ancestors enables us to identify individuals that infected other individuals after being sampled and to infer fundamental epidemiological parameters. We also apply the method to infer divergence times and diversification rates when fossils are included along with extant species samples, so that fossilisation events are modelled as a part of the tree branching process. Such modelling has many advantages as argued in the literature. The sampler is available as an open-source BEAST2 package (https://github.com/CompEvol/sampled-ancestors).Author Summary: A central goal of phylogenetic analysis is to estimate evolutionary relationships and the dynamical parameters underlying the evolutionary branching process (e.g. macroevolutionary or epidemiological parameters) from molecular data. The statistical methods used in these analyses require that the underlying tree branching process is specified. Standard models for the branching process which were originally designed to describe the evolutionary past of present day species do not allow one sampled taxon to be the ancestor of another. However the probability of sampling a direct ancestor is not negligible for many types of data. For example, when fossil and living species are analysed together to infer species divergence times, fossil species may or may not be direct ancestors of living species. In epidemiology, a sampled individual (a host from which a pathogen sequence was obtained) can infect other individuals after sampling, which then go on to be sampled themselves. The models that account for direct ancestors produce phylogenetic trees with a different structure from classic phylogenetic trees and so using these models in inference requires new computational methods. Here we developed a method for phylogenetic analysis that accounts for the possibility of direct ancestors.

Suggested Citation

  • Alexandra Gavryushkina & David Welch & Tanja Stadler & Alexei J Drummond, 2014. "Bayesian Inference of Sampled Ancestor Trees for Epidemiology and Fossil Calibration," PLOS Computational Biology, Public Library of Science, vol. 10(12), pages 1-15, December.
  • Handle: RePEc:plo:pcbi00:1003919
    DOI: 10.1371/journal.pcbi.1003919
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003919
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1003919&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1003919?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Alexei J Drummond & Simon Y W Ho & Matthew J Phillips & Andrew Rambaut, 2006. "Relaxed Phylogenetics and Dating with Confidence," PLOS Biology, Public Library of Science, vol. 4(5), pages 1-1, March.
    2. Wilkinson, Richard D. & Tavaré, Simon, 2009. "Estimating primate divergence times by using conditioned birth-and-death processes," Theoretical Population Biology, Elsevier, vol. 75(4), pages 278-285.
    3. Simon Tavaré & Charles R. Marshall & Oliver Will & Christophe Soligo & Robert D. Martin, 2002. "Using the fossil record to estimate the age of the last common ancestor of extant primates," Nature, Nature, vol. 416(6882), pages 726-729, April.
    4. Bob Mau & Michael A. Newton & Bret Larget, 1999. "Bayesian Phylogenetic Inference via Markov Chain Monte Carlo Methods," Biometrics, The International Biometric Society, vol. 55(1), pages 1-12, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Chloé Loiseau & Etthel M. Windels & Sebastian M. Gygli & Levan Jugheli & Nino Maghradze & Daniela Brites & Amanda Ross & Galo Goig & Miriam Reinhard & Sonia Borrell & Andrej Trauner & Anna Dötsch & Ru, 2023. "The relative transmission fitness of multidrug-resistant Mycobacterium tuberculosis in a drug resistance hotspot," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    2. Andrew F Magee & Sebastian Höhna & Tetyana I Vasylyeva & Adam D Leaché & Vladimir N Minin, 2020. "Locally adaptive Bayesian birth-death model successfully detects slow and rapid rate shifts," PLOS Computational Biology, Public Library of Science, vol. 16(10), pages 1-23, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael D Nowak & Andrew B Smith & Carl Simpson & Derrick J Zwickl, 2013. "A Simple Method for Estimating Informative Node Age Priors for the Fossil Calibration of Molecular Divergence Time Analyses," PLOS ONE, Public Library of Science, vol. 8(6), pages 1-13, June.
    2. Jordan Douglas & Rong Zhang & Remco Bouckaert, 2021. "Adaptive dating and fast proposals: Revisiting the phylogenetic relaxed clock model," PLOS Computational Biology, Public Library of Science, vol. 17(2), pages 1-30, February.
    3. Wilkinson, Richard D. & Tavaré, Simon, 2009. "Estimating primate divergence times by using conditioned birth-and-death processes," Theoretical Population Biology, Elsevier, vol. 75(4), pages 278-285.
    4. Joachim Schmidt & Lars Opgenoorth & Steffen Höll & Ralf Bastrop, 2012. "Into the Himalayan Exile: The Phylogeography of the Ground Beetle Ethira clade Supports the Tibetan Origin of Forest-Dwelling Himalayan Species Groups," PLOS ONE, Public Library of Science, vol. 7(9), pages 1-15, September.
    5. Barry G Hall & Stephen J Salipante, 2007. "Measures of Clade Confidence Do Not Correlate with Accuracy of Phylogenetic Trees," PLOS Computational Biology, Public Library of Science, vol. 3(3), pages 1-9, March.
    6. Dinesh Aggarwal & Ben Warne & Aminu S. Jahun & William L. Hamilton & Thomas Fieldman & Louis Plessis & Verity Hill & Beth Blane & Emmeline Watkins & Elizabeth Wright & Grant Hall & Catherine Ludden & , 2022. "Genomic epidemiology of SARS-CoV-2 in a UK university identifies dynamics of transmission," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    7. Iliana Bista & Jonathan M. D. Wood & Thomas Desvignes & Shane A. McCarthy & Michael Matschiner & Zemin Ning & Alan Tracey & James Torrance & Ying Sims & William Chow & Michelle Smith & Karen Oliver & , 2023. "Genomics of cold adaptations in the Antarctic notothenioid fish radiation," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    8. Haitao Shang & Daniel H. Rothman & Gregory P. Fournier, 2022. "Oxidative metabolisms catalyzed Earth’s oxygenation," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    9. Nan Song & Ai-Ping Liang, 2013. "A Preliminary Molecular Phylogeny of Planthoppers (Hemiptera: Fulgoroidea) Based on Nuclear and Mitochondrial DNA Sequences," PLOS ONE, Public Library of Science, vol. 8(3), pages 1-11, March.
    10. Saúl F. Domínguez-Guerrero & Fausto R. Méndez-de la Cruz & Norma L. Manríquez-Morán & Mark E. Olson & Patricia Galina-Tessaro & Diego M. Arenas-Moreno & Adán Bautista- del Moral & Adriana Benítez-Vill, 2022. "Exceptional parallelisms characterize the evolutionary transition to live birth in phrynosomatid lizards," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    11. Mekala Sundaram & Janna R Willoughby & Nathanael I Lichti & Michael A Steele & Robert K Swihart, 2015. "Segregating the Effects of Seed Traits and Common Ancestry of Hardwood Trees on Eastern Gray Squirrel Foraging Decisions," PLOS ONE, Public Library of Science, vol. 10(6), pages 1-16, June.
    12. Alice M. Clement & Richard Cloutier & Michael S. Y. Lee & Benedict King & Olivia Vanhaesebroucke & Corey J. A. Bradshaw & Hugo Dutel & Kate Trinajstic & John A. Long, 2024. "A Late Devonian coelacanth reconfigures actinistian phylogeny, disparity, and evolutionary dynamics," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    13. Mercedes M Burns & Marshal Hedin & Jeffrey W Shultz, 2013. "Comparative Analyses of Reproductive Structures in Harvestmen (Opiliones) Reveal Multiple Transitions from Courtship to Precopulatory Antagonism," PLOS ONE, Public Library of Science, vol. 8(6), pages 1-11, June.
    14. Jonathon D. Gass & Nichola J. Hill & Lambodhar Damodaran & Elena N. Naumova & Felicia B. Nutter & Jonathan A. Runstadler, 2023. "Ecogeographic Drivers of the Spatial Spread of Highly Pathogenic Avian Influenza Outbreaks in Europe and the United States, 2016–Early 2022," IJERPH, MDPI, vol. 20(11), pages 1-17, June.
    15. Nico Neureiter & Peter Ranacher & Nour Efrat-Kowalsky & Gereon A. Kaiping & Robert Weibel & Paul Widmer & Remco R. Bouckaert, 2022. "Detecting contact in language trees: a Bayesian phylogenetic model with horizontal transfer," Palgrave Communications, Palgrave Macmillan, vol. 9(1), pages 1-14, December.
    16. Andrew F Magee & Sebastian Höhna & Tetyana I Vasylyeva & Adam D Leaché & Vladimir N Minin, 2020. "Locally adaptive Bayesian birth-death model successfully detects slow and rapid rate shifts," PLOS Computational Biology, Public Library of Science, vol. 16(10), pages 1-23, October.
    17. Stéphane Aris-Brosou, 2007. "Dating Phylogenies with Hybrid Local Molecular Clocks," PLOS ONE, Public Library of Science, vol. 2(9), pages 1-8, September.
    18. Ian J. Wilson & Michael E. Weale & David J. Balding, 2003. "Inferences from DNA data: population histories, evolutionary processes and forensic match probabilities," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 166(2), pages 155-188, June.
    19. Elena Rivas & Sean R Eddy, 2008. "Probabilistic Phylogenetic Inference with Insertions and Deletions," PLOS Computational Biology, Public Library of Science, vol. 4(9), pages 1-21, September.
    20. McKinley Trevelyan & Cook Alex R & Deardon Robert, 2009. "Inference in Epidemic Models without Likelihoods," The International Journal of Biostatistics, De Gruyter, vol. 5(1), pages 1-40, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1003919. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.