IDEAS home Printed from https://ideas.repec.org/a/bla/jorssa/v166y2003i2p155-188.html
   My bibliography  Save this article

Inferences from DNA data: population histories, evolutionary processes and forensic match probabilities

Author

Listed:
  • Ian J. Wilson
  • Michael E. Weale
  • David J. Balding

Abstract

Summary. We develop a flexible class of Metropolis–Hastings algorithms for drawing inferences about population histories and mutation rates from deoxyribonucleic acid (DNA) sequence data. Match probabilities for use in forensic identification are also obtained, which is particularly useful for mitochondrial DNA profiles. Our data augmentation approach, in which the ancestral DNA data are inferred at each node of the genealogical tree, simplifies likelihood calculations and permits a wide class of mutation models to be employed, so that many different types of DNA sequence data can be analysed within our framework. Moreover, simpler likelihood calculations imply greater freedom for generating tree proposals, so that algorithms with good mixing properties can be implemented. We incorporate the effects of demography by means of simple mechanisms for changes in population size and structure, and we estimate the corresponding demographic parameters, but we do not here allow for the effects of either recombination or selection. We illustrate our methods by application to four human DNA data sets, consisting of DNA sequences, short tandem repeat loci, single‐nucleotide polymorphism sites and insertion sites. Two of the data sets are drawn from the male‐specific Y‐chromosome, one from maternally inherited mitochondrial DNA and one from the β‐globin locus on chromosome 11.

Suggested Citation

  • Ian J. Wilson & Michael E. Weale & David J. Balding, 2003. "Inferences from DNA data: population histories, evolutionary processes and forensic match probabilities," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 166(2), pages 155-188, June.
  • Handle: RePEc:bla:jorssa:v:166:y:2003:i:2:p:155-188
    DOI: 10.1111/1467-985X.00264
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/1467-985X.00264
    Download Restriction: no

    File URL: https://libkey.io/10.1111/1467-985X.00264?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Igor V. Ovchinnikov & Anders Götherström & Galina P. Romanova & Vitaliy M. Kharitonov & Kerstin Lidén & William Goodwin, 2000. "Molecular analysis of Neanderthal DNA from the northern Caucasus," Nature, Nature, vol. 404(6777), pages 490-493, March.
    2. Bob Mau & Michael A. Newton & Bret Larget, 1999. "Bayesian Phylogenetic Inference via Markov Chain Monte Carlo Methods," Biometrics, The International Biometric Society, vol. 55(1), pages 1-12, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ajay Shrestha & Ausif Mahmood, 2016. "Improving Genetic Algorithm with Fine-Tuned Crossover and Scaled Architecture," Journal of Mathematics, Hindawi, vol. 2016, pages 1-10, April.
    2. Noa Chapal-Ilani & Yosef E Maruvka & Adam Spiro & Yitzhak Reizel & Rivka Adar & Liran I Shlush & Ehud Shapiro, 2013. "Comparing Algorithms That Reconstruct Cell Lineage Trees Utilizing Information on Microsatellite Mutations," PLOS Computational Biology, Public Library of Science, vol. 9(11), pages 1-17, November.
    3. Jenkins Paul A., 2012. "Stopping-Time Resampling and Population Genetic Inference under Coalescent Models," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(1), pages 1-20, January.
    4. Tsukuda Koji & Mano Shuhei & Yamamoto Toshimichi, 2020. "Bayesian approach to discriminant problems for count data with application to multilocus short tandem repeat dataset," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 19(2), pages 1-18, April.
    5. Heled, Joseph, 2012. "Sequence diversity under the multispecies coalescent with Yule process and constant population size," Theoretical Population Biology, Elsevier, vol. 81(2), pages 97-101.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Barry G Hall & Stephen J Salipante, 2007. "Measures of Clade Confidence Do Not Correlate with Accuracy of Phylogenetic Trees," PLOS Computational Biology, Public Library of Science, vol. 3(3), pages 1-9, March.
    2. Cyran, Krzysztof A. & Kimmel, Marek, 2010. "Alternatives to the Wright–Fisher model: The robustness of mitochondrial Eve dating," Theoretical Population Biology, Elsevier, vol. 78(3), pages 165-172.
    3. Alexandra Gavryushkina & David Welch & Tanja Stadler & Alexei J Drummond, 2014. "Bayesian Inference of Sampled Ancestor Trees for Epidemiology and Fossil Calibration," PLOS Computational Biology, Public Library of Science, vol. 10(12), pages 1-15, December.
    4. Pham, Tuan D., 2012. "Regularity dimension of sequences and its application to phylogenetic tree reconstruction," Chaos, Solitons & Fractals, Elsevier, vol. 45(6), pages 879-887.
    5. Elena Rivas & Sean R Eddy, 2008. "Probabilistic Phylogenetic Inference with Insertions and Deletions," PLOS Computational Biology, Public Library of Science, vol. 4(9), pages 1-21, September.
    6. Lin, Yu-Min & Fang, Shu-Cherng & Thorne, Jeffrey L., 2007. "A tabu search algorithm for maximum parsimony phylogeny inference," European Journal of Operational Research, Elsevier, vol. 176(3), pages 1908-1917, February.
    7. Rigat, F. & Mira, A., 2012. "Parallel hierarchical sampling: A general-purpose interacting Markov chains Monte Carlo algorithm," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1450-1467.
    8. Jordan Douglas & Rong Zhang & Remco Bouckaert, 2021. "Adaptive dating and fast proposals: Revisiting the phylogenetic relaxed clock model," PLOS Computational Biology, Public Library of Science, vol. 17(2), pages 1-30, February.
    9. Spade David A., 2020. "An extended model for phylogenetic maximum likelihood based on discrete morphological characters," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 19(1), pages 1-11, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssa:v:166:y:2003:i:2:p:155-188. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.