Printed from https://ideas.repec.org/a/eee/thpobi/v107y2016icp39-51.html

Bayesian pedigree inference with small numbers of single nucleotide polymorphisms via a factor-graph representation

Author

Listed:
  • Anderson, Eric C.
  • Ng, Thomas C.

Abstract

We develop a computational framework for addressing pedigree inference problems using small numbers (80–400) of single nucleotide polymorphisms (SNPs). Our approach relaxes the assumptions, which are commonly made, that sampling is complete with respect to the pedigree and that there is no genotyping error. It relies on representing the inferred pedigree as a factor graph and invoking the Sum-Product algorithm to compute and store quantities that allow the joint probability of the data to be rapidly computed under a large class of rearrangements of the pedigree structure. This allows efficient MCMC sampling over the space of pedigrees, and, hence, Bayesian inference of pedigree structure. In this paper we restrict ourselves to inference of pedigrees without loops using SNPs assumed to be unlinked. We present the methodology in general for multigenerational inference, and we illustrate the method by applying it to the inference of full sibling groups in a large sample (n=1157) of Chinook salmon typed at 95 SNPs. The results show that our method provides a better point estimate and estimate of uncertainty than the currently best-available maximum-likelihood sibling reconstruction method. Extensions of this work to more complex scenarios are briefly discussed.
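As a rough illustration of the factor-graph idea described in the abstract, the sketch below evaluates the probability of observed genotypes for the smallest possible pedigree: one father-mother-child trio at a single biallelic SNP. It is not the authors' implementation. The Hardy-Weinberg founder prior, the symmetric genotyping-error model with rate `eps`, and all function names are assumptions made for the example. An unsampled individual is handled by sending an all-ones message, which is one simple way to relax the complete-sampling assumption.

```python
import math

GENOTYPES = (0, 1, 2)  # number of copies of the counted allele


def hwe_prior(p):
    """Founder genotype prior under Hardy-Weinberg, allele frequency p (assumed model)."""
    q = 1.0 - p
    return [q * q, 2.0 * p * q, p * p]


def transmission(f, m, c):
    """Mendelian factor: P(child genotype c | father genotype f, mother genotype m)."""
    a, b = f / 2.0, m / 2.0  # probability each parent passes on the counted allele
    return [(1 - a) * (1 - b), a * (1 - b) + (1 - a) * b, a * b][c]


def obs_message(y, eps):
    """Message from an observation factor: P(observed y | true g) for g = 0, 1, 2.

    Assumed error model: the true genotype is reported with probability 1 - eps,
    each other genotype with probability eps / 2. An unsampled individual
    (y is None) contributes the all-ones message.
    """
    if y is None:
        return [1.0, 1.0, 1.0]
    return [1.0 - eps if g == y else eps / 2.0 for g in GENOTYPES]


def trio_likelihood(y_f, y_m, y_c, p, eps):
    """Probability of the observed trio genotypes at one SNP.

    Variable-to-factor messages are products of the incoming messages; the
    transmission factor then sums over all latent true genotypes.
    """
    prior = hwe_prior(p)
    mu_f = [prior[g] * o for g, o in zip(GENOTYPES, obs_message(y_f, eps))]
    mu_m = [prior[g] * o for g, o in zip(GENOTYPES, obs_message(y_m, eps))]
    mu_c = obs_message(y_c, eps)
    return sum(
        mu_f[f] * mu_m[m] * mu_c[c] * transmission(f, m, c)
        for f in GENOTYPES for m in GENOTYPES for c in GENOTYPES
    )


def trio_log_likelihood(observations, freqs, eps):
    """Sum per-SNP log-likelihoods, which is valid only for unlinked SNPs."""
    return sum(
        math.log(trio_likelihood(y_f, y_m, y_c, p, eps))
        for (y_f, y_m, y_c), p in zip(observations, freqs)
    )
```

With `eps = 0`, a Mendelian incompatibility such as two parents observed as genotype 0 and a child observed as genotype 2 has probability zero. With a small positive `eps`, the same trio keeps a small positive probability, so a single genotyping error no longer rules out a true relationship. In a larger pedigree the same messages would be cached at each node, so that a proposed rearrangement only requires recomputing the messages it affects.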

Suggested Citation

  • Anderson, Eric C. & Ng, Thomas C., 2016. "Bayesian pedigree inference with small numbers of single nucleotide polymorphisms via a factor-graph representation," Theoretical Population Biology, Elsevier, vol. 107(C), pages 39-51.
  • Handle: RePEc:eee:thpobi:v:107:y:2016:i:c:p:39-51
    DOI: 10.1016/j.tpb.2015.09.005

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0040580915000908
    Download Restriction: Full text for ScienceDirect subscribers only



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Almudevar, Anthony, 2016. "An information theoretic approach to pedigree reconstruction," Theoretical Population Biology, Elsevier, vol. 107(C), pages 52-64.
    2. Sun, M. & Jobling, M.A. & Taliun, D. & Pramstaller, P.P. & Egeland, T. & Sheehan, N.A., 2016. "On the use of dense SNP marker data for the identification of distant relative pairs," Theoretical Population Biology, Elsevier, vol. 107(C), pages 14-25.
    3. Sheehan, Nuala A. & Bartlett, Mark & Cussens, James, 2014. "Improved maximum likelihood reconstruction of complex multi-generational pedigrees," Theoretical Population Biology, Elsevier, vol. 97(C), pages 11-19.
    4. Cowell, Robert G., 2013. "A simple greedy algorithm for reconstructing pedigrees," Theoretical Population Biology, Elsevier, vol. 83(C), pages 55-63.
    5. Fernández de Marcos Giménez de los Galanes, Alberto, 2022. "Data-driven stabilizations of goodness-of-fit tests," DES - Working Papers. Statistics and Econometrics. WS 35324, Universidad Carlos III de Madrid. Departamento de Estadística.
    6. Cindy Frascolla & Guillaume Lecuelle & Pascal Schlich & Hervé Cardot, 2022. "Two sample tests for Semi-Markov processes with parametric sojourn time distributions: an application in sensory analysis," Computational Statistics, Springer, vol. 37(5), pages 2553-2580, November.
    7. Samrachana Adhikari & Tracy Sweet & Brian Junker, 2021. "Analysis of longitudinal advice‐seeking networks following implementation of high stakes testing," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(4), pages 1475-1500, October.
    8. Bill Venables, 2017. "John M. Chambers. Extending R. Boca Raton: CRC Press," Biometrics, The International Biometric Society, vol. 73(2), pages 709-710, June.
    9. Anoek Castelein & Dennis Fok & Richard Paap, 2020. "A multinomial and rank-ordered logit model with inter- and intra-individual heteroscedasticity," Tinbergen Institute Discussion Papers 20-069/III, Tinbergen Institute.
    10. Virginia X. He & Matt P. Wand, 2024. "Bayesian generalized additive model selection including a fast variational option," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 108(3), pages 639-668, September.
    11. Adrien Ickowicz & Jessica Ford & Keith Hayes, 2019. "A Mixture Model Approach for Compositional Data: Inferring Land-Use Influence on Point-Referenced Water Quality Measurements," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 24(4), pages 719-739, December.
    12. Martinetti, Davide & Geniaux, Ghislain, 2017. "Approximate likelihood estimation of spatial probit models," Regional Science and Urban Economics, Elsevier, vol. 64(C), pages 30-45.
    13. Jin, Shaobo & Moustaki, Irini & Yang-Wallentin, Fan, 2018. "Approximated penalized maximum likelihood for exploratory factor analysis: an orthogonal case," LSE Research Online Documents on Economics 88118, London School of Economics and Political Science, LSE Library.
    14. Martina Sundqvist & Julien Chiquet & Guillem Rigaill, 2023. "Adjusting the adjusted Rand Index," Computational Statistics, Springer, vol. 38(1), pages 327-347, March.
    15. Rathelot, Roland, 2014. "Ethnic differentials on the labor market in the presence of asymmetric spatial sorting: Set identification and estimation," Regional Science and Urban Economics, Elsevier, vol. 48(C), pages 154-167.
    16. Roger S. Bivand, 2021. "Progress in the R ecosystem for representing and handling spatial data," Journal of Geographical Systems, Springer, vol. 23(4), pages 515-546, October.
    17. Mirshani, Ardalan & Reimherr, Matthew, 2021. "Adaptive function-on-scalar regression with a smoothing elastic net," Journal of Multivariate Analysis, Elsevier, vol. 185(C).
    18. Helmut Lutkepohl & Fei Shang & Luis Uzeda & Tomasz Woźniak, 2024. "Partial Identification of Heteroskedastic Structural VARs: Theory and Bayesian Inference," Papers 2404.11057, arXiv.org.
    19. Fernández-de-Marcos, Alberto & García-Portugués, Eduardo, 2023. "Data-driven stabilizations of goodness-of-fit tests," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    20. Julien Boelaert, 2013. "A Neural Network Demand System," Documents de travail du Centre d'Economie de la Sorbonne 13081, Université Panthéon-Sorbonne (Paris 1), Centre d'Economie de la Sorbonne.
