IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v53y2009i12p4028-4045.html
   My bibliography  Save this article

Estimating Bayes factors via thermodynamic integration and population MCMC

Author

Listed:
  • Calderhead, Ben
  • Girolami, Mark

Abstract

A Bayesian approach to model comparison based on the integrated or marginal likelihood is considered, and applications to linear regression models and nonlinear ordinary differential equation (ODE) models are used as the setting in which to elucidate and further develop existing statistical methodology. The focus is on two methods of marginal likelihood estimation. First, a statistical failure of the widely employed Posterior Harmonic Mean estimator is highlighted. It is demonstrated that there is a systematic bias capable of significantly skewing Bayes factor estimates, which has not previously been highlighted in the literature. Second, a detailed study of the recently proposed Thermodynamic Integral estimator is presented, which characterises the error associated with its discrete form. An experimental study using analytically tractable linear regression models highlights substantial differences with recently published results regarding optimal discretisation. Finally, with the insights gained, it is demonstrated how Population MCMC and thermodynamic integration methods may be elegantly combined to estimate Bayes factors accurately enough to discriminate between nonlinear models based on systems of ODEs, which has important application in describing the behaviour of complex processes arising in a wide variety of research areas, such as Systems Biology, Computational Ecology and Chemical Engineering.

Suggested Citation

  • Calderhead, Ben & Girolami, Mark, 2009. "Estimating Bayes factors via thermodynamic integration and population MCMC," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 4028-4045, October.
  • Handle: RePEc:eee:csdana:v:53:y:2009:i:12:p:4028-4045
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(09)00272-2
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Pierre Del Moral & Arnaud Doucet & Ajay Jasra, 2006. "Sequential Monte Carlo samplers," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(3), pages 411-436, June.
    2. N. Friel & A. N. Pettitt, 2008. "Marginal likelihood estimation via power posteriors," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(3), pages 589-607, July.
    3. El Adlouni, Salaheddine & Favre, Anne-Catherine & Bobee, Bernard, 2006. "Comparison of methodologies to assess the convergence of Markov chain Monte Carlo methods," Computational Statistics & Data Analysis, Elsevier, vol. 50(10), pages 2685-2701, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Chris J. Oates & Mark Girolami & Nicolas Chopin, 2017. "Control functionals for Monte Carlo integration," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(3), pages 695-718, June.
    2. Luigi Spezia & Andy Vinten & Roberta Paroli & Marc Stutter, 2021. "An evolutionary Monte Carlo method for the analysis of turbidity high‐frequency time series through Markov switching autoregressive models," Environmetrics, John Wiley & Sons, Ltd., vol. 32(8), December.
    3. Marco Grzegorczyk & Andrej Aderhold & Dirk Husmeier, 2017. "Targeting Bayes factors with direct-path non-equilibrium thermodynamic integration," Computational Statistics, Springer, vol. 32(2), pages 717-761, June.
    4. Spezia, Luigi, 2020. "Bayesian variable selection in non-homogeneous hidden Markov models through an evolutionary Monte Carlo method," Computational Statistics & Data Analysis, Elsevier, vol. 143(C).
    5. Filippone, Maurizio & Sanguinetti, Guido, 2011. "Approximate inference of the bandwidth in multivariate kernel density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3104-3122, December.
    6. Will Penny & Biswa Sengupta, 2016. "Annealed Importance Sampling for Neural Mass Models," PLOS Computational Biology, Public Library of Science, vol. 12(3), pages 1-25, March.
    7. Birgir Hrafnkelsson & Helgi Sigurdarson & Sölvi Rögnvaldsson & Axel Örn Jansson & Rafael Daníel Vias & Sigurdur M. Gardarsson, 2022. "Generalization of the power‐law rating curve using hydrodynamic theory and Bayesian hierarchical modeling," Environmetrics, John Wiley & Sons, Ltd., vol. 33(2), March.
    8. Golchi, Shirin & Campbell, David A., 2016. "Sequentially Constrained Monte Carlo," Computational Statistics & Data Analysis, Elsevier, vol. 97(C), pages 98-113.
    9. Liu Baisen & Wang Liangliang & Cao Jiguo, 2018. "Bayesian estimation of ordinary differential equation models when the likelihood has multiple local modes," Monte Carlo Methods and Applications, De Gruyter, vol. 24(2), pages 117-127, June.
    10. Rigat, F. & Mira, A., 2012. "Parallel hierarchical sampling: A general-purpose interacting Markov chains Monte Carlo algorithm," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1450-1467.
    11. Loza-Reyes, E. & Hurn, M.A. & Robinson, A., 2014. "Classification of molecular sequence data using Bayesian phylogenetic mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 75(C), pages 81-95.
    12. Eduardo A Aponte & Dario Schöbi & Klaas E Stephan & Jakob Heinzle, 2017. "The Stochastic Early Reaction, Inhibition, and late Action (SERIA) model for antisaccades," PLOS Computational Biology, Public Library of Science, vol. 13(8), pages 1-36, August.
    13. Zhou, Yan, 2015. "vSMC: Parallel Sequential Monte Carlo in C++," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 62(i09).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. repec:dau:papers:123456789/5724 is not listed on IDEAS
    2. Drovandi, Christopher C. & McGree, James M. & Pettitt, Anthony N., 2013. "Sequential Monte Carlo for Bayesian sequentially designed experiments for discrete data," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 320-335.
    3. Luca Martino & Fernando Llorente & Ernesto Curbelo & Javier López-Santiago & Joaquín Míguez, 2021. "Automatic Tempered Posterior Distributions for Bayesian Inversion Problems," Mathematics, MDPI, vol. 9(7), pages 1-17, April.
    4. Perrakis, Konstantinos & Ntzoufras, Ioannis & Tsionas, Efthymios G., 2014. "On the use of marginal posteriors in marginal likelihood estimation via importance sampling," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 54-69.
    5. Lefebvre, Geneviève & Steele, Russell & Vandal, Alain C., 2010. "A path sampling identity for computing the Kullback-Leibler and J divergences," Computational Statistics & Data Analysis, Elsevier, vol. 54(7), pages 1719-1731, July.
    6. Mathias Drton & Martyn Plummer, 2017. "A Bayesian information criterion for singular models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(2), pages 323-380, March.
    7. Xing Ju Lee & Christopher C. Drovandi & Anthony N. Pettitt, 2015. "Model choice problems using approximate Bayesian computation with applications to pathogen transmission data sets," Biometrics, The International Biometric Society, vol. 71(1), pages 198-207, March.
    8. Jeong Eun Lee & Christian Robert, 2013. "Imortance Sampling Schemes for Evidence Approximation in Mixture Models," Working Papers 2013-42, Center for Research in Economics and Statistics.
    9. James Martin & Ajay Jasra & Emma McCoy, 2013. "Inference for a class of partially observed point process models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 65(3), pages 413-437, June.
    10. Will Penny & Biswa Sengupta, 2016. "Annealed Importance Sampling for Neural Mass Models," PLOS Computational Biology, Public Library of Science, vol. 12(3), pages 1-25, March.
    11. Spezia, L. & Cooksley, S.L. & Brewer, M.J. & Donnelly, D. & Tree, A., 2014. "Modelling species abundance in a river by Negative Binomial hidden Markov models," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 599-614.
    12. Vitoratou, Silia & Ntzoufras, Ioannis & Moustaki, Irini, 2016. "Explaining the behavior of joint and marginal Monte Carlo estimators in latent variable models with independence assumptions," LSE Research Online Documents on Economics 57685, London School of Economics and Political Science, LSE Library.
    13. Lee Anthony & Caron Francois & Doucet Arnaud & Holmes Chris, 2012. "Bayesian Sparsity-Path-Analysis of Genetic Association Signal using Generalized t Priors," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(2), pages 1-31, January.
    14. Xiong, Yingge & Tobias, Justin L. & Mannering, Fred L., 2014. "The analysis of vehicle crash injury-severity data: A Markov switching approach with road-segment heterogeneity," Transportation Research Part B: Methodological, Elsevier, vol. 67(C), pages 109-128.
    15. Nguyen, Hoang & Virbickaitė, Audronė, 2023. "Modeling stock-oil co-dependence with Dynamic Stochastic MIDAS Copula models," Energy Economics, Elsevier, vol. 124(C).
    16. Naoki Awaya & Yasuhiro Omori, 2021. "Particle Rolling MCMC with Double-Block Sampling ," CIRJE F-Series CIRJE-F-1175, CIRJE, Faculty of Economics, University of Tokyo.
    17. Xiaohong Chen & Timothy M. Christensen & Elie Tamer, 2018. "Monte Carlo Confidence Sets for Identified Sets," Econometrica, Econometric Society, vol. 86(6), pages 1965-2018, November.
    18. AWLP Thilan & P Menéndez & JM McGree, 2023. "Assessing the ability of adaptive designs to capture trends in hard coral cover," Environmetrics, John Wiley & Sons, Ltd., vol. 34(6), September.
    19. Dufays, Arnaud & Rombouts, Jeroen V.K., 2020. "Relevant parameter changes in structural break models," Journal of Econometrics, Elsevier, vol. 217(1), pages 46-78.
    20. Nicolas Chopin & Mathieu Gerber, 2017. "Sequential quasi-Monte Carlo: Introduction for Non-Experts, Dimension Reduction, Application to Partly Observed Diffusion Processes," Working Papers 2017-35, Center for Research in Economics and Statistics.
    21. Joshua C. C. Chan & Liana Jacobi & Dan Zhu, 2022. "An automated prior robustness analysis in Bayesian model comparison," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(3), pages 583-602, April.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:53:y:2009:i:12:p:4028-4045. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.