IDEAS home Printed from https://ideas.repec.org/a/taf/jnlasa/v111y2016i516p1591-1607.html

Accelerating Asymptotically Exact MCMC for Computationally Intensive Models via Local Approximations

Authors

Listed:
  • Patrick R. Conrad
  • Youssef M. Marzouk
  • Natesh S. Pillai
  • Aaron Smith

Abstract

We construct a new framework for accelerating Markov chain Monte Carlo in posterior sampling problems where standard methods are limited by the computational cost of the likelihood, or of numerical models embedded therein. Our approach introduces local approximations of these models into the Metropolis–Hastings kernel, borrowing ideas from deterministic approximation theory, optimization, and experimental design. Previous efforts at integrating approximate models into inference typically sacrifice either the sampler’s exactness or efficiency; our work seeks to address these limitations by exploiting useful convergence characteristics of local approximations. We prove the ergodicity of our approximate Markov chain, showing that it samples asymptotically from the exact posterior distribution of interest. We describe variations of the algorithm that employ either local polynomial approximations or local Gaussian process regressors. Our theoretical results reinforce the key observation underlying this article: when the likelihood has some local regularity, the number of model evaluations per Markov chain Monte Carlo (MCMC) step can be greatly reduced without biasing the Monte Carlo average. Numerical experiments demonstrate multiple order-of-magnitude reductions in the number of forward model evaluations used in representative ordinary differential equation (ODE) and partial differential equation (PDE) inference problems, with both synthetic and real data. Supplementary materials for this article are available online.
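The idea in the abstract can be illustrated with a deliberately simplified sketch. It is not the authors' algorithm: it works in one dimension, uses a nearest-neighbour radius test in place of a proper error indicator, and always refines at the queried point rather than choosing a design point. All names here (`LocalSurrogate`, `la_mh`, `k`, `radius`) and the `1/(t+1)` refinement schedule are assumptions made for illustration, and nothing in this sketch guarantees asymptotic exactness. It shows only how a Metropolis–Hastings step can query a local quadratic fit built from cached model evaluations, calling the expensive model again only when the cache is too sparse nearby or when a decaying random trigger fires.

```python
import numpy as np


class LocalSurrogate:
    """Local quadratic approximation of an expensive 1-D log-density.

    Hypothetical helper for illustration; not taken from the article.
    """

    def __init__(self, f, design, k=6, radius=0.5):
        self.f = f                      # expensive log-posterior
        self.k = k                      # number of neighbours per local fit
        self.radius = radius            # refine if k-th neighbour is farther
        self.X = [float(x) for x in design]
        self.y = [f(x) for x in self.X]

    @property
    def n_evals(self):
        """Number of expensive evaluations performed so far."""
        return len(self.X)

    def _fit(self, theta):
        """Least-squares quadratic through the k nearest cached points."""
        X, y = np.asarray(self.X), np.asarray(self.y)
        dist = np.abs(X - theta)
        idx = np.argsort(dist)[: self.k]
        # Columns are (x - theta)^2, (x - theta), 1, so the last
        # coefficient is the fitted value at theta itself.
        A = np.vander(X[idx] - theta, 3)
        coef, *_ = np.linalg.lstsq(A, y[idx], rcond=None)
        return coef[-1], dist[idx].max()

    def __call__(self, theta, force_refine=False):
        theta = float(theta)
        value, r = self._fit(theta)
        if force_refine or r > self.radius:
            if theta not in self.X:     # avoid duplicate cache entries
                self.X.append(theta)
                self.y.append(self.f(theta))
            value, _ = self._fit(theta)
        return value


def la_mh(log_post, theta0, n_steps, step=1.0, k=6, radius=0.5, seed=0):
    """Random-walk Metropolis-Hastings driven by the local surrogate."""
    rng = np.random.default_rng(seed)
    design = theta0 + step * np.linspace(-1.0, 1.0, k)  # small initial design
    surrogate = LocalSurrogate(log_post, design, k=k, radius=radius)
    chain = np.empty(n_steps)
    theta = float(theta0)
    for t in range(n_steps):
        prop = theta + step * rng.normal()
        # Decaying random refinement (assumed schedule, for illustration).
        refine = rng.random() < 1.0 / (t + 1)
        lp_cur = surrogate(theta, refine)
        lp_prop = surrogate(prop, refine)
        if np.log(rng.random()) < lp_prop - lp_cur:
            theta = prop
        chain[t] = theta
    return chain, surrogate.n_evals


if __name__ == "__main__":
    # Stand-in "expensive" model: a standard normal log-density.
    chain, n_evals = la_mh(lambda th: -0.5 * th**2, 0.0, 5000)
    print("posterior mean estimate:", chain.mean())
    print("expensive evaluations:", n_evals, "for", len(chain), "steps")
```

With a smooth target such as the Gaussian stand-in above, the cache stops growing once each region the chain visits holds enough points, so the count of expensive evaluations ends up far below the number of MCMC steps.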

Suggested Citation

  • Patrick R. Conrad & Youssef M. Marzouk & Natesh S. Pillai & Aaron Smith, 2016. "Accelerating Asymptotically Exact MCMC for Computationally Intensive Models via Local Approximations," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1591-1607, October.
  • Handle: RePEc:taf:jnlasa:v:111:y:2016:i:516:p:1591-1607
    DOI: 10.1080/01621459.2015.1096787

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/01621459.2015.1096787
    Download Restriction: Access to full text is restricted to subscribers.



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matthias Katzfuss & Joseph Guinness & Wenlong Gong & Daniel Zilber, 2020. "Vecchia Approximations of Gaussian-Process Predictions," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 25(3), pages 383-414, September.
    2. Vanslette, Kevin & Tohme, Tony & Youcef-Toumi, Kamal, 2020. "A general model validation and testing tool," Reliability Engineering and System Safety, Elsevier, vol. 195(C).
    3. Moreno Bevilacqua & Alfredo Alegria & Daira Velandia & Emilio Porcu, 2016. "Composite Likelihood Inference for Multivariate Gaussian Random Fields," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 21(3), pages 448-469, September.
    4. Jakub Bijak & Jason D. Hilton & Eric Silverman & Viet Dung Cao, 2013. "Reforging the Wedding Ring," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 29(27), pages 729-766.
    5. Hao Wu & Michael Browne, 2015. "Random Model Discrepancy: Interpretations and Technicalities (A Rejoinder)," Psychometrika, Springer;The Psychometric Society, vol. 80(3), pages 619-624, September.
    6. Villez, Kris & Del Giudice, Dario & Neumann, Marc B. & Rieckermann, Jörg, 2020. "Accounting for erroneous model structures in biokinetic process models," Reliability Engineering and System Safety, Elsevier, vol. 203(C).
    7. Yan Chen & Youran Qi & Qing Liu & Peter Chien, 2018. "Sequential sampling enhanced composite likelihood approach to estimation of social intercorrelations in large-scale networks," Quantitative Marketing and Economics (QME), Springer, vol. 16(4), pages 409-440, December.
    8. Xiaoyu Xiong & Benjamin D. Youngman & Theodoros Economou, 2021. "Data fusion with Gaussian processes for estimation of environmental hazard events," Environmetrics, John Wiley & Sons, Ltd., vol. 32(3), May.
    9. Avraham E Mayo & Yaakov Setty & Seagull Shavit & Alon Zaslaver & Uri Alon, 2006. "Plasticity of the cis-Regulatory Input Function of a Gene," PLOS Biology, Public Library of Science, vol. 4(4), pages 1-1, March.
    10. Petropoulos, G. & Wooster, M.J. & Carlson, T.N. & Kennedy, M.C. & Scholze, M., 2009. "A global Bayesian sensitivity analysis of the 1d SimSphere soil–vegetation–atmospheric transfer (SVAT) model using Gaussian model emulation," Ecological Modelling, Elsevier, vol. 220(19), pages 2427-2440.
    11. David Breitenmoser & Francesco Cerutti & Gernot Butterweck & Malgorzata Magdalena Kasprzak & Sabine Mayer, 2023. "Emulator-based Bayesian inference on non-proportional scintillation models by compton-edge probing," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    12. Caamaño-Carrillo, Christian & Bevilacqua, Moreno & López, Cristian & Morales-Oñate, Víctor, 2024. "Nearest neighbors weighted composite likelihood based on pairs for (non-)Gaussian massive spatial data with an application to Tukey-hh random fields estimation," Computational Statistics & Data Analysis, Elsevier, vol. 191(C).
    13. Drignei, Dorin, 2011. "A general statistical model for computer experiments with time series output," Reliability Engineering and System Safety, Elsevier, vol. 96(4), pages 460-467.
    14. Yuan, Jun & Nian, Victor & Su, Bin & Meng, Qun, 2017. "A simultaneous calibration and parameter ranking method for building energy models," Applied Energy, Elsevier, vol. 206(C), pages 657-666.
    15. Tomas Tokar & Jozef Ulicny, 2013. "The Mathematical Model of the Bcl-2 Family Mediated MOMP Regulation Can Perform a Non-Trivial Pattern Recognition," PLOS ONE, Public Library of Science, vol. 8(12), pages 1-8, December.
    16. Barde, Sylvain, 2024. "Bayesian estimation of large-scale simulation models with Gaussian process regression surrogates," Computational Statistics & Data Analysis, Elsevier, vol. 196(C).
    17. Gross, Eitan, 2015. "Effect of environmental stress on regulation of gene expression in the yeast," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 430(C), pages 224-235.
    18. Hwang, Youngdeok & Kim, Hang J. & Chang, Won & Yeo, Kyongmin & Kim, Yongku, 2019. "Bayesian pollution source identification via an inverse physics model," Computational Statistics & Data Analysis, Elsevier, vol. 134(C), pages 76-92.
    19. Choi, Wonjun & Menberg, Kathrin & Kikumoto, Hideki & Heo, Yeonsook & Choudhary, Ruchi & Ooka, Ryozo, 2018. "Bayesian inference of structural error in inverse models of thermal response tests," Applied Energy, Elsevier, vol. 228(C), pages 1473-1485.
    20. Yuan, Jun & Ng, Szu Hui, 2013. "A sequential approach for stochastic computer model calibration and prediction," Reliability Engineering and System Safety, Elsevier, vol. 111(C), pages 273-286.
