IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v84y2022i5p1751-1784.html
   My bibliography  Save this article

Dimension‐free mixing for high‐dimensional Bayesian variable selection

Author

Listed:
  • Quan Zhou
  • Jun Yang
  • Dootika Vats
  • Gareth O. Roberts
  • Jeffrey S. Rosenthal

Abstract

Yang et al. proved that the symmetric random walk Metropolis–Hastings algorithm for Bayesian variable selection is rapidly mixing under mild high‐dimensional assumptions. We propose a novel Markov chain Monte Carlo (MCMC) sampler using an informed proposal scheme, which we prove achieves a much faster mixing time that is independent of the number of covariates, under the assumptions of Yang et al. To the best of our knowledge, this is the first high‐dimensional result which rigorously shows that the mixing rate of informed MCMC methods can be fast enough to offset the computational cost of local posterior evaluation. Motivated by the theoretical analysis of our sampler, we further propose a new approach called ‘two‐stage drift condition’ to studying convergence rates of Markov chains on general state spaces, which can be useful for obtaining tight complexity bounds in high‐dimensional settings. The practical advantages of our algorithm are illustrated by both simulation studies and real data analysis.

Suggested Citation

  • Quan Zhou & Jun Yang & Dootika Vats & Gareth O. Roberts & Jeffrey S. Rosenthal, 2022. "Dimension‐free mixing for high‐dimensional Bayesian variable selection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(5), pages 1751-1784, November.
  • Handle: RePEc:bla:jorssb:v:84:y:2022:i:5:p:1751-1784
    DOI: 10.1111/rssb.12546
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssb.12546
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssb.12546?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Vivekananda Roy & James P. Hobert, 2007. "Convergence rates and asymptotic standard errors for Markov chain Monte Carlo algorithms for Bayesian probit regression," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(4), pages 607-623, September.
    2. P. J. Brown & M. Vannucci & T. Fearn, 1998. "Multivariate Bayesian variable selection and prediction," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(3), pages 627-641.
    3. Alexandre Bouchard-Côté & Sebastian J. Vollmer & Arnaud Doucet, 2018. "The Bouncy Particle Sampler: A Nonreversible Rejection-Free Markov Chain Monte Carlo Method," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(522), pages 855-867, April.
    4. Henriët. Springelkamp & René Höhn & Aniket Mishra & Pirro G. Hysi & Chiea-Chuen Khor & Stephanie J. Loomis & Jessica N. Cooke Bailey & Jane Gibson & Gudmar Thorleifsson & Sarah F. Janssen & Xiaoyan Lu, 2014. "Meta-analysis of genome-wide association studies identifies novel loci that influence cupping and the glaucomatous process," Nature Communications, Nature, vol. 5(1), pages 1-7, December.
    5. Smith, Michael & Kohn, Robert, 1996. "Nonparametric regression using Bayesian variable selection," Journal of Econometrics, Elsevier, vol. 75(2), pages 317-343, December.
    6. Roberts, G. O. & Tweedie, R. L., 1999. "Bounds on regeneration times and convergence rates for Markov chains," Stochastic Processes and their Applications, Elsevier, vol. 80(2), pages 211-229, April.
    7. Bierkens, Joris & Bouchard-Côté, Alexandre & Doucet, Arnaud & Duncan, Andrew B. & Fearnhead, Paul & Lienart, Thibaut & Roberts, Gareth & Vollmer, Sebastian J., 2018. "Piecewise deterministic Markov processes for scalable Monte Carlo on restricted domains," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 148-154.
    8. J E Griffin & K G Łatuszyński & M F J Steel, 2021. "In search of lost mixing time: adaptive Markov chain Monte Carlo schemes for Bayesian variable selection with very large p," Biometrika, Biometrika Trust, vol. 108(1), pages 53-69.
    9. Michalis K. Titsias & Christopher Yau, 2017. "The Hamming Ball Sampler," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(520), pages 1598-1611, October.
    10. Giacomo Zanella, 2020. "Informed Proposals for Local MCMC in Discrete Spaces," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(530), pages 852-865, April.
    11. Jianqing Fan & Jinchi Lv, 2008. "Sure independence screening for ultrahigh dimensional feature space," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(5), pages 849-911, November.
    12. Gareth O. Roberts & Jeffrey S. Rosenthal, 1998. "Optimal scaling of discrete approximations to Langevin diffusions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(1), pages 255-268.
    13. Dootika Vats & James M Flegal & Galin L Jones, 2019. "Multivariate output analysis for Markov chain Monte Carlo," Biometrika, Biometrika Trust, vol. 106(2), pages 321-337.
    14. Giacomo Zanella & Gareth Roberts, 2019. "Scalable importance tempering and Bayesian variable selection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 81(3), pages 489-517, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gael M. Martin & David T. Frazier & Christian P. Robert, 2020. "Computing Bayes: Bayesian Computation from 1763 to the 21st Century," Monash Econometrics and Business Statistics Working Papers 14/20, Monash University, Department of Econometrics and Business Statistics.
    2. M Ludkin & C Sherlock, 2023. "Hug and hop: a discrete-time, nonreversible Markov chain Monte Carlo algorithm," Biometrika, Biometrika Trust, vol. 110(2), pages 301-318.
    3. Gael M. Martin & David T. Frazier & Christian P. Robert, 2022. "Computing Bayes: From Then `Til Now," Monash Econometrics and Business Statistics Working Papers 14/22, Monash University, Department of Econometrics and Business Statistics.
    4. Riccardo (Jack) Lucchetti & Luca Pedini, 2020. "ParMA: Parallelised Bayesian Model Averaging for Generalised Linear Models," Working Papers 2020:28, Department of Economics, University of Venice "Ca' Foscari".
    5. Gilles Celeux & Mohammed El Anbari & Jean-Michel Marin & Christian P. Robert, 2010. "Regularization in Regression : Comparing Bayesian and Frequentist Methods in a Poorly Informative Situation," Working Papers 2010-43, Center for Research in Economics and Statistics.
    6. Dimitris Korobilis, 2008. "Forecasting in vector autoregressions with many predictors," Advances in Econometrics, in: Bayesian Econometrics, pages 403-431, Emerald Group Publishing Limited.
    7. Min Wang & Xiaoqian Sun & Tao Lu, 2015. "Bayesian structured variable selection in linear regression models," Computational Statistics, Springer, vol. 30(1), pages 205-229, March.
    8. Bertazzi, Andrea & Bierkens, Joris & Dobson, Paul, 2022. "Approximations of Piecewise Deterministic Markov Processes and their convergence properties," Stochastic Processes and their Applications, Elsevier, vol. 154(C), pages 91-153.
    9. Chakraborty, Saptarshi & Bhattacharya, Suman K. & Khare, Kshitij, 2022. "Estimating accuracy of the MCMC variance estimator: Asymptotic normality for batch means estimators," Statistics & Probability Letters, Elsevier, vol. 183(C).
    10. Robert Kohn & Rachida Ouysse, 2007. "Bayesian Variable Selection of Risk Factors in the APT Model," Discussion Papers 2007-32, School of Economics, The University of New South Wales.
    11. Ruggieri, Eric & Lawrence, Charles E., 2012. "On efficient calculations for Bayesian variable selection," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1319-1332.
    12. Nadja Klein & Michael Stanley Smith, 2021. "Bayesian variable selection for non‐Gaussian responses: a marginally calibrated copula approach," Biometrics, The International Biometric Society, vol. 77(3), pages 809-823, September.
    13. Bai, Ray & Ghosh, Malay, 2018. "High-dimensional multivariate posterior consistency under global–local shrinkage priors," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 157-170.
    14. Yang, Jun & Roberts, Gareth O. & Rosenthal, Jeffrey S., 2020. "Optimal scaling of random-walk metropolis algorithms on general target distributions," Stochastic Processes and their Applications, Elsevier, vol. 130(10), pages 6094-6132.
    15. Cai, Bo & Dunson, David B., 2007. "Bayesian Multivariate Isotonic Regression Splines: Applications to Carcinogenicity Studies," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 1158-1171, December.
    16. Samuel Livingstone & Giacomo Zanella, 2022. "The Barker proposal: Combining robustness and efficiency in gradient‐based MCMC," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(2), pages 496-523, April.
    17. Sweata Sen & Damitri Kundu & Kiranmoy Das, 2023. "Variable selection for categorical response: a comparative study," Computational Statistics, Springer, vol. 38(2), pages 809-826, June.
    18. Ouysse, Rachida & Kohn, Robert, 2010. "Bayesian variable selection and model averaging in the arbitrage pricing theory model," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 3249-3268, December.
    19. Mattingly, J. C. & Stuart, A. M. & Higham, D. J., 2002. "Ergodicity for SDEs and approximations: locally Lipschitz vector fields and degenerate noise," Stochastic Processes and their Applications, Elsevier, vol. 101(2), pages 185-232, October.
    20. Murray Pollock & Paul Fearnhead & Adam M. Johansen & Gareth O. Roberts, 2020. "Quasi‐stationary Monte Carlo and the ScaLE algorithm," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(5), pages 1167-1221, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:84:y:2022:i:5:p:1751-1784. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.