IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v117y2013icp175-192.html
   My bibliography  Save this article

Bayesian regression based on principal components for high-dimensional data

Author

Listed:
  • Lee, Jaeyong
  • Oh, Hee-Seok

Abstract

The Gaussian sequence model can be obtained from the high-dimensional regression model through principal component analysis. It is shown that the Gaussian sequence model is equivalent to the original high-dimensional regression model in terms of prediction. Under a sparsity condition, we investigate the posterior consistency and convergence rates of the Gaussian sequence model. In particular, we examine two different modeling strategies: Bayesian inference with and without covariate selection. For Bayesian inferences without covariate selection, we obtain the consistency results of the estimators and posteriors with normal priors with constant and decreasing variances, and the James–Stein estimator; for Bayesian inference with covariate selection, we obtain convergence rates of Bayesian model averaging (BMA) and median probability model (MPM) estimators, and the posterior with variable selection prior. Based on these results, we conclude that variable selection is essential in high-dimensional Bayesian regression. A simulation study also confirms the conclusion. The methodologies are applied to a climate prediction problem.

Suggested Citation

  • Lee, Jaeyong & Oh, Hee-Seok, 2013. "Bayesian regression based on principal components for high-dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 117(C), pages 175-192.
  • Handle: RePEc:eee:jmvana:v:117:y:2013:i:c:p:175-192
    DOI: 10.1016/j.jmva.2013.02.002
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X1300016X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2013.02.002?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. P. J. Lenk, 1999. "Bayesian inference for semiparametric regression using a Fourier representation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(4), pages 863-879.
    2. Smith, Richard L. & Tebaldi, Claudia & Nychka, Doug & Mearns, Linda O., 2009. "Bayesian Modeling of Uncertainty in Ensembles of Climate Models," Journal of the American Statistical Association, American Statistical Association, vol. 104(485), pages 97-116.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. De Luca, Giuseppe & Magnus, Jan R. & Peracchi, Franco, 2022. "Sampling properties of the Bayesian posterior mean with an application to WALS estimation," Journal of Econometrics, Elsevier, vol. 230(2), pages 299-317.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. S. Lorenz & S. Dessai & J. Paavola & P. Forster, 2015. "The communication of physical science uncertainty in European National Adaptation Strategies," Climatic Change, Springer, vol. 132(1), pages 143-155, September.
    2. Seongil Jo & Taeyoung Roh & Taeryon Choi, 2016. "Bayesian spectral analysis models for quantile regression with Dirichlet process mixtures," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 28(1), pages 177-206, March.
    3. DeCanio, Stephen J. & Manski, Charles F. & Sanstad, Alan H., 2022. "Minimax-regret climate policy with deep uncertainty in climate modeling and intergenerational discounting," Ecological Economics, Elsevier, vol. 201(C).
    4. Esther Salazar & Dorit Hammerling & Xia Wang & Bruno Sansó & Andrew O. Finley & Linda O. Mearns, 2016. "Observation-based blended projections from ensembles of regional climate models," Climatic Change, Springer, vol. 138(1), pages 55-69, September.
    5. Garnett P. McMillan & Timothy E. Hanson & Gabrielle Saunders & Frederick J. Gallun, 2013. "A two-component circular regression model for repeated measures auditory localization data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 62(4), pages 515-534, August.
    6. Sanaz Moghim & Mohammad Sina Jahangir, 2022. "Reliability framework for characterizing heat wave and cold spell events," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 112(2), pages 1503-1525, June.
    7. Christoph M. Buser & Hans R. Künsch & Alain Weber, 2010. "Biases and Uncertainty in Climate Projections," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 37(2), pages 179-199, June.
    8. Andrew Finley & Sudipto Banerjee & Alan Gelfand, 2012. "Bayesian dynamic modeling for large space-time datasets using Gaussian predictive processes," Journal of Geographical Systems, Springer, vol. 14(1), pages 29-47, January.
    9. Alexandra M. Schmidt & Marco A. Rodríguez, 2022. "Discussion on “A combined estimate of global temperature”," Environmetrics, John Wiley & Sons, Ltd., vol. 33(3), May.
    10. Howard H. Chang & Jingwen Zhou & Montserrat Fuentes, 2010. "Impact of Climate Change on Ambient Ozone Level and Mortality in Southeastern United States," IJERPH, MDPI, vol. 7(7), pages 1-15, July.
    11. Hongchang Hu & Yu Zhang & Xiong Pan, 2016. "Asymptotic normality of DHD estimators in a partially linear model," Statistical Papers, Springer, vol. 57(3), pages 567-587, September.
    12. Soumen Dey & Mohan Delampady & Ravishankar Parameshwaran & N. Samba Kumar & Arjun Srivathsa & K. Ullas Karanth, 2017. "Bayesian Methods for Estimating Animal Abundance at Large Spatial Scales Using Data from Multiple Sources," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 22(2), pages 111-139, June.
    13. Kleiber, William & Nychka, Douglas, 2012. "Nonstationary modeling for multivariate spatial processes," Journal of Multivariate Analysis, Elsevier, vol. 112(C), pages 76-91.
    14. Choi, Taeryon & Lee, Jaeyong & Roy, Anindya, 2009. "A note on the Bayes factor in a semiparametric regression model," Journal of Multivariate Analysis, Elsevier, vol. 100(6), pages 1316-1327, July.
    15. Paciorek, Christopher J., 2007. "Computational techniques for spatial logistic regression with large data sets," Computational Statistics & Data Analysis, Elsevier, vol. 51(8), pages 3631-3653, May.
    16. J. Refsgaard & K. Arnbjerg-Nielsen & M. Drews & K. Halsnæs & E. Jeppesen & H. Madsen & A. Markandya & J. Olesen & J. Porter & J. Christensen, 2013. "The role of uncertainty in climate change adaptation strategies—A Danish water management example," Mitigation and Adaptation Strategies for Global Change, Springer, vol. 18(3), pages 337-359, March.
    17. Timothy J. Gilbride & Peter J. Lenk & Jeff D. Brazell, 2008. "Market Share Constraints and the Loss Function in Choice-Based Conjoint Analysis," Marketing Science, INFORMS, vol. 27(6), pages 995-1011, 11-12.
    18. Villamizar, Rodrigo & Villamizar-Villegas, Mauricio & Arango, Lucia & Castelblanco, Geraldine, 2021. "Sustainability as a Policy Tool," Working papers 82, Red Investigadores de Economía.
    19. Jean-Yves Dauxois & Pierre Druilhet & Denys Pommeret, 2006. "A bayesian choice between poisson, binomial and negative binomial models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 15(2), pages 423-432, September.
    20. Tom Lindström & Michael Tildesley & Colleen Webb, 2015. "A Bayesian Ensemble Approach for Epidemiological Projections," PLOS Computational Biology, Public Library of Science, vol. 11(4), pages 1-30, April.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:117:y:2013:i:c:p:175-192. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.