IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v56y2012i9p2663-2674.html
   My bibliography  Save this article

Bayesian variable selection for logistic mixed model with nonparametric random effects

Author

Listed:
  • Yang, Mingan

Abstract

In analyzing correlated data or clustered data with linear or logistic mixed effects model, one commonly assumes that the random effects follow a normal distribution with mean zero. However, this assumption might not be appropriate in many cases. In particular, substantial violation of normality assumption might potentially impact the subset selection of variables in these models. In this article, we address the problem of joint selection of both fixed and random effects and bias control for random effects in nonparametric settings. An efficient Bayesian variable selection is implemented using a stochastic search Gibbs sampler to allow both fixed and random effects to be dropped effectively out of the model. The approach is illustrated using a simulation study and a real data example.

Suggested Citation

  • Yang, Mingan, 2012. "Bayesian variable selection for logistic mixed model with nonparametric random effects," Computational Statistics & Data Analysis, Elsevier, vol. 56(9), pages 2663-2674.
  • Handle: RePEc:eee:csdana:v:56:y:2012:i:9:p:2663-2674
    DOI: 10.1016/j.csda.2011.12.014
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S016794731100449X
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2011.12.014?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Omiros Papaspiliopoulos & Gareth O. Roberts, 2008. "Retrospective Markov chain Monte Carlo methods for Dirichlet process hierarchical models," Biometrika, Biometrika Trust, vol. 95(1), pages 169-186.
    2. Ishwaran H. & Takahara G., 2002. "Independent and Identically Distributed Monte Carlo Algorithms for Semiparametric Linear Mixed Models," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 1154-1166, December.
    3. Yang, Mingan & Dunson, David B. & Baird, Donna, 2010. "Semiparametric Bayes hierarchical models with mean and variance constraints," Computational Statistics & Data Analysis, Elsevier, vol. 54(9), pages 2172-2186, September.
    4. Smith, Michael & Kohn, Robert, 1996. "Nonparametric regression using Bayesian variable selection," Journal of Econometrics, Elsevier, vol. 75(2), pages 317-343, December.
    5. J. G. Booth & J. P. Hobert, 1999. "Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo EM algorithm," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(1), pages 265-285.
    6. Zhen Chen & David B. Dunson, 2003. "Random Effects Selection in Linear Mixed Models," Biometrics, The International Biometric Society, vol. 59(4), pages 762-769, December.
    7. Chung, Yeonseung & Dunson, David B., 2009. "Nonparametric Bayes Conditional Distribution Modeling With Variable Selection," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1646-1660.
    8. Satkartar K. Kinney & David B. Dunson, 2007. "Fixed and Random Effects Selection in Linear and Logistic Models," Biometrics, The International Biometric Society, vol. 63(3), pages 690-698, September.
    9. Sean M. O'Brien & David B. Dunson, 2004. "Bayesian Multivariate Logistic Regression," Biometrics, The International Biometric Society, vol. 60(3), pages 739-746, September.
    10. Daowen Zhang & Marie Davidian, 2001. "Linear Mixed Models with Flexible Distributions of Random Effects for Longitudinal Data," Biometrics, The International Biometric Society, vol. 57(3), pages 795-802, September.
    11. Howard D. Bondell & Arun Krishna & Sujit K. Ghosh, 2010. "Joint Variable Selection for Fixed and Random Effects in Linear Mixed-Effects Models," Biometrics, The International Biometric Society, vol. 66(4), pages 1069-1077, December.
    12. Wendimagegn Ghidey & Emmanuel Lesaffre & Paul Eilers, 2004. "Smooth Random Effects Distribution in a Linear Mixed Model," Biometrics, The International Biometric Society, vol. 60(4), pages 945-953, December.
    13. Mingan Yang & David Dunson, 2010. "Bayesian Semiparametric Structural Equation Models with Latent Variables," Psychometrika, Springer;The Psychometric Society, vol. 75(4), pages 675-693, December.
    14. Tze Leung Lai, 2003. "Nonparametric estimation in nonlinear mixed effects models," Biometrika, Biometrika Trust, vol. 90(1), pages 1-13, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mingan Yang, 2020. "Bayesian Mixed Effects Model with Variable Selection," Biostatistics and Biometrics Open Access Journal, Juniper Publishers Inc., vol. 10(2), pages 27-29, August.
    2. Barrientos, Andrés F. & Canale, Antonio, 2021. "A Bayesian goodness-of-fit test for regression," Computational Statistics & Data Analysis, Elsevier, vol. 155(C).
    3. Mingan Yang & Min Wang & Guanghui Dong, 2020. "Bayesian variable selection for mixed effects model with shrinkage prior," Computational Statistics, Springer, vol. 35(1), pages 227-243, March.
    4. Canale, Antonio & Lijoi, Antonio & Nipoti, Bernardo & PrĂ¼nster, Igor, 2023. "Inner spike and slab Bayesian nonparametric models," Econometrics and Statistics, Elsevier, vol. 27(C), pages 120-135.
    5. Anoek Castelein & Dennis Fok & Richard Paap, 2020. "Heterogeneous variable selection in nonlinear panel data models: A semiparametric Bayesian approach," Tinbergen Institute Discussion Papers 20-061/III, Tinbergen Institute.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mingan Yang & Min Wang & Guanghui Dong, 2020. "Bayesian variable selection for mixed effects model with shrinkage prior," Computational Statistics, Springer, vol. 35(1), pages 227-243, March.
    2. Bao, Junshu & Hanson, Timothy E., 2016. "A mean-constrained finite mixture of normals model," Statistics & Probability Letters, Elsevier, vol. 117(C), pages 93-99.
    3. Imori, Shinpei & Rosen, Dietrich von, 2015. "Covariance components selection in high-dimensional growth curve model with random coefficients," Journal of Multivariate Analysis, Elsevier, vol. 136(C), pages 86-94.
    4. Tsai-Hung Fan & Yi-Fu Wang & Yi-Chen Zhang, 2014. "Bayesian model selection in linear mixed effects models with autoregressive(p) errors using mixture priors," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(8), pages 1814-1829, August.
    5. Lee, Kyeong Eun & Kim, Yongku & Xu, Ronghui, 2014. "Bayesian variable selection under the proportional hazards mixed-effects model," Computational Statistics & Data Analysis, Elsevier, vol. 75(C), pages 53-65.
    6. Daniel R. Kowal, 2023. "Subset selection for linear mixed models," Biometrics, The International Biometric Society, vol. 79(3), pages 1853-1867, September.
    7. Armagan, Artin & Dunson, David, 2011. "Sparse variational analysis of linear mixed models for large data sets," Statistics & Probability Letters, Elsevier, vol. 81(8), pages 1056-1062, August.
    8. Fabienne Comte & Adeline Samson, 2012. "Nonparametric estimation of random-effects densities in linear mixed-effects model," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(4), pages 951-975, December.
    9. Peng Zhang & Peter X.-K. Song & Annie Qu & Tom Greene, 2008. "Efficient Estimation for Patient-Specific Rates of Disease Progression Using Nonnormal Linear Mixed Models," Biometrics, The International Biometric Society, vol. 64(1), pages 29-38, March.
    10. Ye, Rendao & Wang, Tonghui & Gupta, Arjun K., 2014. "Distribution of matrix quadratic forms under skew-normal settings," Journal of Multivariate Analysis, Elsevier, vol. 131(C), pages 229-239.
    11. Simona Buscemi & Antonella Plaia, 2020. "Model selection in linear mixed-effect models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 104(4), pages 529-575, December.
    12. Liverani, Silvia & Hastie, David I. & Azizi, Lamiae & Papathomas, Michail & Richardson, Sylvia, 2015. "PReMiuM: An R Package for Profile Regression Mixture Models Using Dirichlet Processes," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 64(i07).
    13. Mojtaba Ganjali & Taban Baghfalaki, 2018. "Application of Penalized Mixed Model in Identification of Genes in Yeast Cell-Cycle Gene Expression Data," Biostatistics and Biometrics Open Access Journal, Juniper Publishers Inc., vol. 6(2), pages 38-41, April.
    14. Reyhaneh Rikhtehgaran & Iraj Kazemi, 2013. "Semi-parametric Bayesian estimation of mixed-effects models using the multivariate skew-normal distribution," Computational Statistics, Springer, vol. 28(5), pages 2007-2027, October.
    15. Rendao Ye & Tonghui Wang & Saowanit Sukparungsee & Arjun Gupta, 2015. "Tests in variance components models under skew-normal settings," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 78(7), pages 885-904, October.
    16. Mingan Yang, 2020. "Bayesian Mixed Effects Model with Variable Selection," Biostatistics and Biometrics Open Access Journal, Juniper Publishers Inc., vol. 10(2), pages 27-29, August.
    17. Huang, Yifan & Meng, Shengwang, 2020. "A Bayesian nonparametric model and its application in insurance loss prediction," Insurance: Mathematics and Economics, Elsevier, vol. 93(C), pages 84-94.
    18. Leonardo Grilli & Carla Rampichini, 2015. "Specification of random effects in multilevel models: a review," Quality & Quantity: International Journal of Methodology, Springer, vol. 49(3), pages 967-976, May.
    19. Mingan Yang & David Dunson, 2010. "Bayesian Semiparametric Structural Equation Models with Latent Variables," Psychometrika, Springer;The Psychometric Society, vol. 75(4), pages 675-693, December.
    20. Zhang, Daowen & Davidian, Marie, 2004. "Likelihood and conditional likelihood inference for generalized additive mixed models for clustered data," Journal of Multivariate Analysis, Elsevier, vol. 91(1), pages 90-106, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:56:y:2012:i:9:p:2663-2674. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.