IDEAS home Printed from https://ideas.repec.org/a/spr/stmapp/v28y2019i2d10.1007_s10260-018-00441-x.html
   My bibliography  Save this article

Bayesian variable selection in linear regression models with non-normal errors

Author

Listed:
  • Saverio Ranciati

    (University of Bologna)

  • Giuliano Galimberti

    (University of Bologna)

  • Gabriele Soffritti

    (University of Bologna)

Abstract

This paper addresses two crucial issues in multiple linear regression analysis: (i) error terms whose distribution is non-normal because of the presence of asymmetry of the response variable and/or data coming from heterogeneous populations; (ii) selection of the regressors that effectively contribute to explaining patterns in the observations and are relevant for predicting the dependent variable. A solution to the first issue can be obtained through an approach in which the distribution of the error terms is modelled using a finite mixture of Gaussian distributions. In this paper we use this approach to specify a Bayesian linear regression model with non-normal errors; furthermore, by embedding Bayesian variable selection techniques in the specification of the model, we simultaneously perform estimation and variable selection. These tasks are accomplished by sampling from the posterior distributions associated with the model. The performances of the proposed methodology are evaluated through the analysis of simulated datasets in comparison with other approaches. The results of an analysis based on a real dataset are also provided. The methods developed in this paper result to perform well when the distribution of the error terms is characterised by heavy tails, skewness and/or multimodality.

Suggested Citation

  • Saverio Ranciati & Giuliano Galimberti & Gabriele Soffritti, 2019. "Bayesian variable selection in linear regression models with non-normal errors," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 28(2), pages 323-358, June.
  • Handle: RePEc:spr:stmapp:v:28:y:2019:i:2:d:10.1007_s10260-018-00441-x
    DOI: 10.1007/s10260-018-00441-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10260-018-00441-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10260-018-00441-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lee, Kuo-Jung & Chen, Ray-Bing & Wu, Ying Nian, 2016. "Bayesian variable selection for finite mixture model of linear regressions," Computational Statistics & Data Analysis, Elsevier, vol. 95(C), pages 1-16.
    2. Park, Trevor & Casella, George, 2008. "The Bayesian Lasso," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 681-686, June.
    3. Yao, Weixin & Wei, Yan & Yu, Chun, 2014. "Robust mixture regression using the t-distribution," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 116-127.
    4. Fernández, Carmen & Steel, Mark F.J., 2000. "Bayesian Regression Analysis With Scale Mixtures Of Normals," Econometric Theory, Cambridge University Press, vol. 16(1), pages 80-101, February.
    5. Sylvia. Richardson & Peter J. Green, 1997. "On Bayesian Analysis of Mixtures with an Unknown Number of Components (with discussion)," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 59(4), pages 731-792.
    6. Galimberti, Giuliano & Soffritti, Gabriele, 2014. "A multivariate linear regression analysis using finite mixtures of t distributions," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 138-150.
    7. Chib, Siddhartha & Tiwari, Ram C. & Jammalamadaka, S. Rao, 1988. "Bayes prediction in regressions with elliptical errors," Journal of Econometrics, Elsevier, vol. 38(3), pages 349-360, July.
    8. Bartolucci, F. & Scaccia, L., 2005. "The use of mixtures for dealing with non-normal regression errors," Computational Statistics & Data Analysis, Elsevier, vol. 48(4), pages 821-834, April.
    9. Papastamoulis, Panagiotis, 2016. "label.switching: An R Package for Dealing with the Label Switching Problem in MCMC Outputs," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 69(c01).
    10. Simon, Noah & Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2011. "Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 39(i05).
    11. Robert Tibshirani, 2011. "Regression shrinkage and selection via the lasso: a retrospective," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(3), pages 273-282, June.
    12. David J. Spiegelhalter & Nicola G. Best & Bradley P. Carlin & Angelika Van Der Linde, 2002. "Bayesian measures of model complexity and fit," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(4), pages 583-639, October.
    13. Song, Weixing & Yao, Weixin & Xing, Yanru, 2014. "Robust mixture regression model fitting by Laplace distribution," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 128-137.
    14. Francisco J. Rubio & Keming Yu, 2017. "Flexible objective Bayesian linear regression with applications in survival analysis," Journal of Applied Statistics, Taylor & Francis Journals, vol. 44(5), pages 798-810, April.
    15. Khalili, Abbas & Chen, Jiahua, 2007. "Variable Selection in Finite Mixture of Regression Models," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 1025-1038, September.
    16. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    17. Basso, Rodrigo M. & Lachos, Víctor H. & Cabral, Celso Rômulo Barbosa & Ghosh, Pulak, 2010. "Robust mixture modeling based on scale mixtures of skew-normal distributions," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 2926-2941, December.
    18. Adelchi Azzalini & Antonella Capitanio, 2003. "Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t‐distribution," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 65(2), pages 367-389, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mark F. J. Steel, 2020. "Model Averaging and Its Use in Economics," Journal of Economic Literature, American Economic Association, vol. 58(3), pages 644-719, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Naderi, Mehrdad & Mirfarah, Elham & Wang, Wan-Lun & Lin, Tsung-I, 2023. "Robust mixture regression modeling based on the normal mean-variance mixture distributions," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).
    2. Lee, Kuo-Jung & Chen, Ray-Bing & Wu, Ying Nian, 2016. "Bayesian variable selection for finite mixture model of linear regressions," Computational Statistics & Data Analysis, Elsevier, vol. 95(C), pages 1-16.
    3. Hu, Hao & Yao, Weixin & Wu, Yichao, 2017. "The robust EM-type algorithms for log-concave mixtures of regression models," Computational Statistics & Data Analysis, Elsevier, vol. 111(C), pages 14-26.
    4. Wang, Shangshan & Xiang, Liming, 2017. "Two-layer EM algorithm for ALD mixture regression models: A new solution to composite quantile regression," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 136-154.
    5. Sugasawa, Shonosuke & Kobayashi, Genya, 2022. "Robust fitting of mixture models using weighted complete estimating equations," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).
    6. Chun Yu & Weixin Yao & Guangren Yang, 2020. "A Selective Overview and Comparison of Robust Mixture Regression Estimators," International Statistical Review, International Statistical Institute, vol. 88(1), pages 176-202, April.
    7. Lee Anthony & Caron Francois & Doucet Arnaud & Holmes Chris, 2012. "Bayesian Sparsity-Path-Analysis of Genetic Association Signal using Generalized t Priors," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(2), pages 1-31, January.
    8. Atefeh Zarei & Zahra Khodadadi & Mohsen Maleki & Karim Zare, 2023. "Robust mixture regression modeling based on two-piece scale mixtures of normal distributions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 17(1), pages 181-210, March.
    9. Tanin Sirimongkolkasem & Reza Drikvandi, 2019. "On Regularisation Methods for Analysis of High Dimensional Data," Annals of Data Science, Springer, vol. 6(4), pages 737-763, December.
    10. van Erp, Sara & Oberski, Daniel L. & Mulder, Joris, 2018. "Shrinkage priors for Bayesian penalized regression," OSF Preprints cg8fq, Center for Open Science.
    11. Yang, Yu-Chen & Lin, Tsung-I & Castro, Luis M. & Wang, Wan-Lun, 2020. "Extending finite mixtures of t linear mixed-effects models with concomitant covariates," Computational Statistics & Data Analysis, Elsevier, vol. 148(C).
    12. Gustavo Alexis Sabillón & Luiz Gabriel Fernandes Cotrim & Daiane Aparecida Zuanetti, 2023. "A data-driven reversible jump for estimating a finite mixture of regression models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(1), pages 350-369, March.
    13. De la Cruz, Rolando, 2008. "Bayesian non-linear regression models with skew-elliptical errors: Applications to the classification of longitudinal profiles," Computational Statistics & Data Analysis, Elsevier, vol. 53(2), pages 436-449, December.
    14. Ang Shan & Fengkai Yang, 2021. "Bayesian Inference for Finite Mixture Regression Model Based on Non-Iterative Algorithm," Mathematics, MDPI, vol. 9(6), pages 1-13, March.
    15. Nicolas Städler & Peter Bühlmann & Sara Geer, 2010. "ℓ 1 -penalization for mixture regression models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 19(2), pages 209-256, August.
    16. Antonio Punzo & Paul. D. McNicholas, 2017. "Robust Clustering in Regression Analysis via the Contaminated Gaussian Cluster-Weighted Model," Journal of Classification, Springer;The Classification Society, vol. 34(2), pages 249-293, July.
    17. Angelo Mazza & Antonio Punzo, 2020. "Mixtures of multivariate contaminated normal regression models," Statistical Papers, Springer, vol. 61(2), pages 787-822, April.
    18. Marco Berrettini & Giuliano Galimberti & Saverio Ranciati, 2023. "Semiparametric finite mixture of regression models with Bayesian P-splines," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 17(3), pages 745-775, September.
    19. Alhamzawi, Rahim, 2016. "Bayesian model selection in ordinal quantile regression," Computational Statistics & Data Analysis, Elsevier, vol. 103(C), pages 68-78.
    20. Lee, Kuo-Jung & Feldkircher, Martin & Chen, Yi-Chi, 2021. "Variable selection in finite mixture of regression models with an unknown number of components," Computational Statistics & Data Analysis, Elsevier, vol. 158(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stmapp:v:28:y:2019:i:2:d:10.1007_s10260-018-00441-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.