The robust EM-type algorithms for log-concave mixtures of regression models

Author

Listed:
  • Hu, Hao
  • Yao, Weixin
  • Wu, Yichao

Abstract

Finite mixture of regression (FMR) models can be reformulated as incomplete-data problems and estimated via the expectation–maximization (EM) algorithm. The main drawback is the strong parametric assumption they require, such as normally distributed residuals. The estimates may be biased if the model is misspecified. To relax the parametric assumption on the component error densities, a new method is proposed that estimates the mixture regression parameters assuming only that the components have log-concave error densities, without specifying the parametric family.
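For orientation, the parametric baseline mentioned in the abstract can be sketched as a standard EM loop for an FMR model with normal errors. This is a minimal illustration under stated assumptions, not the authors' log-concave procedure: the function name `em_mixture_regression`, the random initialization, and the fixed iteration count are all hypothetical choices made here.

```python
import numpy as np

def em_mixture_regression(X, y, k=2, n_iter=200, seed=0):
    """EM for a k-component mixture of linear regressions with normal errors.

    Illustrative baseline only; the normal error density is exactly the
    parametric assumption that a log-concave approach would relax.
    """
    rng = np.random.default_rng(seed)
    n, p = X.shape
    # Random soft assignments to start; each row sums to one.
    resp = rng.dirichlet(np.ones(k), size=n)
    beta = np.zeros((k, p))
    sigma = np.ones(k)
    pi = np.full(k, 1.0 / k)
    for _ in range(n_iter):
        # M-step: weighted least squares, error scale, and mixing proportion.
        for j in range(k):
            w = resp[:, j]
            Xw = X * w[:, None]
            beta[j] = np.linalg.solve(X.T @ Xw + 1e-8 * np.eye(p), Xw.T @ y)
            r = y - X @ beta[j]
            sigma[j] = np.sqrt((w * r**2).sum() / w.sum()) + 1e-8
            pi[j] = w.mean()
        # E-step: posterior component probabilities, computed on the log scale.
        resid = y[:, None] - X @ beta.T  # shape (n, k)
        log_dens = (-0.5 * (resid / sigma) ** 2
                    - np.log(sigma) - 0.5 * np.log(2 * np.pi))
        log_post = np.log(pi) + log_dens
        log_post -= log_post.max(axis=1, keepdims=True)
        resp = np.exp(log_post)
        resp /= resp.sum(axis=1, keepdims=True)
    return pi, beta, sigma, resp
```

Here `X` is assumed to be an n-by-p design matrix that already includes an intercept column. EM can converge to a local maximum, so in practice several random starts (different `seed` values) would usually be compared.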

Suggested Citation

  • Hu, Hao & Yao, Weixin & Wu, Yichao, 2017. "The robust EM-type algorithms for log-concave mixtures of regression models," Computational Statistics & Data Analysis, Elsevier, vol. 111(C), pages 14-26.
  • Handle: RePEc:eee:csdana:v:111:y:2017:i:c:p:14-26
    DOI: 10.1016/j.csda.2017.01.004


    Citations

    Cited by:

    1. Sijia Xiang & Weixin Yao, 2020. "Semiparametric mixtures of regressions with single-index for model based clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 14(2), pages 261-292, June.
    2. Mirfarah, Elham & Naderi, Mehrdad & Chen, Ding-Geng, 2021. "Mixture of linear experts model for censored data: A novel approach with scale-mixture of normal distributions," Computational Statistics & Data Analysis, Elsevier, vol. 158(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Antonio Punzo & Paul. D. McNicholas, 2017. "Robust Clustering in Regression Analysis via the Contaminated Gaussian Cluster-Weighted Model," Journal of Classification, Springer;The Classification Society, vol. 34(2), pages 249-293, July.
    2. Angelo Mazza & Antonio Punzo, 2020. "Mixtures of multivariate contaminated normal regression models," Statistical Papers, Springer, vol. 61(2), pages 787-822, April.
    3. Naderi, Mehrdad & Mirfarah, Elham & Wang, Wan-Lun & Lin, Tsung-I, 2023. "Robust mixture regression modeling based on the normal mean-variance mixture distributions," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).
    4. Chun Yu & Weixin Yao & Guangren Yang, 2020. "A Selective Overview and Comparison of Robust Mixture Regression Estimators," International Statistical Review, International Statistical Institute, vol. 88(1), pages 176-202, April.
    5. Hu, Hao & Wu, Yichao & Yao, Weixin, 2016. "Maximum likelihood estimation of the mixture of log-concave densities," Computational Statistics & Data Analysis, Elsevier, vol. 101(C), pages 137-147.
    6. Yang, Yu-Chen & Lin, Tsung-I & Castro, Luis M. & Wang, Wan-Lun, 2020. "Extending finite mixtures of t linear mixed-effects models with concomitant covariates," Computational Statistics & Data Analysis, Elsevier, vol. 148(C).
    7. Nguyen, Hien D. & McLachlan, Geoffrey J., 2016. "Laplace mixture of linear experts," Computational Statistics & Data Analysis, Elsevier, vol. 93(C), pages 177-191.
    8. Sangkon Oh & Byungtae Seo, 2023. "Merging Components in Linear Gaussian Cluster-Weighted Models," Journal of Classification, Springer;The Classification Society, vol. 40(1), pages 25-51, April.
    9. Xiaoqiong Fang & Andy W. Chen & Derek S. Young, 2023. "Predictors with measurement error in mixtures of polynomial regressions," Computational Statistics, Springer, vol. 38(1), pages 373-401, March.
    10. Yao, Weixin & Wei, Yan & Yu, Chun, 2014. "Robust mixture regression using the t-distribution," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 116-127.
    11. Wu, Qiang & Yao, Weixin, 2016. "Mixtures of quantile regressions," Computational Statistics & Data Analysis, Elsevier, vol. 93(C), pages 162-176.
    12. Ang Shan & Fengkai Yang, 2021. "Bayesian Inference for Finite Mixture Regression Model Based on Non-Iterative Algorithm," Mathematics, MDPI, vol. 9(6), pages 1-13, March.
    13. Sugasawa, Shonosuke & Kobayashi, Genya, 2022. "Robust fitting of mixture models using weighted complete estimating equations," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).
    14. Meng Li & Sijia Xiang & Weixin Yao, 2016. "Robust estimation of the number of components for mixtures of linear regression models," Computational Statistics, Springer, vol. 31(4), pages 1539-1555, December.
    15. Bai, Xiuqin & Yao, Weixin & Boyer, John E., 2012. "Robust fitting of mixture regression models," Computational Statistics & Data Analysis, Elsevier, vol. 56(7), pages 2347-2359.
    16. Saverio Ranciati & Giuliano Galimberti & Gabriele Soffritti, 2019. "Bayesian variable selection in linear regression models with non-normal errors," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 28(2), pages 323-358, June.
    17. Xue, Jiacheng & Yao, Weixin, 2022. "Machine Learning Embedded Semiparametric Mixtures of Regressions with Covariate-Varying Mixing Proportions," Econometrics and Statistics, Elsevier, vol. 22(C), pages 159-171.
    18. Atefeh Zarei & Zahra Khodadadi & Mohsen Maleki & Karim Zare, 2023. "Robust mixture regression modeling based on two-piece scale mixtures of normal distributions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 17(1), pages 181-210, March.
    19. Diani, Cecilia & Galimberti, Giuliano & Soffritti, Gabriele, 2022. "Multivariate cluster-weighted models based on seemingly unrelated linear regression," Computational Statistics & Data Analysis, Elsevier, vol. 171(C).
    20. Salvatore Ingrassia & Antonio Punzo, 2020. "Cluster Validation for Mixtures of Regressions via the Total Sum of Squares Decomposition," Journal of Classification, Springer;The Classification Society, vol. 37(2), pages 526-547, July.
