IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0249604.html
   My bibliography  Save this article

Inference in skew generalized t-link models for clustered binary outcome via a parameter-expanded EM algorithm

Author

Listed:
  • Chénangnon Frédéric Tovissodé
  • Aliou Diop
  • Romain Glèlè Kakaï

Abstract

Binary Generalized Linear Mixed Model (GLMM) is the most common method used by researchers to analyze clustered binary data in biological and social sciences. The traditional approach to GLMMs causes substantial bias in estimates due to steady shape of logistic and normal distribution assumptions thereby resulting into wrong and misleading decisions. This study brings forward an approach governed by skew generalized t distributions that belong to a class of potentially skewed and heavy tailed distributions. Interestingly, both the traditional logistic and probit mixed models, as well as other available methods can be utilized within the skew generalized t-link model (SGTLM) frame. We have taken advantage of the Expectation-Maximization algorithm accelerated via parameter-expansion for model fitting. We evaluated the performance of this approach to GLMMs through a simulation experiment by varying sample size and data distribution. Our findings indicated that the proposed methodology outperforms competing approaches in estimating population parameters and predicting random effects, when the traditional link and normality assumptions are violated. In addition, empirical standard errors and information criteria proved useful for detecting spurious skewness and avoiding complex models for probit data. An application with respiratory infection data points out to the superiority of the SGTLM which turns to be the most adequate model. In future, studies should focus on integrating the demonstrated flexibility in other generalized linear mixed models to enhance robust modeling.

Suggested Citation

  • Chénangnon Frédéric Tovissodé & Aliou Diop & Romain Glèlè Kakaï, 2021. "Inference in skew generalized t-link models for clustered binary outcome via a parameter-expanded EM algorithm," PLOS ONE, Public Library of Science, vol. 16(4), pages 1-31, April.
  • Handle: RePEc:plo:pone00:0249604
    DOI: 10.1371/journal.pone.0249604
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0249604
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0249604&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0249604?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Artur J. Lemonte & Jorge L. Bazán, 2018. "New links for binary regression: an application to coca cultivation in Peru," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(3), pages 597-617, September.
    2. Sungduk Kim & Ming-Hui Chen & Dipak K. Dey, 2008. "Flexible generalized t-link models for binary response data," Biometrika, Biometrika Trust, vol. 95(1), pages 93-106.
    3. Broström, Göran & Holmberg, Henrik, 2011. "Generalized linear models with clustered data: Fixed and random effects models," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3123-3134, December.
    4. Branco, Márcia D. & Dey, Dipak K., 2001. "A General Class of Multivariate Skew-Elliptical Distributions," Journal of Multivariate Analysis, Elsevier, vol. 79(1), pages 99-113, October.
    5. Christian E. Galarza & Tsung-I Lin & Wan-Lun Wang & Víctor H. Lachos, 2021. "On moments of folded and truncated multivariate Student-t distributions based on recurrence relations," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(6), pages 825-850, August.
    6. Agresti, Alan & Caffo, Brian & Ohman-Strickland, Pamela, 2004. "Examples in which misspecification of a random effects distribution reduces efficiency, and possible remedies," Computational Statistics & Data Analysis, Elsevier, vol. 47(3), pages 639-653, October.
    7. Hosseini, Fatemeh & Eidsvik, Jo & Mohammadzadeh, Mohsen, 2011. "Approximate Bayesian inference in spatial GLMM with skew normal latent variables," Computational Statistics & Data Analysis, Elsevier, vol. 55(4), pages 1791-1806, April.
    8. Arellano-Valle, Reinaldo B. & Genton, Marc G., 2005. "On fundamental skew distributions," Journal of Multivariate Analysis, Elsevier, vol. 96(1), pages 93-116, September.
    9. Daniel B. Hall, 2000. "Zero-Inflated Poisson and Binomial Regression with Random Effects: A Case Study," Biometrics, The International Biometric Society, vol. 56(4), pages 1030-1039, December.
    10. Mark B. Stewart, 2004. "Semi-nonparametric estimation of extended ordered probit models," Stata Journal, StataCorp LP, vol. 4(1), pages 27-39, March.
    11. Meza, Cristian & Jaffrézic, Florence & Foulley, Jean-Louis, 2009. "Estimation in the probit normal model for binary outcomes using the SAEM algorithm," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1350-1360, February.
    12. Abanto-Valle, Carlos A. & Dey, Dipak K., 2014. "State space mixed models for binary responses with scale mixture of normal distributions links," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 274-287.
    13. Altemir Silva Braga & Gauss M. Cordeiro & Edwin M. M. Ortega & Giovana O. Silva, 2017. "The Odd Log-Logistic Student t Distribution: Theory and Applications," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 22(4), pages 615-639, December.
    14. Adelchi Azzalini & Antonella Capitanio, 2003. "Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t‐distribution," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 65(2), pages 367-389, May.
    15. Marcos Antonio Alves Pereira & Cibele Maria Russo, 2019. "Nonlinear mixed-effects models with scale mixture of skew-normal distributions," Journal of Applied Statistics, Taylor & Francis Journals, vol. 46(9), pages 1602-1620, July.
    16. R.B. Arellano-Valle & H. Bolfarine & V.H. Lachos, 2007. "Bayesian Inference for Skew-normal Linear Mixed Models," Journal of Applied Statistics, Taylor & Francis Journals, vol. 34(6), pages 663-682.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Azzalini, Adelchi, 2022. "An overview on the progeny of the skew-normal family— A personal perspective," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    2. Reinaldo B. Arellano-Valle & Marc G. Genton, 2010. "Multivariate extended skew-t distributions and related families," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(3), pages 201-234.
    3. Lee, Sharon X. & McLachlan, Geoffrey J., 2022. "An overview of skew distributions in model-based clustering," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    4. Reinaldo B. Arellano-Valle, 2010. "On the information matrix of the multivariate skew-t model," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(3), pages 371-386.
    5. Mohsen Maleki & Darren Wraith & Reinaldo B. Arellano-Valle, 2019. "A flexible class of parametric distributions for Bayesian linear mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(2), pages 543-564, June.
    6. Yin, Chuancun & Balakrishnan, Narayanaswamy, 2024. "Stochastic representations and probabilistic characteristics of multivariate skew-elliptical distributions," Journal of Multivariate Analysis, Elsevier, vol. 199(C).
    7. Kim, Hyoung-Moon & Genton, Marc G., 2011. "Characteristic functions of scale mixtures of multivariate skew-normal distributions," Journal of Multivariate Analysis, Elsevier, vol. 102(7), pages 1105-1117, August.
    8. McLachlan, Geoffrey J. & Lee, Sharon X., 2016. "Comment on “On nomenclature, and the relative merits of two formulations of skew distributions” by A. Azzalini, R. Browne, M. Genton, and P. McNicholas," Statistics & Probability Letters, Elsevier, vol. 116(C), pages 1-5.
    9. C. Adcock, 2010. "Asset pricing and portfolio selection based on the multivariate extended skew-Student-t distribution," Annals of Operations Research, Springer, vol. 176(1), pages 221-234, April.
    10. Hok Shing Kwong & Saralees Nadarajah, 2022. "A New Robust Class of Skew Elliptical Distributions," Methodology and Computing in Applied Probability, Springer, vol. 24(3), pages 1669-1691, September.
    11. Valeriano, Katherine A.L. & Galarza, Christian E. & Matos, Larissa A. & Lachos, Victor H., 2023. "Likelihood-based inference for the multivariate skew-t regression with censored or missing responses," Journal of Multivariate Analysis, Elsevier, vol. 196(C).
    12. M. C. Jones, 2015. "On Families of Distributions with Shape Parameters," International Statistical Review, International Statistical Institute, vol. 83(2), pages 175-192, August.
    13. Kahrari, F. & Rezaei, M. & Yousefzadeh, F. & Arellano-Valle, R.B., 2016. "On the multivariate skew-normal-Cauchy distribution," Statistics & Probability Letters, Elsevier, vol. 117(C), pages 80-88.
    14. Sreenivasa Rao Jammalamadaka & Emanuele Taufer & Gyorgy H. Terdik, 2021. "On Multivariate Skewness and Kurtosis," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(2), pages 607-644, August.
    15. Yangxin Huang & X. Hu & Getachew Dagne, 2014. "Jointly modeling time-to-event and longitudinal data: a Bayesian approach," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 23(1), pages 95-121, March.
    16. Cabral, Celso Rômulo Barbosa & Lachos, Víctor Hugo & Prates, Marcos O., 2012. "Multivariate mixture modeling using skew-normal independent distributions," Computational Statistics & Data Analysis, Elsevier, vol. 56(1), pages 126-142, January.
    17. Fang, B.Q., 2008. "Noncentral matrix quadratic forms of the skew elliptical variables," Journal of Multivariate Analysis, Elsevier, vol. 99(6), pages 1105-1127, July.
    18. Lin, Tsung I. & Ho, Hsiu J. & Chen, Chiang L., 2009. "Analysis of multivariate skew normal models with incomplete data," Journal of Multivariate Analysis, Elsevier, vol. 100(10), pages 2337-2351, November.
    19. Wan-Lun Wang & Tsung-I Lin, 2022. "Robust clustering of multiply censored data via mixtures of t factor analyzers," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(1), pages 22-53, March.
    20. Contreras-Reyes, Javier E., 2014. "Asymptotic form of the Kullback–Leibler divergence for multivariate asymmetric heavy-tailed distributions," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 395(C), pages 200-208.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0249604. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.