IDEAS home Printed from https://ideas.repec.org/a/spr/testjl/v33y2024i3d10.1007_s11749-024-00920-2.html
   My bibliography  Save this article

The orthogonal skew model: computationally efficient multivariate skew-normal and skew-t distributions with applications to model-based clustering

Author

Listed:
  • Ryan P. Browne

    (University of Waterloo)

  • Jeffrey L. Andrews

    (University of British Columbia)

Abstract

We introduce a parameterization for the multivariate skew normal and skew-t distributions, which enforces an orthogonal structure on the skewness parameter. This approach provides substantial benefits in computational efficiency during parameter estimation, resulting in a model which strikes an excellent balance between flexibility and model-fitting feasibility. We illustrate this primarily through implementing the proposed distributions in a mixture model-based clustering framework. We compare to competing skew distributions via both simulated and real data analyses, reporting both computation time and model-fit metrics.

Suggested Citation

  • Ryan P. Browne & Jeffrey L. Andrews, 2024. "The orthogonal skew model: computationally efficient multivariate skew-normal and skew-t distributions with applications to model-based clustering," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 33(3), pages 752-785, September.
  • Handle: RePEc:spr:testjl:v:33:y:2024:i:3:d:10.1007_s11749-024-00920-2
    DOI: 10.1007/s11749-024-00920-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11749-024-00920-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11749-024-00920-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Sharon Lee & Geoffrey McLachlan, 2013. "Model-based clustering and classification with non-normal mixture distributions," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 22(4), pages 427-454, November.
    2. Arellano-Valle, Reinaldo B. & Genton, Marc G., 2005. "On fundamental skew distributions," Journal of Multivariate Analysis, Elsevier, vol. 96(1), pages 93-116, September.
    3. Dankmar Böhning & Ekkehart Dietz & Rainer Schaub & Peter Schlattmann & Bruce Lindsay, 1994. "The distribution of the likelihood ratio for mixtures of densities from the one-parameter exponential family," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 46(2), pages 373-388, June.
    4. Murray, Paula M. & Browne, Ryan P. & McNicholas, Paul D., 2017. "Hidden truncation hyperbolic distributions, finite mixtures thereof, and their application for clustering," Journal of Multivariate Analysis, Elsevier, vol. 161(C), pages 141-156.
    5. R.B. Arellano-Valle & H. Bolfarine & V.H. Lachos, 2007. "Bayesian Inference for Skew-normal Linear Mixed Models," Journal of Applied Statistics, Taylor & Francis Journals, vol. 34(6), pages 663-682.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wan-Lun Wang & Tsung-I Lin, 2023. "Model-based clustering via mixtures of unrestricted skew normal factor analyzers with complete and incomplete data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 32(3), pages 787-817, September.
    2. Lee, Sharon X. & McLachlan, Geoffrey J., 2022. "An overview of skew distributions in model-based clustering," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    3. Murray, Paula M. & Browne, Ryan P. & McNicholas, Paul D., 2017. "Hidden truncation hyperbolic distributions, finite mixtures thereof, and their application for clustering," Journal of Multivariate Analysis, Elsevier, vol. 161(C), pages 141-156.
    4. Reyhaneh Rikhtehgaran & Iraj Kazemi, 2013. "Semi-parametric Bayesian estimation of mixed-effects models using the multivariate skew-normal distribution," Computational Statistics, Springer, vol. 28(5), pages 2007-2027, October.
    5. Wei, Yuhong & Tang, Yang & McNicholas, Paul D., 2019. "Mixtures of generalized hyperbolic distributions and mixtures of skew-t distributions for model-based clustering with incomplete data," Computational Statistics & Data Analysis, Elsevier, vol. 130(C), pages 18-41.
    6. Cristina Tortora & Brian C. Franczak & Ryan P. Browne & Paul D. McNicholas, 2019. "A Mixture of Coalesced Generalized Hyperbolic Distributions," Journal of Classification, Springer;The Classification Society, vol. 36(1), pages 26-57, April.
    7. Hanze Zhang & Yangxin Huang, 2020. "Quantile regression-based Bayesian joint modeling analysis of longitudinal–survival data, with application to an AIDS cohort study," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 26(2), pages 339-368, April.
    8. Azzalini, Adelchi, 2022. "An overview on the progeny of the skew-normal family— A personal perspective," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    9. Yangxin Huang & X. Hu & Getachew Dagne, 2014. "Jointly modeling time-to-event and longitudinal data: a Bayesian approach," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 23(1), pages 95-121, March.
    10. Chénangnon Frédéric Tovissodé & Aliou Diop & Romain Glèlè Kakaï, 2021. "Inference in skew generalized t-link models for clustered binary outcome via a parameter-expanded EM algorithm," PLOS ONE, Public Library of Science, vol. 16(4), pages 1-31, April.
    11. Lee, Sharon X. & McLachlan, Geoffrey J., 2021. "On formulations of skew factor models: Skew factors and/or skew errors," Statistics & Probability Letters, Elsevier, vol. 168(C).
    12. Lin, Tsung I. & Ho, Hsiu J. & Chen, Chiang L., 2009. "Analysis of multivariate skew normal models with incomplete data," Journal of Multivariate Analysis, Elsevier, vol. 100(10), pages 2337-2351, November.
    13. Wan-Lun Wang & Tsung-I Lin, 2022. "Robust clustering of multiply censored data via mixtures of t factor analyzers," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(1), pages 22-53, March.
    14. Paula M. Murray & Ryan P. Browne & Paul D. McNicholas, 2020. "Mixtures of Hidden Truncation Hyperbolic Factor Analyzers," Journal of Classification, Springer;The Classification Society, vol. 37(2), pages 366-379, July.
    15. Murray, Paula M. & Browne, Ryan P. & McNicholas, Paul D., 2017. "A mixture of SDB skew-t factor analyzers," Econometrics and Statistics, Elsevier, vol. 3(C), pages 160-168.
    16. Morris, Katherine & Punzo, Antonio & McNicholas, Paul D. & Browne, Ryan P., 2019. "Asymmetric clusters and outliers: Mixtures of multivariate contaminated shifted asymmetric Laplace distributions," Computational Statistics & Data Analysis, Elsevier, vol. 132(C), pages 145-166.
    17. McLachlan, Geoffrey J. & Lee, Sharon X., 2016. "Comment on “On nomenclature, and the relative merits of two formulations of skew distributions” by A. Azzalini, R. Browne, M. Genton, and P. McNicholas," Statistics & Probability Letters, Elsevier, vol. 116(C), pages 1-5.
    18. Hossein Boojari & Majid Khaledi & Firoozeh Rivaz, 2016. "A non-homogeneous skew-Gaussian Bayesian spatial model," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 25(1), pages 55-73, March.
    19. Murray, Paula M. & Browne, Ryan P. & McNicholas, Paul D., 2019. "Note of Clarification on ‘Hidden truncation hyperbolic distributions, finite mixtures thereof, and their application for clustering’, by Murray, Browne, and McNicholas, J. Multivariate Anal. 161 (2017," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 475-476.
    20. Tu, Wangshu & Browne, Ryan & Subedi, Sanjeena, 2024. "A mixture of logistic skew-normal multinomial models," Computational Statistics & Data Analysis, Elsevier, vol. 196(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:testjl:v:33:y:2024:i:3:d:10.1007_s11749-024-00920-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.