Printed from https://ideas.repec.org/a/eee/csdana/v199y2024ics0167947324001002.html

Three-way data clustering based on the mean-mixture of matrix-variate normal distributions

Author

Listed:
  • Naderi, Mehrdad
  • Tamandi, Mostafa
  • Mirfarah, Elham
  • Wang, Wan-Lun
  • Lin, Tsung-I

Abstract

With the steady growth of computer technologies, the application of statistical techniques to analyze extensive datasets has garnered substantial attention. The analysis of three-way (matrix-variate) data has emerged as a burgeoning field that has inspired statisticians in recent years to develop novel analytical methods. This paper introduces a unified finite mixture model that relies on the mean-mixture of matrix-variate normal distributions. The strength of our proposed model lies in its capability to capture and cluster a wide range of three-way data that exhibit heterogeneous, asymmetric and leptokurtic features. A computationally feasible ECME algorithm is developed to compute the maximum likelihood (ML) estimates. Numerous simulation studies are conducted to investigate the asymptotic properties of the ML estimators, validate the effectiveness of the Bayesian information criterion in selecting the appropriate model, and assess the classification ability in the presence of contaminated noise. The utility of the proposed methodology is demonstrated by analyzing a real-life data example.
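
The abstract does not spell out the model, so the following is only an illustrative sketch, not the authors' method or their ECME algorithm. It assumes the stochastic representation commonly used for mean-mixture of normal families, Y = M + W·Λ + Z, where Z is matrix-variate normal with row covariance Σ and column covariance Ψ, and W is a positive mixing variable. The half-normal choice for W, the function names `sample_mmn_matrix` and `sample_mixture`, and the parameter dictionary layout are all assumptions made for this example.

```python
import numpy as np

def sample_mmn_matrix(n, M, Lam, Sigma, Psi, rng):
    """Draw n matrices Y = M + W * Lam + Z (assumed representation).

    Z ~ matrix normal(0, Sigma, Psi); W ~ half-normal is an illustrative
    choice of mixing variable, not taken from the abstract.
    """
    p, q = M.shape
    A = np.linalg.cholesky(Sigma)          # Sigma = A A^T, row covariance (p x p)
    B = np.linalg.cholesky(Psi)            # Psi = B B^T, column covariance (q x q)
    X = rng.standard_normal((n, p, q))     # i.i.d. standard normal entries
    Z = A @ X @ B.T                        # each slice is matrix normal(0, Sigma, Psi)
    W = np.abs(rng.standard_normal(n))     # positive mixing variable, one per matrix
    return M + W[:, None, None] * Lam + Z  # Lam shifts the mean and induces skewness

def sample_mixture(n, weights, params, rng):
    """Draw n matrices from a finite mixture with the given component weights."""
    labels = rng.choice(len(weights), size=n, p=weights)
    p, q = params[0]["M"].shape
    Y = np.empty((n, p, q))
    for g, par in enumerate(params):
        idx = np.flatnonzero(labels == g)  # observations assigned to component g
        Y[idx] = sample_mmn_matrix(
            len(idx), par["M"], par["Lam"], par["Sigma"], par["Psi"], rng
        )
    return Y, labels

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    params = [
        {"M": np.zeros((2, 3)), "Lam": np.full((2, 3), 2.0),
         "Sigma": np.eye(2), "Psi": np.eye(3)},
        {"M": np.full((2, 3), 6.0), "Lam": np.zeros((2, 3)),
         "Sigma": np.eye(2), "Psi": np.eye(3)},
    ]
    Y, labels = sample_mixture(500, [0.4, 0.6], params, rng)
    print(Y.shape, np.bincount(labels, minlength=2))
```

Under these assumptions, a component with nonzero Λ has mean M + E[W]·Λ and is asymmetric, while Λ = 0 reduces it to the ordinary matrix-variate normal; this is one way to picture the "heterogeneous, asymmetric" clusters the abstract mentions.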

Suggested Citation

  • Naderi, Mehrdad & Tamandi, Mostafa & Mirfarah, Elham & Wang, Wan-Lun & Lin, Tsung-I, 2024. "Three-way data clustering based on the mean-mixture of matrix-variate normal distributions," Computational Statistics & Data Analysis, Elsevier, vol. 199(C).
  • Handle: RePEc:eee:csdana:v:199:y:2024:i:c:s0167947324001002
    DOI: 10.1016/j.csda.2024.108016

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947324001002
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2024.108016?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    References listed on IDEAS

    1. Reinaldo B. Arellano-Valle & Marc G. Genton, 2010. "Multivariate extended skew-t distributions and related families," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(3), pages 201-234.
    2. Naderi, Mehrdad & Hung, Wen-Liang & Lin, Tsung-I & Jamalizadeh, Ahad, 2019. "A novel mixture model using the multivariate normal mean–variance mixture of Birnbaum–Saunders distributions and its application to extrasolar planets," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 126-138.
    3. Salvatore D. Tomarchio & Paul D. McNicholas & Antonio Punzo, 2021. "Matrix Normal Cluster-Weighted Models," Journal of Classification, Springer;The Classification Society, vol. 38(3), pages 556-575, October.
    4. Elynn Y. Chen & Ruey S. Tsay & Rong Chen, 2020. "Constrained Factor Models for High-Dimensional Matrix-Variate Time Series," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(530), pages 775-793, April.
    5. Rezaei, Amir & Yousefzadeh, Fatemeh & Arellano-Valle, Reinaldo B., 2020. "Scale and shape mixtures of matrix variate extended skew normal distributions," Journal of Multivariate Analysis, Elsevier, vol. 179(C).
    6. Branco, Márcia D. & Dey, Dipak K., 2001. "A General Class of Multivariate Skew-Elliptical Distributions," Journal of Multivariate Analysis, Elsevier, vol. 79(1), pages 99-113, October.
    7. Giovanni Millo & Gaetano Carmeci, 2011. "Non-life insurance consumption in Italy: a sub-regional panel data analysis," Journal of Geographical Systems, Springer, vol. 13(3), pages 273-298, September.
    8. Sarkar, Shuchismita & Zhu, Xuwen & Melnykov, Volodymyr & Ingrassia, Salvatore, 2020. "On parsimonious models for modeling matrix data," Computational Statistics & Data Analysis, Elsevier, vol. 142(C).
    9. Sylvia Richardson & Peter J. Green, 1997. "On Bayesian Analysis of Mixtures with an Unknown Number of Components (with discussion)," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 59(4), pages 731-792.
    10. Volodymyr Melnykov & Xuwen Zhu, 2019. "Studying crime trends in the USA over the years 2000–2012," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(1), pages 325-341, March.
    11. Mehrdad Naderi & Andriette Bekker & Mohammad Arashi & Ahad Jamalizadeh, 2020. "A theoretical framework for Landsat data modeling based on the matrix variate mean-mixture of normal model," PLOS ONE, Public Library of Science, vol. 15(4), pages 1-20, April.
    12. Tomarchio, Salvatore D. & Punzo, Antonio & Bagnato, Luca, 2020. "Two new matrix-variate distributions with application in model-based clustering," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
    13. Ma, Xuan & Zhao, Jianhua & Wang, Yue & Shang, Changchun & Jiang, Fen, 2023. "Robust factored principal component analysis for matrix-valued outlier accommodation and detection," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    14. Arellano-Valle, Reinaldo B. & Azzalini, Adelchi & Ferreira, Clécio S. & Santoro, Karol, 2020. "A two-piece normal measurement error model," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    15. Wang, Dong & Liu, Xialu & Chen, Rong, 2019. "Factor models for matrix-valued high-dimensional time series," Journal of Econometrics, Elsevier, vol. 208(1), pages 231-248.
    16. Sharon Lee & Geoffrey McLachlan, 2013. "On mixtures of skew normal and skew t-distributions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 7(3), pages 241-266, September.
    17. Dehan Kong & Baiguo An & Jingwen Zhang & Hongtu Zhu, 2020. "L2RM: Low-Rank Linear Regression Models for High-Dimensional Matrix Responses," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(529), pages 403-424, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Salvatore D. Tomarchio & Paul D. McNicholas & Antonio Punzo, 2021. "Matrix Normal Cluster-Weighted Models," Journal of Classification, Springer;The Classification Society, vol. 38(3), pages 556-575, October.
    2. Abbas Mahdavi & Narayanaswamy Balakrishnan & Ahad Jamalizadeh, 2024. "Robust Classification via Finite Mixtures of Matrix Variate Skew-t Distributions," Mathematics, MDPI, vol. 12(20), pages 1-17, October.
    3. Yuefeng Han & Rong Chen & Dan Yang & Cun-Hui Zhang, 2020. "Tensor Factor Model Estimation by Iterative Projection," Papers 2006.02611, arXiv.org, revised Jul 2024.
    4. Sharon Lee & Geoffrey McLachlan, 2013. "On mixtures of skew normal and skew t-distributions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 7(3), pages 241-266, September.
    5. Arellano-Valle, Reinaldo B. & Azzalini, Adelchi, 2021. "A formulation for continuous mixtures of multivariate normal distributions," Journal of Multivariate Analysis, Elsevier, vol. 185(C).
    6. Tomarchio, Salvatore D. & Punzo, Antonio & Bagnato, Luca, 2020. "Two new matrix-variate distributions with application in model-based clustering," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
    7. Yin, Chuancun & Balakrishnan, Narayanaswamy, 2024. "Stochastic representations and probabilistic characteristics of multivariate skew-elliptical distributions," Journal of Multivariate Analysis, Elsevier, vol. 199(C).
    8. Wei, Yuhong & Tang, Yang & McNicholas, Paul D., 2019. "Mixtures of generalized hyperbolic distributions and mixtures of skew-t distributions for model-based clustering with incomplete data," Computational Statistics & Data Analysis, Elsevier, vol. 130(C), pages 18-41.
    9. He, Yong & Kong, Xinbing & Trapani, Lorenzo & Yu, Long, 2023. "One-way or two-way factor model for matrix sequences?," Journal of Econometrics, Elsevier, vol. 235(2), pages 1981-2004.
    10. Cheng Yu & Dong Li & Feiyu Jiang & Ke Zhu, 2023. "Matrix GARCH Model: Inference and Application," Papers 2306.05169, arXiv.org.
    11. Azzalini, Adelchi, 2022. "An overview on the progeny of the skew-normal family— A personal perspective," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    12. Li, Yan & Gao, Zhigen & Huang, Wei & Guo, Jianhua, 2023. "Matrix-variate data analysis by two-way factor model with replicated observations," Statistics & Probability Letters, Elsevier, vol. 202(C).
    13. Ying Lun Cheung, 2024. "Identification of matrix-valued factor models," Economics Bulletin, AccessEcon, vol. 44(2), pages 550-556.
    14. Kim, Hyoung-Moon & Ryu, Duchwan & Mallick, Bani K. & Genton, Marc G., 2014. "Mixtures of skewed Kalman filters," Journal of Multivariate Analysis, Elsevier, vol. 123(C), pages 228-251.
    15. Murray, Paula M. & Browne, Ryan P. & McNicholas, Paul D., 2014. "Mixtures of skew-t factor analyzers," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 326-335.
    16. Tsung-I Lin & Pal Wu & Geoffrey McLachlan & Sharon Lee, 2015. "A robust factor analysis model using the restricted skew-t distribution," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(3), pages 510-531, September.
    17. Hashemi, Farzane & Naderi, Mehrdad & Jamalizadeh, Ahad & Bekker, Andriette, 2021. "A flexible factor analysis based on the class of mean-mixture of normal distributions," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    18. Naderi, Mehrdad & Mirfarah, Elham & Wang, Wan-Lun & Lin, Tsung-I, 2023. "Robust mixture regression modeling based on the normal mean-variance mixture distributions," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).
    19. Mauro Bernardi & Roy Cerqueti & Arsen Palestini, 2020. "The Skew Normal multivariate risk measurement framework," Computational Management Science, Springer, vol. 17(1), pages 105-119, January.
    20. Azzalini, Adelchi & Browne, Ryan P. & Genton, Marc G. & McNicholas, Paul D., 2016. "On nomenclature for, and the relative merits of, two formulations of skew distributions," Statistics & Probability Letters, Elsevier, vol. 110(C), pages 201-206.
