Printed from https://ideas.repec.org/a/eee/csdana/v199y2024ics0167947324001002.html

Three-way data clustering based on the mean-mixture of matrix-variate normal distributions

Author

Listed:
  • Naderi, Mehrdad
  • Tamandi, Mostafa
  • Mirfarah, Elham
  • Wang, Wan-Lun
  • Lin, Tsung-I

Abstract

With the steady growth of computer technologies, the application of statistical techniques to analyze extensive datasets has garnered substantial attention. The analysis of three-way (matrix-variate) data has emerged as a burgeoning field that has inspired statisticians in recent years to develop novel analytical methods. This paper introduces a unified finite mixture model that relies on the mean-mixture of matrix-variate normal distributions. The strength of our proposed model lies in its capability to capture and cluster a wide range of three-way data that exhibit heterogeneous, asymmetric and leptokurtic features. A computationally feasible ECME algorithm is developed to compute the maximum likelihood (ML) estimates. Numerous simulation studies are conducted to investigate the asymptotic properties of the ML estimators, validate the effectiveness of the Bayesian information criterion in selecting the appropriate model, and assess the classification ability in the presence of contaminated noise. The utility of the proposed methodology is demonstrated by analyzing a real-life data example.
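
The abstract names the mean-mixture of matrix-variate normal distributions without stating its form. As a rough illustration only, the sketch below assumes the common mean-mixture construction X = M + W·Λ + Z, where Z is a matrix-variate normal with row covariance Σ and column covariance Ψ, and W is a positive random variable independent of Z. The function names, the choice W ~ Exp(1), and all parameter values are hypothetical and are not taken from the paper.

```python
import numpy as np

def sample_matrix_normal(rng, M, Sigma, Psi):
    """Draw one p x q matrix from MN(M, Sigma, Psi).

    Sigma (p x p) is the row covariance, Psi (q x q) the column covariance.
    """
    A = np.linalg.cholesky(Sigma)      # Sigma = A A^T
    B = np.linalg.cholesky(Psi)        # Psi = B B^T
    G = rng.standard_normal(M.shape)   # i.i.d. N(0, 1) entries
    return M + A @ G @ B.T

def sample_mean_mixture(rng, M, Lam, Sigma, Psi, n):
    """Draw n matrices X = M + W * Lam + Z (assumed representation).

    W ~ Exp(1) is an illustrative mixing distribution; Lam is the
    skewness matrix and Z ~ MN(0, Sigma, Psi) is independent of W.
    """
    W = rng.exponential(1.0, size=n)
    zero = np.zeros(M.shape)
    draws = [M + w * Lam + sample_matrix_normal(rng, zero, Sigma, Psi) for w in W]
    return np.stack(draws)             # shape (n, p, q): a three-way array

rng = np.random.default_rng(0)
M = np.zeros((2, 3))
Lam = np.ones((2, 3))
X = sample_mean_mixture(rng, M, Lam, np.eye(2), np.eye(3), 20000)
print(X.shape)
print(X.mean(axis=0).round(2))
```

Under these assumptions the sample mean should be close to M + E[W]·Λ, and a nonzero Λ makes the entries asymmetric, which is the kind of feature the abstract says the model is meant to capture.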

Suggested Citation

  • Naderi, Mehrdad & Tamandi, Mostafa & Mirfarah, Elham & Wang, Wan-Lun & Lin, Tsung-I, 2024. "Three-way data clustering based on the mean-mixture of matrix-variate normal distributions," Computational Statistics & Data Analysis, Elsevier, vol. 199(C).
  • Handle: RePEc:eee:csdana:v:199:y:2024:i:c:s0167947324001002
    DOI: 10.1016/j.csda.2024.108016

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947324001002
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2024.108016?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    1. Reinaldo B. Arellano-Valle & Marc G. Genton, 2010. "Multivariate extended skew-t distributions and related families," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(3), pages 201-234.
    2. Naderi, Mehrdad & Hung, Wen-Liang & Lin, Tsung-I & Jamalizadeh, Ahad, 2019. "A novel mixture model using the multivariate normal mean–variance mixture of Birnbaum–Saunders distributions and its application to extrasolar planets," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 126-138.
    3. Salvatore D. Tomarchio & Paul D. McNicholas & Antonio Punzo, 2021. "Matrix Normal Cluster-Weighted Models," Journal of Classification, Springer;The Classification Society, vol. 38(3), pages 556-575, October.
    4. Elynn Y. Chen & Ruey S. Tsay & Rong Chen, 2020. "Constrained Factor Models for High-Dimensional Matrix-Variate Time Series," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(530), pages 775-793, April.
    5. Rezaei, Amir & Yousefzadeh, Fatemeh & Arellano-Valle, Reinaldo B., 2020. "Scale and shape mixtures of matrix variate extended skew normal distributions," Journal of Multivariate Analysis, Elsevier, vol. 179(C).
    6. Branco, Márcia D. & Dey, Dipak K., 2001. "A General Class of Multivariate Skew-Elliptical Distributions," Journal of Multivariate Analysis, Elsevier, vol. 79(1), pages 99-113, October.
    7. Giovanni Millo & Gaetano Carmeci, 2011. "Non-life insurance consumption in Italy: a sub-regional panel data analysis," Journal of Geographical Systems, Springer, vol. 13(3), pages 273-298, September.
    8. Sarkar, Shuchismita & Zhu, Xuwen & Melnykov, Volodymyr & Ingrassia, Salvatore, 2020. "On parsimonious models for modeling matrix data," Computational Statistics & Data Analysis, Elsevier, vol. 142(C).
    9. Sylvia. Richardson & Peter J. Green, 1997. "On Bayesian Analysis of Mixtures with an Unknown Number of Components (with discussion)," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 59(4), pages 731-792.
    10. Volodymyr Melnykov & Xuwen Zhu, 2019. "Studying crime trends in the USA over the years 2000–2012," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(1), pages 325-341, March.
    11. Mehrdad Naderi & Andriette Bekker & Mohammad Arashi & Ahad Jamalizadeh, 2020. "A theoretical framework for Landsat data modeling based on the matrix variate mean-mixture of normal model," PLOS ONE, Public Library of Science, vol. 15(4), pages 1-20, April.
    12. Tomarchio, Salvatore D. & Punzo, Antonio & Bagnato, Luca, 2020. "Two new matrix-variate distributions with application in model-based clustering," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
    13. Ma, Xuan & Zhao, Jianhua & Wang, Yue & Shang, Changchun & Jiang, Fen, 2023. "Robust factored principal component analysis for matrix-valued outlier accommodation and detection," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    14. Arellano-Valle, Reinaldo B. & Azzalini, Adelchi & Ferreira, Clécio S. & Santoro, Karol, 2020. "A two-piece normal measurement error model," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    15. Wang, Dong & Liu, Xialu & Chen, Rong, 2019. "Factor models for matrix-valued high-dimensional time series," Journal of Econometrics, Elsevier, vol. 208(1), pages 231-248.
    16. Sharon Lee & Geoffrey McLachlan, 2013. "On mixtures of skew normal and skew $t$-distributions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 7(3), pages 241-266, September.
    17. Dehan Kong & Baiguo An & Jingwen Zhang & Hongtu Zhu, 2020. "L2RM: Low-Rank Linear Regression Models for High-Dimensional Matrix Responses," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(529), pages 403-424, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Salvatore D. Tomarchio & Paul D. McNicholas & Antonio Punzo, 2021. "Matrix Normal Cluster-Weighted Models," Journal of Classification, Springer;The Classification Society, vol. 38(3), pages 556-575, October.
    2. Abbas Mahdavi & Narayanaswamy Balakrishnan & Ahad Jamalizadeh, 2024. "Robust Classification via Finite Mixtures of Matrix Variate Skew-t Distributions," Mathematics, MDPI, vol. 12(20), pages 1-17, October.
    3. Yuefeng Han & Rong Chen & Dan Yang & Cun-Hui Zhang, 2020. "Tensor Factor Model Estimation by Iterative Projection," Papers 2006.02611, arXiv.org, revised Jul 2024.
    4. Azzalini, Adelchi & Browne, Ryan P. & Genton, Marc G. & McNicholas, Paul D., 2016. "On nomenclature for, and the relative merits of, two formulations of skew distributions," Statistics & Probability Letters, Elsevier, vol. 110(C), pages 201-206.
    5. Lee, Sharon X. & McLachlan, Geoffrey J., 2022. "An overview of skew distributions in model-based clustering," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    6. Xialu Liu & John Guerard & Rong Chen & Ruey Tsay, 2024. "Improving Estimation of Portfolio Risk Using New Statistical Factors," Papers 2409.17182, arXiv.org.
    7. Mondal, Sagnik & Genton, Marc G., 2024. "A multivariate skew-normal-Tukey-h distribution," Journal of Multivariate Analysis, Elsevier, vol. 200(C).
    8. Zhaoxing Gao & Ruey S. Tsay, 2020. "A Two-Way Transformed Factor Model for Matrix-Variate Time Series," Papers 2011.09029, arXiv.org.
    9. Lee, Chung Eun & Zhang, Xin, 2024. "Conditional mean dimension reduction for tensor time series," Computational Statistics & Data Analysis, Elsevier, vol. 199(C).
    10. Chang, Jinyuan & Zhang, Henry & Yang, Lin & Yao, Qiwei, 2023. "Modelling matrix time series via a tensor CP-decomposition," LSE Research Online Documents on Economics 117644, London School of Economics and Political Science, LSE Library.
    11. Zinoviy Landsman & Udi Makov & Tomer Shushi, 2017. "Extended Generalized Skew-Elliptical Distributions and their Moments," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 79(1), pages 76-100, February.
    12. Sharon Lee & Geoffrey McLachlan, 2013. "On mixtures of skew normal and skew $t$-distributions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 7(3), pages 241-266, September.
    13. Arellano-Valle, Reinaldo B. & Azzalini, Adelchi, 2021. "A formulation for continuous mixtures of multivariate normal distributions," Journal of Multivariate Analysis, Elsevier, vol. 185(C).
    14. Federico Ferraccioli & Giovanna Menardi, 2023. "Modal clustering of matrix-variate data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 17(2), pages 323-345, June.
    15. Tomarchio, Salvatore D. & Punzo, Antonio & Bagnato, Luca, 2020. "Two new matrix-variate distributions with application in model-based clustering," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
    16. Yin, Chuancun & Balakrishnan, Narayanaswamy, 2024. "Stochastic representations and probabilistic characteristics of multivariate skew-elliptical distributions," Journal of Multivariate Analysis, Elsevier, vol. 199(C).
    17. Alain Hecq & Ivan Ricardo & Ines Wilms, 2024. "Reduced-Rank Matrix Autoregressive Models: A Medium $N$ Approach," Papers 2407.07973, arXiv.org.
    18. McLachlan, Geoffrey J. & Lee, Sharon X., 2016. "Comment on “On nomenclature, and the relative merits of two formulations of skew distributions” by A. Azzalini, R. Browne, M. Genton, and P. McNicholas," Statistics & Probability Letters, Elsevier, vol. 116(C), pages 1-5.
    19. Xuwen Zhu & Yana Melnykov, 2022. "On Finite Mixture Modeling of Change-point Processes," Journal of Classification, Springer;The Classification Society, vol. 39(1), pages 3-22, March.
    20. Wei, Yuhong & Tang, Yang & McNicholas, Paul D., 2019. "Mixtures of generalized hyperbolic distributions and mixtures of skew-t distributions for model-based clustering with incomplete data," Computational Statistics & Data Analysis, Elsevier, vol. 130(C), pages 18-41.
