IDEAS home Printed from https://ideas.repec.org/a/spr/advdac/v16y2022i4d10.1007_s11634-021-00475-2.html
   My bibliography  Save this article

Least-squares bilinear clustering of three-way data

Author

Listed:
  • Pieter C. Schoonees

    (Erasmus University)

  • Patrick J. F. Groenen

    (Erasmus University Rotterdam)

  • Michel Velden

    (Erasmus University Rotterdam)

Abstract

A least-squares bilinear clustering framework for modelling three-way data, where each observation consists of an ordinary two-way matrix, is introduced. The method combines bilinear decompositions of the two-way matrices with clustering over observations. Different clusterings are defined for each part of the bilinear decomposition, which decomposes the matrix-valued observations into overall means, row margins, column margins and row–column interactions. Therefore up to four different classifications are defined jointly, one for each type of effect. The computational burden is greatly reduced by the orthogonality of the bilinear model, such that the joint clustering problem reduces to separate problems which can be handled independently. Three of these sub-problems are specific cases of k-means clustering; a special algorithm is formulated for the row–column interactions, which are displayed in clusterwise biplots. The method is illustrated via an empirical example and interpreting the interaction biplots are discussed. Supplemental materials for this paper are available online, which includes the dedicated R package, lsbclust.

Suggested Citation

  • Pieter C. Schoonees & Patrick J. F. Groenen & Michel Velden, 2022. "Least-squares bilinear clustering of three-way data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(4), pages 1001-1037, December.
  • Handle: RePEc:spr:advdac:v:16:y:2022:i:4:d:10.1007_s11634-021-00475-2
    DOI: 10.1007/s11634-021-00475-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11634-021-00475-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11634-021-00475-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hardy, Andre, 1996. "On the number of clusters," Computational Statistics & Data Analysis, Elsevier, vol. 23(1), pages 83-96, November.
    2. Pieter Kroonenberg & Jan Leeuw, 1980. "Principal component analysis of three-mode data by means of alternating least squares algorithms," Psychometrika, Springer;The Psychometric Society, vol. 45(1), pages 69-97, March.
    3. J. Carroll & Jih-Jie Chang, 1970. "Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition," Psychometrika, Springer;The Psychometric Society, vol. 35(3), pages 283-319, September.
    4. Kaye Basford & Geoffrey McLachlan, 1985. "The mixture method of clustering applied to three-way data," Journal of Classification, Springer;The Classification Society, vol. 2(1), pages 109-125, December.
    5. Wayne S. DeSarbo & J. Douglas Carroll & Donald R. Lehmann & John O'Shaughnessy, 1982. "Three-Way Multivariate Conjoint Analysis," Marketing Science, INFORMS, vol. 1(4), pages 323-350.
    6. Ledyard Tucker, 1966. "Some mathematical notes on three-mode factor analysis," Psychometrika, Springer;The Psychometric Society, vol. 31(3), pages 279-311, September.
    7. Glenn Milligan & Martha Cooper, 1985. "An examination of procedures for determining the number of clusters in a data set," Psychometrika, Springer;The Psychometric Society, vol. 50(2), pages 159-179, June.
    8. Pieter Schoonees & Michel Velden & Patrick Groenen, 2015. "Constrained Dual Scaling for Detecting Response Styles in Categorical Data," Psychometrika, Springer;The Psychometric Society, vol. 80(4), pages 968-994, December.
    9. Lawrence Hubert & Phipps Arabie, 1985. "Comparing partitions," Journal of Classification, Springer;The Classification Society, vol. 2(1), pages 193-218, December.
    10. Vermunt, Jeroen K., 2007. "A hierarchical mixture model for clustering three-way data sets," Computational Statistics & Data Analysis, Elsevier, vol. 51(11), pages 5368-5376, July.
    11. Roberto Rocci & Maurizio Vichi, 2005. "Three-Mode Component Analysis with Crisp or Fuzzy Partition of Units," Psychometrika, Springer;The Psychometric Society, vol. 70(4), pages 715-736, December.
    12. J. Gower, 1975. "Generalized procrustes analysis," Psychometrika, Springer;The Psychometric Society, vol. 40(1), pages 33-51, March.
    13. Glenn Milligan, 1980. "An examination of the effect of six types of error perturbation on fifteen clustering algorithms," Psychometrika, Springer;The Psychometric Society, vol. 45(3), pages 325-342, September.
    14. Carl Eckart & Gale Young, 1936. "The approximation of one matrix by another of lower rank," Psychometrika, Springer;The Psychometric Society, vol. 1(3), pages 211-218, September.
    15. Tammo Bijmolt & Michel Velden, 2012. "Multiattribute perceptual mapping with idiosyncratic brand and attribute sets," Marketing Letters, Springer, vol. 23(3), pages 585-601, September.
    16. Maurizio Vichi & Roberto Rocci & Henk A.L. Kiers, 2007. "Simultaneous Component and Clustering Models for Three-way Data: Within and Between Approaches," Journal of Classification, Springer;The Classification Society, vol. 24(1), pages 71-98, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Arteaga Flórez, Andrea Lorena & De la Rosa Salazar, Diego Marcel, 2023. "Factores competitivos en el sector empresarial marroquinero. Caso: pymes marroquineras departamento de Nariño," Revista Tendencias, Universidad de Narino, vol. 24(2), pages 86-111, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Donatella Vicari & Paolo Giordani, 2023. "CPclus: Candecomp/Parafac Clustering Model for Three-Way Data," Journal of Classification, Springer;The Classification Society, vol. 40(2), pages 432-465, July.
    2. Naoto Yamashita & Shin-ichi Mayekawa, 2015. "A new biplot procedure with joint classification of objects and variables by fuzzy c-means clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 9(3), pages 243-266, September.
    3. Paolo Giordani & Roberto Rocci & Giuseppe Bove, 2020. "Factor Uniqueness of the Structural Parafac Model," Psychometrika, Springer;The Psychometric Society, vol. 85(3), pages 555-574, September.
    4. Alwin Stegeman & Tam Lam, 2014. "Three-Mode Factor Analysis by Means of Candecomp/Parafac," Psychometrika, Springer;The Psychometric Society, vol. 79(3), pages 426-443, July.
    5. Alwin Stegeman, 2018. "Simultaneous Component Analysis by Means of Tucker3," Psychometrika, Springer;The Psychometric Society, vol. 83(1), pages 21-47, March.
    6. Schoonees, P.C. & Groenen, P.J.F. & van de Velden, M., 2015. "Least-squares Bilinear Clustering of Three-way Data," Econometric Institute Research Papers EI2014-23, Erasmus University Rotterdam, Erasmus School of Economics (ESE), Econometric Institute.
    7. Mariela González-Narváez & María José Fernández-Gómez & Susana Mendes & José-Luis Molina & Omar Ruiz-Barzola & Purificación Galindo-Villardón, 2021. "Study of Temporal Variations in Species–Environment Association through an Innovative Multivariate Method: MixSTATICO," Sustainability, MDPI, vol. 13(11), pages 1-25, May.
    8. Henk Kiers, 1991. "Hierarchical relations among three-way methods," Psychometrika, Springer;The Psychometric Society, vol. 56(3), pages 449-470, September.
    9. Li, Pai-Ling & Chiou, Jeng-Min, 2011. "Identifying cluster number for subspace projected functional data clustering," Computational Statistics & Data Analysis, Elsevier, vol. 55(6), pages 2090-2103, June.
    10. Willem Kloot & Pieter Kroonenberg, 1985. "External analysis with three-mode principal component models," Psychometrika, Springer;The Psychometric Society, vol. 50(4), pages 479-494, December.
    11. Elisa Frutos-Bernal & Ángel Martín del Rey & Irene Mariñas-Collado & María Teresa Santos-Martín, 2022. "An Analysis of Travel Patterns in Barcelona Metro Using Tucker3 Decomposition," Mathematics, MDPI, vol. 10(7), pages 1-17, March.
    12. Yoshio Takane & Forrest Young & Jan Leeuw, 1977. "Nonmetric individual differences multidimensional scaling: An alternating least squares method with optimal scaling features," Psychometrika, Springer;The Psychometric Society, vol. 42(1), pages 7-67, March.
    13. Giuseppe Brandi & Ruggero Gramatica & Tiziana Di Matteo, 2019. "Unveil stock correlation via a new tensor-based decomposition method," Papers 1911.06126, arXiv.org, revised Apr 2020.
    14. Wilderjans, Tom & Ceulemans, Eva & Van Mechelen, Iven, 2009. "Simultaneous analysis of coupled data blocks differing in size: A comparison of two weighting schemes," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1086-1098, February.
    15. Modroño Herrán, Juan Ignacio & Fernández Aguirre, María Carmen & Landaluce Calvo, M. Isabel, 2003. "Una propuesta para el análisis de tablas múltiples," BILTOKI 1134-8984, Universidad del País Vasco - Departamento de Economía Aplicada III (Econometría y Estadística).
    16. Michael Brusco & Douglas Steinley, 2015. "Affinity Propagation and Uncapacitated Facility Location Problems," Journal of Classification, Springer;The Classification Society, vol. 32(3), pages 443-480, October.
    17. Federico Ferraccioli & Giovanna Menardi, 2023. "Modal clustering of matrix-variate data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 17(2), pages 323-345, June.
    18. Richard Sands & Forrest Young, 1980. "Component models for three-way data: An alternating least squares algorithm with optimal scaling features," Psychometrika, Springer;The Psychometric Society, vol. 45(1), pages 39-67, March.
    19. Michel Velden & Tammo Bijmolt, 2006. "Generalized canonical correlation analysis of matrices with missing rows: a simulation study," Psychometrika, Springer;The Psychometric Society, vol. 71(2), pages 323-331, June.
    20. Zhiyuan Zhang & Chen Ling & Hongjin He & Liqun Qi, 2024. "A tensor train approach for internet traffic data completion," Annals of Operations Research, Springer, vol. 339(3), pages 1461-1479, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:advdac:v:16:y:2022:i:4:d:10.1007_s11634-021-00475-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.