IDEAS home Printed from https://ideas.repec.org/a/spr/stmapp/v33y2024i2d10.1007_s10260-023-00743-9.html
   My bibliography  Save this article

The parsimonious Gaussian mixture models with partitioned parameters and their application in clustering

Author

Listed:
  • Niloofar Aslani Akhore Olyaei

    (Shahid Beheshti University)

  • Mojtaba Khazaei

    (Shahid Beheshti University)

  • Dariush Najarzadeh

    (University of Tabriz
    University of Tabriz)

Abstract

Cluster analysis is a method that identifies similar groups of data without any prior knowledge of the relevant groups. One of the most widely used clustering methods is model-based clustering, in which data clustering is performed by fitting a probabilistic model to the data. Mixture of Gaussian distributions is a commonly used model in model-based clustering. Unfortunately, the number of covariance matrices parameters rapidly increases by increasing the number of variables or components in these models. So far, various classes of the parsimonious Gaussian mixture models, by applying various constraints on the covariance matrices, have been introduced to solve this problem. Unfortunately, the number of models in each of these classes is so small such that in practice it does not allow the study and selection of models with any number of parameters, which can vary between the minimum number (one parameter) and the maximum number (no constraints model) of parameters. In this paper, to deal with this problem a family of the parsimonious Gaussian mixture models is introduced. This is done by identifying and determining the appropriate partitions of the variances and correlation coefficients between variables among clusters. We call these models “the parsimonious Gaussian mixture models with partitioned parameters". The generalized Expectation-Conditional Maximization algorithm, by employing the Fisher scoring method within the algorithm, is used to compute the maximum likelihood estimates of parameters. Bayesian information criterion is used for comparing and selecting the best model. Also, the steepest ascent method is adapted to search the best model. Finally, performances of these models are examined on two real datasets and a brief simulation study.

Suggested Citation

  • Niloofar Aslani Akhore Olyaei & Mojtaba Khazaei & Dariush Najarzadeh, 2024. "The parsimonious Gaussian mixture models with partitioned parameters and their application in clustering," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 33(2), pages 407-437, April.
  • Handle: RePEc:spr:stmapp:v:33:y:2024:i:2:d:10.1007_s10260-023-00743-9
    DOI: 10.1007/s10260-023-00743-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10260-023-00743-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10260-023-00743-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stmapp:v:33:y:2024:i:2:d:10.1007_s10260-023-00743-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.