
Class enumeration false positive in skew-t family of continuous growth mixture models

Author

Listed:
  • Kiero Guerra-Peña
  • Zoilo Emilio García-Batista
  • Sarah Depaoli
  • Luis Eduardo Garrido

Abstract

Growth Mixture Modeling (GMM) has gained great popularity in recent decades as a methodology for longitudinal data analysis. The usual assumption of normally distributed repeated measures has been shown to be problematic in real-life data applications: fitting a normal GMM to data that are even slightly skewed can lead to overextraction of latent classes. To mitigate this problem, GMM based on the skew-t family of continuous distributions has been proposed. This family includes the normal, skew-normal, t, and skew-t distributions. This simulation study examines how reliably the “true” number of latent groups is selected in GMM based on the skew-t family, using fit indices and likelihood ratio tests (LRTs). Results show that the skew-t GMM was the only model considered whose fit-index and LRT false positive rates stayed below the 0.05 cutoff across sample sizes and for normal as well as skewed and kurtotic data. The simulation results are corroborated by an application to real educational data. These findings support the development of practical guidelines on the benefits and risks of using GMM based on this family of distributions.
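The overextraction problem described in the abstract can be illustrated with a much simpler setting than the one studied in the paper. The sketch below is not the authors' simulation design and does not fit a growth mixture model: it assumes a cross-sectional, univariate normal mixture fitted by a hand-written EM routine, uses BIC as the only fit index, and draws skewed single-class data from an exponential distribution. All function names, sample sizes, and replication counts are illustrative assumptions.

```python
import math
import random


def normal_logpdf(x, mu, var):
    """Log density of a normal distribution with mean mu and variance var."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mu) ** 2 / var)


def loglik_one_class(data):
    """Maximized log-likelihood of a single normal class (closed form)."""
    n = len(data)
    mu = sum(data) / n
    var = sum((x - mu) ** 2 for x in data) / n
    return sum(normal_logpdf(x, mu, var) for x in data)


def loglik_two_class(data, iters=150, var_floor=1e-3):
    """Log-likelihood of a two-class normal mixture fitted by EM.

    Assumption: one deterministic start (lower/upper half of the data)
    and a variance floor to avoid degenerate spikes.
    """
    n = len(data)
    ordered = sorted(data)
    lower, upper = ordered[: n // 2], ordered[n // 2:]
    mu = [sum(lower) / len(lower), sum(upper) / len(upper)]
    grand = sum(data) / n
    var = [sum((x - grand) ** 2 for x in data) / n] * 2
    w = [0.5, 0.5]
    loglik = float("-inf")
    for _ in range(iters):
        # E-step: class responsibilities via a stable log-sum-exp.
        resp, loglik = [], 0.0
        for x in data:
            a = math.log(w[0]) + normal_logpdf(x, mu[0], var[0])
            b = math.log(w[1]) + normal_logpdf(x, mu[1], var[1])
            m = max(a, b)
            total = m + math.log(math.exp(a - m) + math.exp(b - m))
            loglik += total
            resp.append(math.exp(a - total))
        # M-step: weighted means, variances, and class proportions.
        for k in range(2):
            r = resp if k == 0 else [1.0 - p for p in resp]
            nk = max(sum(r), 1e-8)
            mu[k] = sum(p * x for p, x in zip(r, data)) / nk
            spread = sum(p * (x - mu[k]) ** 2 for p, x in zip(r, data)) / nk
            var[k] = max(spread, var_floor)
            w[k] = min(max(nk / n, 1e-8), 1.0 - 1e-8)
    # Log-likelihood evaluated in the last E-step.
    return loglik


def bic(loglik, n_params, n):
    """Bayesian information criterion; lower values indicate better fit."""
    return -2.0 * loglik + n_params * math.log(n)


def false_positive_rate(sampler, reps, n, seed):
    """Share of single-class samples for which BIC prefers two classes."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(reps):
        data = [sampler(rng) for _ in range(n)]
        bic_one = bic(loglik_one_class(data), 2, n)  # mean, variance
        bic_two = bic(loglik_two_class(data), 5, n)  # 2 means, 2 variances, 1 weight
        hits += bic_two < bic_one
    return hits / reps


# Every sample below is generated from ONE class, so any preference for
# the two-class model counts as a class enumeration false positive.
rate_normal = false_positive_rate(lambda rng: rng.gauss(0.0, 1.0), reps=10, n=200, seed=1)
rate_skewed = false_positive_rate(lambda rng: rng.expovariate(1.0), reps=10, n=200, seed=1)
print("False positive rate, normal data:", rate_normal)
print("False positive rate, skewed data:", rate_skewed)
```

Because the data come from a single class in both conditions, comparing the two printed rates shows how skewness alone can push a normal mixture toward an extra class. A study like the one summarized here would replace the toy EM routine with normal, skew-normal, t, and skew-t growth mixture models and would evaluate likelihood ratio tests alongside fit indices.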

Suggested Citation

  • Kiero Guerra-Peña & Zoilo Emilio García-Batista & Sarah Depaoli & Luis Eduardo Garrido, 2020. "Class enumeration false positive in skew-t family of continuous growth mixture models," PLOS ONE, Public Library of Science, vol. 15(4), pages 1-19, April.
  • Handle: RePEc:plo:pone00:0231525
    DOI: 10.1371/journal.pone.0231525

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0231525
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0231525&type=printable
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nicolas Depraetere & Martina Vandebroek, 2014. "Order selection in finite mixtures of linear regressions," Statistical Papers, Springer, vol. 55(3), pages 871-911, August.
    2. Jost Reinecke & Daniel Seddig, 2011. "Growth mixture models in longitudinal research," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 95(4), pages 415-434, December.
    3. Morgan, Grant B. & Hodge, Kari J. & Baggett, Aaron R., 2016. "Latent profile analysis with nonnormal mixtures: A Monte Carlo examination of model selection using fit indices," Computational Statistics & Data Analysis, Elsevier, vol. 93(C), pages 146-161.
    4. Sarstedt, Marko & Salcher, André, 2007. "Modellselektion in Finite Mixture PLS-Modellen," Discussion Papers in Business Administration 1394, University of Munich, Munich School of Management.
    5. Max Auerswald & Morten Moshagen, 2015. "Generating Correlated, Non-normally Distributed Data Using a Non-linear Structural Model," Psychometrika, Springer;The Psychometric Society, vol. 80(4), pages 920-937, December.
    6. Mohan D. Pant & Todd C. Headrick, 2017. "Simulating Uniform- and Triangular- Based Double Power Method Distributions," Journal of Statistical and Econometric Methods, SCIENPRESS Ltd, vol. 6(1), pages 1-1.
    7. Joanna F. Dipnall & Belinda J. Gabbe & Warwick J. Teague & Ben Beck, 2020. "Identifying Homogeneous Patterns of Injury in Paediatric Trauma Patients to Improve Risk-Adjusted Models of Mortality and Functional Outcomes," IJERPH, MDPI, vol. 17(3), pages 1-20, January.
    8. Emanuela Raffinetti & Pier Alda Ferrari, 2021. "A dependence measure flow tree through Monte Carlo simulations," Quality & Quantity: International Journal of Methodology, Springer, vol. 55(2), pages 467-496, April.
    9. M Hashem Pesaran & Takashi Yamagata, 2012. "Testing CAPM with a Large Number of Assets," Discussion Papers 12/05, Department of Economics, University of York.
    10. Pasquale Dolce & Cristina Davino & Domenico Vistocco, 2022. "Quantile composite-based path modeling: algorithms, properties and applications," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(4), pages 909-949, December.
    11. Williams, John & Temme, Dirk & Hildebrandt, Lutz, 2002. "A Monte Carlo study of structural equation models for finite mixtures," SFB 373 Discussion Papers 2002,48, Humboldt University of Berlin, Interdisciplinary Research Project 373: Quantification and Simulation of Economic Processes.
    12. Pennoni, Fulvia & Romeo, Isabella, 2016. "Latent Markov and growth mixture models for ordinal individual responses with covariates: a comparison," MPRA Paper 72939, University Library of Munich, Germany.
    13. Ke-Hai Yuan & Peter Bentler, 2002. "On robustness of the normal-theory based asymptotic distributions of three reliability coefficient estimates," Psychometrika, Springer;The Psychometric Society, vol. 67(2), pages 251-259, June.
    14. repec:jss:jstsof:09:i04 is not listed on IDEAS
    15. Michael P. B. Gallaugher & Paul D. McNicholas, 2019. "On Fractionally-Supervised Classification: Weight Selection and Extension to the Multivariate t-Distribution," Journal of Classification, Springer;The Classification Society, vol. 36(2), pages 232-265, July.
    16. Anindita Chakravarty & Rajdeep Grewal & V. Sambamurthy, 2013. "Information Technology Competencies, Organizational Agility, and Firm Performance: Enabling and Facilitating Roles," Information Systems Research, INFORMS, vol. 24(4), pages 976-997, December.
    17. Heike Heidemeier & Anja Göritz, 2013. "Individual Differences in How Work and Nonwork Life Domains Contribute to Life Satisfaction: Using Factor Mixture Modeling for Classification," Journal of Happiness Studies, Springer, vol. 14(6), pages 1765-1788, December.
    18. Jeff Jones & Niels Waller, 2015. "The Normal-Theory and Asymptotic Distribution-Free (ADF) Covariance Matrix of Standardized Regression Coefficients: Theoretical Extensions and Finite Sample Behavior," Psychometrika, Springer;The Psychometric Society, vol. 80(2), pages 365-378, June.
    19. Njål Foldnes & Steffen Grønneberg, 2015. "How General is the Vale–Maurelli Simulation Approach?," Psychometrika, Springer;The Psychometric Society, vol. 80(4), pages 1066-1083, December.
    20. Hakan Demirtas, 2016. "A Note on the Relationship Between the Phi Coefficient and the Tetrachoric Correlation Under Nonnormal Underlying Distributions," The American Statistician, Taylor & Francis Journals, vol. 70(2), pages 143-148, May.
    21. Seuk Yen Phoong & Shi Ling Khek & Seuk Wai Phoong, 2022. "The Bibliometric Analysis on Finite Mixture Model," SAGE Open, vol. 12(2), pages 21582440221, May.
