IDEAS home Printed from https://ideas.repec.org/a/sae/evarev/v45y2021i6p309-333.html
   My bibliography  Save this article

Finite Mixture Modeling for Program Evaluation: Resampling and Pre-processing Approaches

Author

Listed:
  • Zachary K. Collier
  • Haobai Zhang
  • Bridgette Johnson

Abstract

Background Finite mixture models cluster individuals into latent subgroups based on observed traits. However, inaccurate enumeration of clusters can have lasting implications on policy decisions and allocations of resources. Applied and methodological researchers accept no obvious best model fit statistic, and different measures could suggest different numbers of latent clusters. Objectives The purpose of this article is to evaluate and compare different cluster enumeration techniques. Research Design Study I demonstrates how recently proposed resampling methods result in no precise number of clusters on which all fit statistics agree. We recommend the pre-processing method in Study II as an alternative. Both studies used nationally representative data on working memory, cognitive flexibility, and inhibitory control. Conclusions The data plus priors method shows promise to address inconsistencies among fit measures and help applied researchers using finite mixture models in the future.

Suggested Citation

  • Zachary K. Collier & Haobai Zhang & Bridgette Johnson, 2021. "Finite Mixture Modeling for Program Evaluation: Resampling and Pre-processing Approaches," Evaluation Review, , vol. 45(6), pages 309-333, December.
  • Handle: RePEc:sae:evarev:v:45:y:2021:i:6:p:309-333
    DOI: 10.1177/0193841X211065619
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0193841X211065619
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0193841X211065619?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Kazem Nasserinejad & Joost van Rosmalen & Wim de Kort & Emmanuel Lesaffre, 2017. "Comparison of Criteria for Choosing the Number of Classes in Bayesian Finite Mixture Models," PLOS ONE, Public Library of Science, vol. 12(1), pages 1-23, January.
    2. Yuhong Yang, 2005. "Can the strengths of AIC and BIC be shared? A conflict between model indentification and regression estimation," Biometrika, Biometrika Trust, vol. 92(4), pages 937-950, December.
    3. Gilles Celeux & Gilda Soromenho, 1996. "An entropy criterion for assessing the number of clusters in a mixture model," Journal of Classification, Springer;The Classification Society, vol. 13(2), pages 195-212, September.
    4. Hamparsum Bozdogan, 1987. "Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions," Psychometrika, Springer;The Psychometric Society, vol. 52(3), pages 345-370, September.
    5. Bengt Muthén & Kerby Shedden, 1999. "Finite Mixture Modeling with Mixture Outcomes Using the EM Algorithm," Biometrics, The International Biometric Society, vol. 55(2), pages 463-469, June.
    6. Kim, Ji-Hyun, 2009. "Estimating classification error rate: Repeated cross-validation, repeated hold-out and bootstrap," Computational Statistics & Data Analysis, Elsevier, vol. 53(11), pages 3735-3745, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nicolas Depraetere & Martina Vandebroek, 2014. "Order selection in finite mixtures of linear regressions," Statistical Papers, Springer, vol. 55(3), pages 871-911, August.
    2. Joanna F. Dipnall & Belinda J. Gabbe & Warwick J. Teague & Ben Beck, 2020. "Identifying Homogeneous Patterns of Injury in Paediatric Trauma Patients to Improve Risk-Adjusted Models of Mortality and Functional Outcomes," IJERPH, MDPI, vol. 17(3), pages 1-20, January.
    3. Kim, Daeyoung & Kim, Jong-Min & Liao, Shu-Min & Jung, Yoon-Sung, 2013. "Mixture of D-vine copulas for modeling dependence," Computational Statistics & Data Analysis, Elsevier, vol. 64(C), pages 1-19.
    4. Ana Oliveira-Brochado & Francisco Vitorino Martins, 2014. "Identifying Small Market Segments with Mixture Regression Models," International Journal of Finance, Insurance and Risk Management, International Journal of Finance, Insurance and Risk Management, vol. 4(4), pages 812-812.
    5. Marianna Virtanen & Jussi Vahtera & Jenny Head & Rosemary Dray-Spira & Annaleena Okuloff & Adam G Tabak & Marcel Goldberg & Jenni Ervasti & Markus Jokela & Archana Singh-Manoux & Jaana Pentti & Marie , 2015. "Work Disability among Employees with Diabetes: Latent Class Analysis of Risk Factors in Three Prospective Cohort Studies," PLOS ONE, Public Library of Science, vol. 10(11), pages 1-14, November.
    6. Landry, Craig & Remar, Daniel & Twinkle, Roy, 2022. "Economic Value of Restaurant Safety Measures and Propensity to Dine during the COVID-19 Pandemic," 2022 Annual Meeting, July 31-August 2, Anaheim, California 322234, Agricultural and Applied Economics Association.
    7. Pennoni, Fulvia & Romeo, Isabella, 2016. "Latent Markov and growth mixture models for ordinal individual responses with covariates: a comparison," MPRA Paper 72939, University Library of Munich, Germany.
    8. Anindita Chakravarty & Rajdeep Grewal & V. Sambamurthy, 2013. "Information Technology Competencies, Organizational Agility, and Firm Performance: Enabling and Facilitating Roles," Information Systems Research, INFORMS, vol. 24(4), pages 976-997, December.
    9. Heike Heidemeier & Anja Göritz, 2013. "Individual Differences in How Work and Nonwork Life Domains Contribute to Life Satisfaction: Using Factor Mixture Modeling for Classification," Journal of Happiness Studies, Springer, vol. 14(6), pages 1765-1788, December.
    10. Wei Zhao & Limin Peng & John Hanfelt, 2022. "Semiparametric latent class analysis of recurrent event data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(4), pages 1175-1197, September.
    11. Kim, Daeyoung & Seo, Byungtae, 2014. "Assessment of the number of components in Gaussian mixture models in the presence of multiple local maximizers," Journal of Multivariate Analysis, Elsevier, vol. 125(C), pages 100-120.
    12. S. Bacci & S. Pandolfi & F. Pennoni, 2014. "A comparison of some criteria for states selection in the latent Markov model for longitudinal data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(2), pages 125-145, June.
    13. Omar N. Solinger & Woody van Olffen & Robert A. Roe & Joeri Hofmans, 2013. "On Becoming (Un)Committed: A Taxonomy and Test of Newcomer Onboarding Scenarios," Organization Science, INFORMS, vol. 24(6), pages 1640-1661, December.
    14. Jost Reinecke & Daniel Seddig, 2011. "Growth mixture models in longitudinal research," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 95(4), pages 415-434, December.
    15. Ana Oliveira-Brochado & Francisco Vitorino Martins, 2008. "Determining the Number of Market Segments Using an Experimental Design," FEP Working Papers 263, Universidade do Porto, Faculdade de Economia do Porto.
    16. Tian, Amy Wei & Meyer, John P. & Ilic-Balas, Tatjana & Espinoza, Jose A. & Pepper, Susan, 2023. "In search of the pseudo-transformational leader: A person-centered approach," Journal of Business Research, Elsevier, vol. 158(C).
    17. Jumin Park & Debra K. Moser & Kathleen Griffith & Jeffrey R. Harring & Meg Johantgen, 2019. "Exploring Symptom Clusters in People With Heart Failure," Clinical Nursing Research, , vol. 28(2), pages 165-181, February.
    18. Danks, Nicholas P. & Sharma, Pratyush N. & Sarstedt, Marko, 2020. "Model selection uncertainty and multimodel inference in partial least squares structural equation modeling (PLS-SEM)," Journal of Business Research, Elsevier, vol. 113(C), pages 13-24.
    19. Morgan, Grant B. & Hodge, Kari J. & Baggett, Aaron R., 2016. "Latent profile analysis with nonnormal mixtures: A Monte Carlo examination of model selection using fit indices," Computational Statistics & Data Analysis, Elsevier, vol. 93(C), pages 146-161.
    20. Lu, Zhenqiu (Laura) & Zhang, Zhiyong, 2014. "Robust growth mixture models with non-ignorable missingness: Models, estimation, selection, and application," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 220-240.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:evarev:v:45:y:2021:i:6:p:309-333. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.