IDEAS home Printed from https://ideas.repec.org/p/ehl/lserod/103182.html
   My bibliography  Save this paper

Regularized latent class analysis with application in cognitive diagnosis

Author

Listed:
  • Chen, Yunxiao
  • Li, Xiaoou
  • Liu, Jingchen
  • Ying, Zhiliang

Abstract

Diagnostic classification models are confirmatory in the sense that the relationship between the latent attributes and responses to items is specified or parameterized. Such models are readily interpretable with each component of the model usually having a practical meaning. However, parameterized diagnostic classification models are sometimes too simple to capture all the data patterns, resulting in significant model lack of fit. In this paper, we attempt to obtain a compromise between interpretability and goodness of fit by regularizing a latent class model. Our approach starts with minimal assumptions on the data structure, followed by suitable regularization to reduce complexity, so that readily interpretable, yet flexible model is obtained. An expectation–maximization-type algorithm is developed for efficient computation. It is shown that the proposed approach enjoys good theoretical properties. Results from simulation studies and a real application are presented.

Suggested Citation

  • Chen, Yunxiao & Li, Xiaoou & Liu, Jingchen & Ying, Zhiliang, 2017. "Regularized latent class analysis with application in cognitive diagnosis," LSE Research Online Documents on Economics 103182, London School of Economics and Political Science, LSE Library.
  • Handle: RePEc:ehl:lserod:103182
    as

    Download full text from publisher

    File URL: http://eprints.lse.ac.uk/103182/
    File Function: Open access version.
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Jiahua Chen & Zehua Chen, 2008. "Extended Bayesian information criteria for model selection with large model spaces," Biometrika, Biometrika Trust, vol. 95(3), pages 759-771.
    2. Jimmy Torre & Jeffrey Douglas, 2004. "Higher-order latent trait models for cognitive diagnosis," Psychometrika, Springer;The Psychometric Society, vol. 69(3), pages 333-353, September.
    3. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    4. Hansheng Wang & Bo Li & Chenlei Leng, 2009. "Shrinkage tuning parameter selection with a diverging number of parameters," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(3), pages 671-683, June.
    5. Curtis Tatsuoka & Thomas Ferguson, 2003. "Sequential classification on partially ordered sets," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 65(1), pages 143-157, February.
    6. Curtis Tatsuoka, 2002. "Data analytic methods for latent partially ordered classification models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 51(3), pages 337-350, July.
    7. Yingying Fan & Cheng Yong Tang, 2013. "Tuning parameter selection in high dimensional penalized likelihood," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(3), pages 531-552, June.
    8. Hansheng Wang & Runze Li & Chih-Ling Tsai, 2007. "Tuning parameter selectors for the smoothly clipped absolute deviation method," Biometrika, Biometrika Trust, vol. 94(3), pages 553-568.
    9. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    10. Kikumi K. Tatsuoka, 1985. "A Probabilistic Model for Diagnosing Misconceptions By The Pattern Classification Approach," Journal of Educational and Behavioral Statistics, , vol. 10(1), pages 55-73, March.
    11. Wang, Tao & Zhu, Lixing, 2011. "Consistent tuning parameter selection in high dimensional sparse linear regression," Journal of Multivariate Analysis, Elsevier, vol. 102(7), pages 1141-1151, August.
    12. Jimmy de la Torre, 2011. "The Generalized DINA Model Framework," Psychometrika, Springer;The Psychometric Society, vol. 76(2), pages 179-199, April.
    13. Chen, Yunxiao & Liu, Jingchen & Xu, Gongjun & Ying, Zhiliang, 2015. "Statistical analysis of Q-matrix based diagnostic classification models," LSE Research Online Documents on Economics 103183, London School of Economics and Political Science, LSE Library.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhenghao Zeng & Yuqi Gu & Gongjun Xu, 2023. "A Tensor-EM Method for Large-Scale Latent Class Analysis with Binary Responses," Psychometrika, Springer;The Psychometric Society, vol. 88(2), pages 580-612, June.
    2. Chenchen Ma & Jing Ouyang & Gongjun Xu, 2023. "Learning Latent and Hierarchical Structures in Cognitive Diagnosis Models," Psychometrika, Springer;The Psychometric Society, vol. 88(1), pages 175-207, March.
    3. Chun Wang & Jing Lu, 2021. "Learning Attribute Hierarchies From Data: Two Exploratory Approaches," Journal of Educational and Behavioral Statistics, , vol. 46(1), pages 58-84, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yunxiao Chen & Xiaoou Li & Jingchen Liu & Zhiliang Ying, 2017. "Regularized Latent Class Analysis with Application in Cognitive Diagnosis," Psychometrika, Springer;The Psychometric Society, vol. 82(3), pages 660-692, September.
    2. Chun Wang, 2021. "Using Penalized EM Algorithm to Infer Learning Trajectories in Latent Transition CDM," Psychometrika, Springer;The Psychometric Society, vol. 86(1), pages 167-189, March.
    3. Chen, Yunxiao & Liu, Jingchen & Xu, Gongjun & Ying, Zhiliang, 2015. "Statistical analysis of Q-matrix based diagnostic classification models," LSE Research Online Documents on Economics 103183, London School of Economics and Political Science, LSE Library.
    4. Yingying Fan & Cheng Yong Tang, 2013. "Tuning parameter selection in high dimensional penalized likelihood," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(3), pages 531-552, June.
    5. Yuqi Gu & Jingchen Liu & Gongjun Xu & Zhiliang Ying, 2018. "Hypothesis Testing of the Q-matrix," Psychometrika, Springer;The Psychometric Society, vol. 83(3), pages 515-537, September.
    6. Juntao Wang & Yuan Li, 2023. "DINA Model with Entropy Penalization," Mathematics, MDPI, vol. 11(18), pages 1-16, September.
    7. Burman, Prabir & Paul, Debashis, 2017. "Smooth predictive model fitting in regression," Journal of Multivariate Analysis, Elsevier, vol. 155(C), pages 165-179.
    8. Hans-Friedrich Köhn & Chia-Yi Chiu, 2017. "A Procedure for Assessing the Completeness of the Q-Matrices of Cognitively Diagnostic Tests," Psychometrika, Springer;The Psychometric Society, vol. 82(1), pages 112-132, March.
    9. Chenchen Ma & Jing Ouyang & Gongjun Xu, 2023. "Learning Latent and Hierarchical Structures in Cognitive Diagnosis Models," Psychometrika, Springer;The Psychometric Society, vol. 88(1), pages 175-207, March.
    10. Sakyajit Bhattacharya & Paul McNicholas, 2014. "A LASSO-penalized BIC for mixture model selection," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(1), pages 45-61, March.
    11. Chen-Wei Liu & Björn Andersson & Anders Skrondal, 2020. "A Constrained Metropolis–Hastings Robbins–Monro Algorithm for Q Matrix Estimation in DINA Models," Psychometrika, Springer;The Psychometric Society, vol. 85(2), pages 322-357, June.
    12. Hui Xiao & Yiguo Sun, 2019. "On Tuning Parameter Selection in Model Selection and Model Averaging: A Monte Carlo Study," JRFM, MDPI, vol. 12(3), pages 1-16, June.
    13. Motonori Oka & Kensuke Okada, 2023. "Scalable Bayesian Approach for the Dina Q-Matrix Estimation Combining Stochastic Optimization and Variational Inference," Psychometrika, Springer;The Psychometric Society, vol. 88(1), pages 302-331, March.
    14. Hirose, Kei & Tateishi, Shohei & Konishi, Sadanori, 2013. "Tuning parameter selection in sparse regression modeling," Computational Statistics & Data Analysis, Elsevier, vol. 59(C), pages 28-40.
    15. Yuta Umezu & Yusuke Shimizu & Hiroki Masuda & Yoshiyuki Ninomiya, 2019. "AIC for the non-concave penalized likelihood method," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(2), pages 247-274, April.
    16. Kaixu Yang & Tapabrata Maiti, 2022. "Ultrahigh‐dimensional generalized additive model: Unified theory and methods," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(3), pages 917-942, September.
    17. Lian, Heng, 2014. "Semiparametric Bayesian information criterion for model selection in ultra-high dimensional additive models," Journal of Multivariate Analysis, Elsevier, vol. 123(C), pages 304-310.
    18. Wang, Tao & Zhu, Lixing, 2011. "Consistent tuning parameter selection in high dimensional sparse linear regression," Journal of Multivariate Analysis, Elsevier, vol. 102(7), pages 1141-1151, August.
    19. Lian, Heng, 2012. "A note on the consistency of Schwarz’s criterion in linear quantile regression with the SCAD penalty," Statistics & Probability Letters, Elsevier, vol. 82(7), pages 1224-1228.
    20. Fan, Rui & Lee, Ji Hyung & Shin, Youngki, 2023. "Predictive quantile regression with mixed roots and increasing dimensions: The ALQR approach," Journal of Econometrics, Elsevier, vol. 237(2).

    More about this item

    Keywords

    diagnostic classification models; latent class analysis; regularization; consistency; EM algorithm;
    All these keywords.

    JEL classification:

    • C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ehl:lserod:103182. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: LSERO Manager (email available below). General contact details of provider: https://edirc.repec.org/data/lsepsuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.