
Structured latent factor analysis for large-scale data: identifiability, estimability, and their implications

Author

Listed:
  • Chen, Yunxiao
  • Li, Xiaoou
  • Zhang, Siliang

Abstract

Latent factor models are widely used to measure unobserved latent traits in social and behavioral sciences, including psychology, education, and marketing. When used in a confirmatory manner, design information is incorporated as zero constraints on corresponding parameters, yielding structured (confirmatory) latent factor models. In this paper, we study how such design information affects the identifiability and the estimation of a structured latent factor model. Insights are gained through both asymptotic and non-asymptotic analyses. Our asymptotic results are established under a regime where both the number of manifest variables and the sample size diverge, motivated by applications to large-scale data. Under this regime, we define the structural identifiability of the latent factors and establish necessary and sufficient conditions that ensure structural identifiability. In addition, we propose an estimator which is shown to be consistent and rate optimal when structural identifiability holds. Finally, a non-asymptotic error bound is derived for this estimator, through which the effect of design information is further quantified. Our results shed light on the design of large-scale measurement in education and psychology and have important implications for measurement validity and reliability.

Suggested Citation

  • Chen, Yunxiao & Li, Xiaoou & Zhang, Siliang, 2019. "Structured latent factor analysis for large-scale data: identifiability, estimability, and their implications," LSE Research Online Documents on Economics 101122, London School of Economics and Political Science, LSE Library.
  • Handle: RePEc:ehl:lserod:101122

    Download full text from publisher

    File URL: http://eprints.lse.ac.uk/101122/
    File Function: Open access version.
    Download Restriction: no

    References listed on IDEAS

    1. David Grayson & Herbert Marsh, 1994. "Identification with deficient rank loading matrices in confirmatory factor analysis: Multitrait-multimethod models," Psychometrika, Springer;The Psychometric Society, vol. 59(1), pages 121-134, March.
    2. Ghosh, Malay, 1995. "Inconsistent maximum likelihood estimators for the Rasch model," Statistics & Probability Letters, Elsevier, vol. 23(2), pages 165-170, May.
    3. Paul Holland, 1990. "On the sampling theory foundations of item response theory models," Psychometrika, Springer;The Psychometric Society, vol. 55(4), pages 577-601, December.
    4. Xuan Bi & Annie Qu & Junhui Wang & Xiaotong Shen, 2017. "A Group-Specific Recommender System," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1344-1353, July.
    5. Yunzhang Zhu & Xiaotong Shen & Changqing Ye, 2016. "Personalized Prediction and Sparsity Pursuit in Latent Factor Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(513), pages 241-252, March.
    6. Jay Bartroff & Matthew Finkelman & Tze Lai, 2008. "Modern Sequential Analysis and Its Applications to Computerized Adaptive Testing," Psychometrika, Springer;The Psychometric Society, vol. 73(3), pages 473-486, September.
    7. Yunxiao Chen & Xiaoou Li & Siliang Zhang, 2019. "Joint Maximum Likelihood Estimation for High-Dimensional Exploratory Item Factor Analysis," Psychometrika, Springer;The Psychometric Society, vol. 84(1), pages 124-146, March.

    Citations



    Cited by:

    1. Zhenghao Zeng & Yuqi Gu & Gongjun Xu, 2023. "A Tensor-EM Method for Large-Scale Latent Class Analysis with Binary Responses," Psychometrika, Springer;The Psychometric Society, vol. 88(2), pages 580-612, June.
    2. Yunxiao Chen, 2020. "A Continuous-Time Dynamic Choice Measurement Model for Problem-Solving Process Data," Psychometrika, Springer;The Psychometric Society, vol. 85(4), pages 1052-1075, December.
    3. Jinsong Chen, 2020. "A Partially Confirmatory Approach to the Multidimensional Item Response Theory with the Bayesian Lasso," Psychometrika, Springer;The Psychometric Society, vol. 85(3), pages 738-774, September.
    4. Yoav Bergner & Peter Halpin & Jill-Jênn Vie, 2022. "Multidimensional Item Response Theory in the Style of Collaborative Filtering," Psychometrika, Springer;The Psychometric Society, vol. 87(1), pages 266-288, March.
    5. Yang Liu, 2020. "A Riemannian Optimization Algorithm for Joint Maximum Likelihood Estimation of High-Dimensional Exploratory Item Factor Analysis," Psychometrika, Springer;The Psychometric Society, vol. 85(2), pages 439-468, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yoav Bergner & Peter Halpin & Jill-Jênn Vie, 2022. "Multidimensional Item Response Theory in the Style of Collaborative Filtering," Psychometrika, Springer;The Psychometric Society, vol. 87(1), pages 266-288, March.
    2. Chen, Yunxiao & Li, Xiaoou, 2022. "Determining the number of factors in high-dimensional generalized latent factor models," LSE Research Online Documents on Economics 111574, London School of Economics and Political Science, LSE Library.
    3. Siliang Zhang & Yunxiao Chen, 2022. "Computation for Latent Variable Model Estimation: A Unified Stochastic Proximal Framework," Psychometrika, Springer;The Psychometric Society, vol. 87(4), pages 1473-1502, December.
    4. Zhang, Siliang & Chen, Yunxiao, 2022. "Computation for latent variable model estimation: a unified stochastic proximal framework," LSE Research Online Documents on Economics 114489, London School of Economics and Political Science, LSE Library.
    5. Alexander Robitzsch, 2023. "Linking Error in the 2PL Model," J, MDPI, vol. 6(1), pages 1-27, January.
    6. Victor F. Araman & René A. Caldentey, 2022. "Diffusion Approximations for a Class of Sequential Experimentation Problems," Management Science, INFORMS, vol. 68(8), pages 5958-5979, August.
    7. Robitzsch, Alexander, 2020. "About Still Nonignorable Consequences of (Partially) Ignoring Missing Item Responses in Large-scale Assessment," OSF Preprints hmy45, Center for Open Science.
    8. Stanislav Kolenikov, 2009. "Confirmatory factor analysis using confa," Stata Journal, StataCorp LP, vol. 9(3), pages 329-373, September.
    9. Yinqiu He & Zi Wang & Gongjun Xu, 2021. "A Note on the Likelihood Ratio Test in High-Dimensional Exploratory Factor Analysis," Psychometrika, Springer;The Psychometric Society, vol. 86(2), pages 442-463, June.
    10. Dong, Ruipeng & Li, Daoji & Zheng, Zemin, 2021. "Parallel integrative learning for large-scale multi-response regression with incomplete outcomes," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    11. Xuan Bi & Gediminas Adomavicius & William Li & Annie Qu, 2022. "Improving Sales Forecasting Accuracy: A Tensor Factorization Approach with Demand Awareness," INFORMS Journal on Computing, INFORMS, vol. 34(3), pages 1644-1660, May.
    12. Michael Maraun, 2017. "The object detection logic of latent variable technologies," Quality & Quantity: International Journal of Methodology, Springer, vol. 51(1), pages 239-259, January.
    13. Alexander Robitzsch, 2021. "A Comprehensive Simulation Study of Estimation Methods for the Rasch Model," Stats, MDPI, vol. 4(4), pages 1-23, October.
    14. Pascal Jordan & Martin Spiess, 2012. "Generalizations of Paradoxical Results in Multidimensional Item Response Theory," Psychometrika, Springer;The Psychometric Society, vol. 77(1), pages 127-152, January.
    15. Xiaojun Mao & Zhonglei Wang & Shu Yang, 2023. "Matrix completion under complex survey sampling," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(3), pages 463-492, June.
    16. Liu, Xinyi Lin & Wallin, Gabriel & Chen, Yunxiao & Moustaki, Irini, 2023. "Rotation to sparse loadings using Lp losses and related inference problems," LSE Research Online Documents on Economics 118349, London School of Economics and Political Science, LSE Library.
    17. Paula Fariña & Jorge González & Ernesto San Martín, 2019. "The Use of an Identifiability-Based Strategy for the Interpretation of Parameters in the 1PL-G and Rasch Models," Psychometrika, Springer;The Psychometric Society, vol. 84(2), pages 511-528, June.
    18. Anders Skrondal & Sophia Rabe-Hesketh, 2022. "The Role of Conditional Likelihoods in Latent Variable Modeling," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 799-834, September.
    19. Stefano Noventa & Luca Stefanutti & Giulio Vidotto, 2014. "An Analysis of Item Response Theory and Rasch Models Based on the Most Probable Distribution Method," Psychometrika, Springer;The Psychometric Society, vol. 79(3), pages 377-402, July.
    20. Lu, Zexian & Chen, Yunxiao & Li, Xiaoou, 2022. "Optimal parallel sequential change detection under generalized performance measures," LSE Research Online Documents on Economics 118348, London School of Economics and Political Science, LSE Library.

    More about this item

    Keywords

    High-dimensional latent factor model; confirmatory factor analysis; identifiability of latent factors; structured low-rank matrix; large-scale psychological measurement

    JEL classification:

    • C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General

