A Deep Learning Algorithm for High-Dimensional Exploratory Item Factor Analysis

Author

Listed:
  • Christopher J. Urban

    (University of North Carolina at Chapel Hill)

  • Daniel J. Bauer

    (University of North Carolina at Chapel Hill)

Abstract

Marginal maximum likelihood (MML) estimation is the preferred approach to fitting item response theory models in psychometrics due to the MML estimator’s consistency, normality, and efficiency as the sample size tends to infinity. However, state-of-the-art MML estimation procedures such as the Metropolis–Hastings Robbins–Monro (MH-RM) algorithm as well as approximate MML estimation procedures such as variational inference (VI) are computationally time-consuming when the sample size and the number of latent factors are very large. In this work, we investigate a deep learning-based VI algorithm for exploratory item factor analysis (IFA) that is computationally fast even in large data sets with many latent factors. The proposed approach applies a deep artificial neural network model called an importance-weighted autoencoder (IWAE) for exploratory IFA. The IWAE approximates the MML estimator using an importance sampling technique wherein increasing the number of importance-weighted (IW) samples drawn during fitting improves the approximation, typically at the cost of decreased computational efficiency. We provide a real data application that recovers results aligning with psychological theory across random starts. Via simulation studies, we show that the IWAE yields more accurate estimates as either the sample size or the number of IW samples increases (although factor correlation and intercepts estimates exhibit some bias) and obtains similar results to MH-RM in less time. Our simulations also suggest that the proposed approach performs similarly to and is potentially faster than constrained joint maximum likelihood estimation, a fast procedure that is consistent when the sample size and the number of items simultaneously tend to infinity.
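The importance sampling idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a toy one-factor two-parameter logistic model for a single respondent, replaces the IWAE's neural network encoder with a fixed Gaussian proposal, and uses hypothetical item parameters, responses, and function names throughout. It computes the importance-weighted bound, the log of the average of R weights p(x, z_r) / q(z_r), and compares its average value with the log marginal likelihood obtained by numerical quadrature.

```python
import numpy as np

def log_sigmoid(t):
    # Numerically stable log(1 / (1 + exp(-t))).
    return -np.logaddexp(0.0, -t)

def log_joint(x, z, a, b):
    # log p(x, z): Bernoulli item responses given z, standard normal prior on z.
    # z has shape (R,); x, a, b have shape (J,). Returns shape (R,).
    logits = np.outer(z, a) + b
    log_lik = np.sum(x * log_sigmoid(logits) + (1 - x) * log_sigmoid(-logits), axis=1)
    log_prior = -0.5 * z**2 - 0.5 * np.log(2.0 * np.pi)
    return log_lik + log_prior

def iw_bound(x, a, b, mu, sigma, n_iw, rng):
    # Draw n_iw samples from the proposal q(z) = N(mu, sigma^2).
    z = mu + sigma * rng.standard_normal(n_iw)
    log_q = -0.5 * ((z - mu) / sigma) ** 2 - np.log(sigma) - 0.5 * np.log(2.0 * np.pi)
    # Log importance weights, then log of their mean via log-sum-exp.
    log_w = log_joint(x, z, a, b) - log_q
    m = log_w.max()
    return m + np.log(np.mean(np.exp(log_w - m)))

def log_marginal_quadrature(x, a, b, n_grid=2001):
    # Reference value: log of the integral of p(x, z) over z on a fine grid.
    z = np.linspace(-8.0, 8.0, n_grid)
    lj = log_joint(x, z, a, b)
    m = lj.max()
    return m + np.log(np.sum(np.exp(lj - m)) * (z[1] - z[0]))

# Hypothetical slopes, intercepts, and one respondent's binary responses.
a = np.array([1.2, 0.8, 1.5, 1.0, 0.7, 1.3])
b = np.array([0.0, -0.5, 0.5, 0.2, -0.2, 0.1])
x = np.array([1, 1, 1, 1, 0, 1])

# Deliberately crude proposal (the prior), so the effect of n_iw is visible.
mu, sigma = 0.0, 1.0

rng = np.random.default_rng(0)
exact = log_marginal_quadrature(x, a, b)
means = {
    n_iw: np.mean([iw_bound(x, a, b, mu, sigma, n_iw, rng) for _ in range(2000)])
    for n_iw in (1, 5, 50)
}

print("log marginal likelihood (quadrature):", exact)
for n_iw, value in means.items():
    print("average IW bound with", n_iw, "samples:", value)
```

On average the bound lies below the log marginal likelihood and tightens as the number of IW samples grows, at the cost of more likelihood evaluations per respondent. That trade-off is the one the abstract describes.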

Suggested Citation

  • Christopher J. Urban & Daniel J. Bauer, 2021. "A Deep Learning Algorithm for High-Dimensional Exploratory Item Factor Analysis," Psychometrika, Springer;The Psychometric Society, vol. 86(1), pages 1-29, March.
  • Handle: RePEc:spr:psycho:v:86:y:2021:i:1:d:10.1007_s11336-021-09748-3
    DOI: 10.1007/s11336-021-09748-3

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11336-021-09748-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.



    Citations


    Cited by:

    1. John Patrick Lalor & Pedro Rodriguez, 2023. "py-irt: A Scalable Item Response Theory Library for Python," INFORMS Journal on Computing, INFORMS, vol. 35(1), pages 5-13, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Siliang Zhang & Yunxiao Chen, 2022. "Computation for Latent Variable Model Estimation: A Unified Stochastic Proximal Framework," Psychometrika, Springer;The Psychometric Society, vol. 87(4), pages 1473-1502, December.
    2. Zhang, Siliang & Chen, Yunxiao, 2022. "Computation for latent variable model estimation: a unified stochastic proximal framework," LSE Research Online Documents on Economics 114489, London School of Economics and Political Science, LSE Library.
    3. Yunxiao Chen & Xiaoou Li & Siliang Zhang, 2019. "Joint Maximum Likelihood Estimation for High-Dimensional Exploratory Item Factor Analysis," Psychometrika, Springer;The Psychometric Society, vol. 84(1), pages 124-146, March.
    4. Bianconcini, Silvia & Cagnone, Silvia, 2012. "Estimation of generalized linear latent variable models via fully exponential Laplace approximation," Journal of Multivariate Analysis, Elsevier, vol. 112(C), pages 183-193.
    5. Björn Andersson & Tao Xin, 2021. "Estimation of Latent Regression Item Response Theory Models Using a Second-Order Laplace Approximation," Journal of Educational and Behavioral Statistics, vol. 46(2), pages 244-265, April.
    6. Yoav Bergner & Peter Halpin & Jill-Jênn Vie, 2022. "Multidimensional Item Response Theory in the Style of Collaborative Filtering," Psychometrika, Springer;The Psychometric Society, vol. 87(1), pages 266-288, March.
    7. Yang Liu, 2020. "A Riemannian Optimization Algorithm for Joint Maximum Likelihood Estimation of High-Dimensional Exploratory Item Factor Analysis," Psychometrika, Springer;The Psychometric Society, vol. 85(2), pages 439-468, June.
    8. Li Cai, 2010. "A Two-Tier Full-Information Item Factor Analysis Model with Applications," Psychometrika, Springer;The Psychometric Society, vol. 75(4), pages 581-612, December.
    9. Battauz, Michela & Vidoni, Paolo, 2022. "A likelihood-based boosting algorithm for factor analysis models with binary data," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    10. Yang Liu & Jan Hannig, 2017. "Generalized Fiducial Inference for Logistic Graded Response Models," Psychometrika, Springer;The Psychometric Society, vol. 82(4), pages 1097-1125, December.
    11. Yang Liu & Ji Seung Yang, 2018. "Bootstrap-Calibrated Interval Estimates for Latent Variable Scores in Item Response Theory," Psychometrika, Springer;The Psychometric Society, vol. 83(2), pages 333-354, June.
    12. Zhehan Jiang & Jonathan Templin, 2019. "Gibbs Samplers for Logistic Item Response Models via the Pólya–Gamma Distribution: A Computationally Efficient Data-Augmentation Strategy," Psychometrika, Springer;The Psychometric Society, vol. 84(2), pages 358-374, June.
    13. Ping Chen & Chun Wang, 2021. "Using EM Algorithm for Finite Mixtures and Reformed Supplemented EM for MIRT Calibration," Psychometrika, Springer;The Psychometric Society, vol. 86(1), pages 299-326, March.
    14. Li Cai, 2010. "High-dimensional Exploratory Item Factor Analysis by A Metropolis–Hastings Robbins–Monro Algorithm," Psychometrika, Springer;The Psychometric Society, vol. 75(1), pages 33-57, March.
    15. Vitoratou, Silia & Ntzoufras, Ioannis & Moustaki, Irini, 2016. "Explaining the behavior of joint and marginal Monte Carlo estimators in latent variable models with independence assumptions," LSE Research Online Documents on Economics 57685, London School of Economics and Political Science, LSE Library.
    16. Gregory Camilli & Jean-Paul Fox, 2015. "An Aggregate IRT Procedure for Exploratory Factor Analysis," Journal of Educational and Behavioral Statistics, vol. 40(4), pages 377-401, August.
    17. Li Cai, 2010. "Metropolis-Hastings Robbins-Monro Algorithm for Confirmatory Item Factor Analysis," Journal of Educational and Behavioral Statistics, vol. 35(3), pages 307-335, June.
    18. Ernesto San Martín & Jean-Marie Rolin & Luis Castro, 2013. "Identification of the 1PL Model with Guessing Parameter: Parametric and Semi-parametric Results," Psychometrika, Springer;The Psychometric Society, vol. 78(2), pages 341-379, April.
    19. Scott Monroe, 2021. "Testing Latent Variable Distribution Fit in IRT Using Posterior Residuals," Journal of Educational and Behavioral Statistics, vol. 46(3), pages 374-398, June.
    20. Yang Liu & Jan Hannig, 2016. "Generalized Fiducial Inference for Binary Logistic Item Response Models," Psychometrika, Springer;The Psychometric Society, vol. 81(2), pages 290-324, June.
