IDEAS home Printed from https://ideas.repec.org/a/spr/psycho/v89y2024i4d10.1007_s11336-024-09978-1.html
   My bibliography  Save this article

Optimizing Large-Scale Educational Assessment with a “Divide-and-Conquer” Strategy: Fast and Efficient Distributed Bayesian Inference in IRT Models

Author

Listed:
  • Sainan Xu

    (Northeast Normal University)

  • Jing Lu

    (Northeast Normal University)

  • Jiwei Zhang

    (Northeast Normal University)

  • Chun Wang

    (University of Washington)

  • Gongjun Xu

    (University of Michigan)

Abstract

With the growing attention on large-scale educational testing and assessment, the ability to process substantial volumes of response data becomes crucial. Current estimation methods within item response theory (IRT), despite their high precision, often pose considerable computational burdens with large-scale data, leading to reduced computational speed. This study introduces a novel “divide- and-conquer” parallel algorithm built on the Wasserstein posterior approximation concept, aiming to enhance computational speed while maintaining accurate parameter estimation. This algorithm enables drawing parameters from segmented data subsets in parallel, followed by an amalgamation of these parameters via Wasserstein posterior approximation. Theoretical support for the algorithm is established through asymptotic optimality under certain regularity assumptions. Practical validation is demonstrated using real-world data from the Programme for International Student Assessment. Ultimately, this research proposes a transformative approach to managing educational big data, offering a scalable, efficient, and precise alternative that promises to redefine traditional practices in educational assessments.

Suggested Citation

  • Sainan Xu & Jing Lu & Jiwei Zhang & Chun Wang & Gongjun Xu, 2024. "Optimizing Large-Scale Educational Assessment with a “Divide-and-Conquer” Strategy: Fast and Efficient Distributed Bayesian Inference in IRT Models," Psychometrika, Springer;The Psychometric Society, vol. 89(4), pages 1119-1147, December.
  • Handle: RePEc:spr:psycho:v:89:y:2024:i:4:d:10.1007_s11336-024-09978-1
    DOI: 10.1007/s11336-024-09978-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11336-024-09978-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11336-024-09978-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Wim van der Linden, 2007. "A Hierarchical Framework for Modeling Speed and Accuracy on Test Items," Psychometrika, Springer;The Psychometric Society, vol. 72(3), pages 287-308, September.
    2. Steven Andrew Culpepper, 2016. "Revisiting the 4-Parameter Item Response Model: Bayesian Estimation and Application," Psychometrika, Springer;The Psychometric Society, vol. 81(4), pages 1142-1163, December.
    3. Matias Quiroz & Robert Kohn & Mattias Villani & Minh-Ngoc Tran, 2019. "Speeding Up MCMC by Efficient Data Subsampling," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(526), pages 831-843, April.
    4. R. Bock & Murray Aitkin, 1981. "Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm," Psychometrika, Springer;The Psychometric Society, vol. 46(4), pages 443-459, December.
    5. Nicholas G. Polson & James G. Scott & Jesse Windle, 2013. "Bayesian Inference for Logistic Models Using Pólya--Gamma Latent Variables," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(504), pages 1339-1349, December.
    6. Håvard Rue & Sara Martino & Nicolas Chopin, 2009. "Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(2), pages 319-392, April.
    7. A. Béguin & C. Glas, 2001. "MCMC estimation and some model-fit analysis of multidimensional IRT models," Psychometrika, Springer;The Psychometric Society, vol. 66(4), pages 541-561, December.
    8. James Joseph Balamuta & Steven Andrew Culpepper, 2022. "Exploratory Restricted Latent Class Models with Monotonicity Requirements under PÒLYA–GAMMA Data Augmentation," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 903-945, September.
    9. Zhehan Jiang & Jonathan Templin, 2019. "Gibbs Samplers for Logistic Item Response Models via the Pólya–Gamma Distribution: A Computationally Efficient Data-Augmentation Strategy," Psychometrika, Springer;The Psychometric Society, vol. 84(2), pages 358-374, June.
    10. Stephen Schilling & R. Bock, 2005. "High-dimensional maximum marginal likelihood item factor analysis by adaptive quadrature," Psychometrika, Springer;The Psychometric Society, vol. 70(3), pages 533-555, September.
    11. Cheng Li & Sanvesh Srivastava & David B. Dunson, 2017. "Simple, scalable and accurate posterior interval estimation," Biometrika, Biometrika Trust, vol. 104(3), pages 665-680.
    12. Chun Wang & Gongjun Xu & Zhuoran Shang, 2018. "A Two-Stage Approach to Differentiating Normal and Aberrant Behavior in Computer Based Testing," Psychometrika, Springer;The Psychometric Society, vol. 83(1), pages 223-254, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael Edwards, 2010. "A Markov Chain Monte Carlo Approach to Confirmatory Item Factor Analysis," Psychometrika, Springer;The Psychometric Society, vol. 75(3), pages 474-497, September.
    2. Christian A. Gregory, 2020. "Are We Underestimating Food Insecurity? Partial Identification with a Bayesian 4-Parameter IRT Model," Journal of Classification, Springer;The Classification Society, vol. 37(3), pages 632-655, October.
    3. Battauz, Michela & Vidoni, Paolo, 2022. "A likelihood-based boosting algorithm for factor analysis models with binary data," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    4. Christopher J. Urban & Daniel J. Bauer, 2021. "A Deep Learning Algorithm for High-Dimensional Exploratory Item Factor Analysis," Psychometrika, Springer;The Psychometric Society, vol. 86(1), pages 1-29, March.
    5. Zhehan Jiang & Jonathan Templin, 2019. "Gibbs Samplers for Logistic Item Response Models via the Pólya–Gamma Distribution: A Computationally Efficient Data-Augmentation Strategy," Psychometrika, Springer;The Psychometric Society, vol. 84(2), pages 358-374, June.
    6. Chanjin Zheng & Shaoyang Guo & Justin L Kern, 2021. "Fast Bayesian Estimation for the Four-Parameter Logistic Model (4PLM)," SAGE Open, , vol. 11(4), pages 21582440211, October.
    7. Bianconcini, Silvia & Cagnone, Silvia, 2012. "Estimation of generalized linear latent variable models via fully exponential Laplace approximation," Journal of Multivariate Analysis, Elsevier, vol. 112(C), pages 183-193.
    8. Li Cai, 2010. "Metropolis-Hastings Robbins-Monro Algorithm for Confirmatory Item Factor Analysis," Journal of Educational and Behavioral Statistics, , vol. 35(3), pages 307-335, June.
    9. Yunxiao Chen & Xiaoou Li & Siliang Zhang, 2019. "Joint Maximum Likelihood Estimation for High-Dimensional Exploratory Item Factor Analysis," Psychometrika, Springer;The Psychometric Society, vol. 84(1), pages 124-146, March.
    10. Li Cai, 2010. "High-dimensional Exploratory Item Factor Analysis by A Metropolis–Hastings Robbins–Monro Algorithm," Psychometrika, Springer;The Psychometric Society, vol. 75(1), pages 33-57, March.
    11. Jiwei Zhang & Zhaoyuan Zhang & Jian Tao, 2021. "A Bayesian algorithm based on auxiliary variables for estimating GRM with non-ignorable missing data," Computational Statistics, Springer, vol. 36(4), pages 2643-2669, December.
    12. Shelby Haberman & Sandip Sinharay, 2010. "Reporting of Subscores Using Multidimensional Item Response Theory," Psychometrika, Springer;The Psychometric Society, vol. 75(2), pages 209-227, June.
    13. Steven Andrew Culpepper & James Joseph Balamuta, 2017. "A Hierarchical Model for Accuracy and Choice on Standardized Tests," Psychometrika, Springer;The Psychometric Society, vol. 82(3), pages 820-845, September.
    14. Vitoratou, Silia & Ntzoufras, Ioannis & Moustaki, Irini, 2016. "Explaining the behavior of joint and marginal Monte Carlo estimators in latent variable models with independence assumptions," LSE Research Online Documents on Economics 57685, London School of Economics and Political Science, LSE Library.
    15. Björn Andersson & Tao Xin, 2021. "Estimation of Latent Regression Item Response Theory Models Using a Second-Order Laplace Approximation," Journal of Educational and Behavioral Statistics, , vol. 46(2), pages 244-265, April.
    16. Yang Liu, 2020. "A Riemannian Optimization Algorithm for Joint Maximum Likelihood Estimation of High-Dimensional Exploratory Item Factor Analysis," Psychometrika, Springer;The Psychometric Society, vol. 85(2), pages 439-468, June.
    17. Gregory Camilli & Jean-Paul Fox, 2015. "An Aggregate IRT Procedure for Exploratory Factor Analysis," Journal of Educational and Behavioral Statistics, , vol. 40(4), pages 377-401, August.
    18. Li Cai, 2010. "A Two-Tier Full-Information Item Factor Analysis Model with Applications," Psychometrika, Springer;The Psychometric Society, vol. 75(4), pages 581-612, December.
    19. An, Xinming & Bentler, Peter M., 2012. "Efficient direct sampling MCEM algorithm for latent variable models with binary responses," Computational Statistics & Data Analysis, Elsevier, vol. 56(2), pages 231-244.
    20. Yi-Hsuan Lee & Zhiliang Ying, 2015. "A Mixture Cure-Rate Model for Responses and Response Times in Time-Limit Tests," Psychometrika, Springer;The Psychometric Society, vol. 80(3), pages 748-775, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:psycho:v:89:y:2024:i:4:d:10.1007_s11336-024-09978-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.