IDEAS home Printed from https://ideas.repec.org/a/sae/jedbes/v48y2023i2p220-243.html
   My bibliography  Save this article

Deep Reinforcement Learning for Adaptive Learning Systems

Author

Listed:
  • Xiao Li
  • Hanchen Xu
  • Jinming Zhang

    (University of Illinois at Urbana-Champaign)

  • Hua-hua Chang

    (Purdue University)

Abstract

The adaptive learning problem concerns how to create an individualized learning plan (also referred to as a learning policy) that chooses the most appropriate learning materials based on a learner’s latent traits. In this article, we study an important yet less-addressed adaptive learning problem—one that assumes continuous latent traits. Specifically, we formulate the adaptive learning problem as a Markov decision process. We assume latent traits to be continuous with an unknown transition model and apply a model-free deep reinforcement learning algorithm—the deep Q-learning algorithm—that can effectively find the optimal learning policy from data on learners’ learning process without knowing the actual transition model of the learners’ continuous latent traits. To efficiently utilize available data, we also develop a transition model estimator that emulates the learner’s learning process using neural networks. The transition model estimator can be used in the deep Q-learning algorithm so that it can more efficiently discover the optimal learning policy for a learner. Numerical simulation studies verify that the proposed algorithm is very efficient in finding a good learning policy. Especially with the aid of a transition model estimator, it can find the optimal learning policy after training using a small number of learners.

Suggested Citation

  • Xiao Li & Hanchen Xu & Jinming Zhang & Hua-hua Chang, 2023. "Deep Reinforcement Learning for Adaptive Learning Systems," Journal of Educational and Behavioral Statistics, , vol. 48(2), pages 220-243, April.
  • Handle: RePEc:sae:jedbes:v:48:y:2023:i:2:p:220-243
    DOI: 10.3102/10769986221129847
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.3102/10769986221129847
    Download Restriction: no

    File URL: https://libkey.io/10.3102/10769986221129847?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Hua-Hua Chang, 2015. "Psychometrics Behind Computerized Adaptive Testing," Psychometrika, Springer;The Psychometric Society, vol. 80(1), pages 1-20, March.
    2. Susan Whitely, 1980. "Multicomponent latent trait models for ability tests," Psychometrika, Springer;The Psychometric Society, vol. 45(4), pages 479-494, December.
    3. Geoff Masters, 1982. "A rasch model for partial credit scoring," Psychometrika, Springer;The Psychometric Society, vol. 47(2), pages 149-174, June.
    4. Chun Wang, 2015. "On Latent Trait Estimation in Multidimensional Compensatory Item Response Models," Psychometrika, Springer;The Psychometric Society, vol. 80(2), pages 428-449, June.
    5. Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
    6. Jinming Zhang & Minge Xie & Xiaolan Song & Ting Lu, 2011. "Investigating the Impact of Uncertainty About Item Parameters on Ability Estimation," Psychometrika, Springer;The Psychometric Society, vol. 76(1), pages 97-118, January.
    7. Jinming Zhang, 2013. "A Procedure for Dimensionality Analyses of Response Data from Various Test Designs," Psychometrika, Springer;The Psychometric Society, vol. 78(1), pages 37-58, January.
    8. Thomas Warm, 1989. "Weighted likelihood estimation of ability in item response theory," Psychometrika, Springer;The Psychometric Society, vol. 54(3), pages 427-450, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. David Magis & Norman Verhelst, 2017. "On the Finiteness of the Weighted Likelihood Estimator of Ability," Psychometrika, Springer;The Psychometric Society, vol. 82(3), pages 637-647, September.
    2. Janna Niens & Lisa Richter-Beuschel & Tobias C. Stubbe & Susanne Bögeholz, 2021. "Procedural Knowledge of Primary School Teachers in Madagascar for Teaching and Learning towards Land-Use- and Health-Related Sustainable Development Goals," Sustainability, MDPI, vol. 13(16), pages 1-36, August.
    3. Chun Wang, 2015. "On Latent Trait Estimation in Multidimensional Compensatory Item Response Models," Psychometrika, Springer;The Psychometric Society, vol. 80(2), pages 428-449, June.
    4. Marko Böhm & Jan Barkmann & Sabina Eggert & Claus H. Carstensen & Susanne Bögeholz, 2020. "Quantitative Modelling and Perspective Taking: Two Competencies of Decision Making for Sustainable Development," Sustainability, MDPI, vol. 12(17), pages 1-32, August.
    5. Sandip Sinharay, 2015. "The Asymptotic Distribution of Ability Estimates," Journal of Educational and Behavioral Statistics, , vol. 40(5), pages 511-528, October.
    6. David Andrich, 2010. "Sufficiency and Conditional Estimation of Person Parameters in the Polytomous Rasch Model," Psychometrika, Springer;The Psychometric Society, vol. 75(2), pages 292-308, June.
    7. Anders Skrondal & Sophia Rabe-Hesketh, 2022. "The Role of Conditional Likelihoods in Latent Variable Modeling," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 799-834, September.
    8. Georg Gittler & Gerhard Fischer, 2011. "IRT-Based Measurement of Short-Term Changes of Ability, With an Application to Assessing the “Mozart Effectâ€," Journal of Educational and Behavioral Statistics, , vol. 36(1), pages 33-75, February.
    9. Robert Zwitser & Gunter Maris, 2015. "Conditional Statistical Inference with Multistage Testing Designs," Psychometrika, Springer;The Psychometric Society, vol. 80(1), pages 65-84, March.
    10. Ogasawara, Haruhiko, 2013. "Asymptotic cumulants of ability estimators using fallible item parameters," Journal of Multivariate Analysis, Elsevier, vol. 119(C), pages 144-162.
    11. David Magis & Gilles Raîche & Sébastien Béland, 2012. "A Didactic Presentation of Snijders’s lz* Index of Person Fit With Emphasis on Response Model Selection and Ability Estimation," Journal of Educational and Behavioral Statistics, , vol. 37(1), pages 57-81, February.
    12. Fumiko Samejima, 1997. "Departure from normal assumptions: A promise for future psychometrics with substantive mathematical modeling," Psychometrika, Springer;The Psychometric Society, vol. 62(4), pages 471-493, December.
    13. Tabea Feseker & Timo Gnambs & Cordula Artelt, 2021. "Setting a standard for low reading proficiency: A comparison of the bookmark procedure and constrained mixture Rasch model," PLOS ONE, Public Library of Science, vol. 16(11), pages 1-22, November.
    14. Chun Wang & Gongjun Xu & Xue Zhang, 2019. "Correction for Item Response Theory Latent Trait Measurement Error in Linear Mixed Effects Models," Psychometrika, Springer;The Psychometric Society, vol. 84(3), pages 673-700, September.
    15. David Magis, 2015. "A Note on Weighted Likelihood and Jeffreys Modal Estimation of Proficiency Levels in Polytomous Item Response Models," Psychometrika, Springer;The Psychometric Society, vol. 80(1), pages 200-204, March.
    16. Chengyu Cui & Chun Wang & Gongjun Xu, 2024. "Variational Estimation for Multidimensional Generalized Partial Credit Model," Psychometrika, Springer;The Psychometric Society, vol. 89(3), pages 929-957, September.
    17. Maxwell Hong & Lizhen Lin & Ying Cheng, 2021. "Asymptotically Corrected Person Fit Statistics for Multidimensional Constructs with Simple Structure and Mixed Item Types," Psychometrika, Springer;The Psychometric Society, vol. 86(2), pages 464-488, June.
    18. César Merino-Soto & Gina Chávez-Ventura & Verónica López-Fernández & Guillermo M. Chans & Filiberto Toledano-Toledano, 2022. "Learning Self-Regulation Questionnaire (SRQ-L): Psychometric and Measurement Invariance Evidence in Peruvian Undergraduate Students," Sustainability, MDPI, vol. 14(18), pages 1-17, September.
    19. Tulika Saha & Sriparna Saha & Pushpak Bhattacharyya, 2020. "Towards sentiment aided dialogue policy learning for multi-intent conversations using hierarchical reinforcement learning," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-28, July.
    20. Nana Kim & Daniel M. Bolt & James Wollack, 2022. "Noncompensatory MIRT For Passage-Based Tests," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 992-1009, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:jedbes:v:48:y:2023:i:2:p:220-243. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.