Printed from https://ideas.repec.org/a/eee/ejores/v269y2018i3p1072-1085.html

Student and school performance across countries: A machine learning approach

Authors

  • Masci, Chiara
  • Johnes, Geraint
  • Agasisti, Tommaso

Abstract

In this paper, we develop and apply novel machine learning and statistical methods to analyse the determinants of students’ PISA 2015 test scores in nine countries: Australia, Canada, France, Germany, Italy, Japan, Spain, the UK and the USA. The aim is to identify which student characteristics are associated with test scores and which school characteristics are associated with school value-added (measured at school level). A specific aim of our approach is to explore non-linearities in the associations between covariates and test scores, as well as to model interactions between school-level factors in affecting results. To address these issues, we apply a two-stage methodology using flexible tree-based methods. In the first stage, we run multilevel regression trees to estimate school value-added. In the second stage, we relate the estimated school value-added to school-level variables by means of regression trees and boosting. Results show that while several student- and school-level characteristics are significantly associated with students’ achievements, there are marked differences across countries. The proposed approach allows an improved description of the structurally different educational production functions across countries.
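
The two-stage idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the data are synthetic, the variable names (`ses`, `z`, `true_va`) are hypothetical, single-split regression trees stand in for full trees, and stage 1 uses a crude alternation between a tree fit and per-school mean residuals as a stand-in for multilevel regression trees. Stage 2 is plain least-squares boosting with the same single-split trees.

```python
import random
from statistics import mean

random.seed(0)

def fit_stump(x, y):
    """Least-squares regression tree with one split (needs >= 2 distinct x values)."""
    best = None
    cuts = sorted(set(x))
    for lo, hi in zip(cuts, cuts[1:]):
        t = (lo + hi) / 2
        left = [yi for xi, yi in zip(x, y) if xi <= t]
        right = [yi for xi, yi in zip(x, y) if xi > t]
        ml, mr = mean(left), mean(right)
        sse = sum((v - ml) ** 2 for v in left) + sum((v - mr) ** 2 for v in right)
        if best is None or sse < best[0]:
            best = (sse, t, ml, mr)
    return best[1:]  # (threshold, left mean, right mean)

def predict(stump, xi):
    t, ml, mr = stump
    return ml if xi <= t else mr

def stage1(ses, score, school, n_iter=3):
    """Assumed simplification of a multilevel tree: alternate tree fit and school effects."""
    effect = {j: 0.0 for j in set(school)}
    for _ in range(n_iter):
        # (a) fit the tree to scores net of the current school effects
        adj = [y - effect[j] for y, j in zip(score, school)]
        tree = fit_stump(ses, adj)
        # (b) school effect = mean residual of that school's students
        resid = [y - predict(tree, s) for y, s in zip(score, ses)]
        for j in effect:
            effect[j] = mean(r for r, k in zip(resid, school) if k == j)
    return tree, effect

def boost(x, y, rounds=50, lr=0.1):
    """Least-squares boosting: each round fits a stump to the current residuals."""
    base = mean(y)
    pred = [base] * len(y)
    stumps = []
    for _ in range(rounds):
        resid = [yi - pi for yi, pi in zip(y, pred)]
        s = fit_stump(x, resid)
        stumps.append(s)
        pred = [pi + lr * predict(s, xi) for pi, xi in zip(pred, x)]
    return base, stumps, pred

# Synthetic data (illustrative only): 20 schools, 20 students each.
n_schools, n_students = 20, 20
z = [j / (n_schools - 1) for j in range(n_schools)]    # hypothetical school-level variable
true_va = [10.0 if zj > 0.5 else -10.0 for zj in z]    # value-added is non-linear in z
ses, score, school = [], [], []
for j in range(n_schools):
    for _ in range(n_students):
        s = random.gauss(0, 1)                         # hypothetical student covariate
        ses.append(s)
        school.append(j)
        score.append(500 + 30 * (s > 0) + true_va[j] + random.gauss(0, 5))

# Stage 1: estimate school value-added from student-level data.
tree, effect = stage1(ses, score, school)
va_hat = [effect[j] for j in range(n_schools)]

# Stage 2: relate estimated value-added to the school-level variable by boosting.
base, stumps, fitted = boost(z, va_hat)

for j in (0, n_schools - 1):
    print(f"school {j}: z={z[j]:.2f} va_hat={va_hat[j]:.2f} boosted fit={fitted[j]:.2f}")
```

Tree-based learners suit this setting because a split can pick up a threshold effect (here, the jump in value-added at `z = 0.5`) without the functional form being specified in advance; a production analysis would more plausibly use an established mixed-effects tree and boosting library than these hand-rolled helpers.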

Suggested Citation

  • Masci, Chiara & Johnes, Geraint & Agasisti, Tommaso, 2018. "Student and school performance across countries: A machine learning approach," European Journal of Operational Research, Elsevier, vol. 269(3), pages 1072-1085.
  • Handle: RePEc:eee:ejores:v:269:y:2018:i:3:p:1072-1085
    DOI: 10.1016/j.ejor.2018.02.031

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221718301462
    Download Restriction: Full text for ScienceDirect subscribers only


    References listed on IDEAS

    1. Steven G. Rivkin & Eric A. Hanushek & John F. Kain, 2005. "Teachers, Schools, and Academic Achievement," Econometrica, Econometric Society, vol. 73(2), pages 417-458, March.
    2. Hanushek, Eric A & Rivkin, Steven G & Taylor, Lori L, 1996. "Aggregation and the Estimated Effects of School Resources," The Review of Economics and Statistics, MIT Press, vol. 78(4), pages 611-627, November.
    3. Stephen W. Raudenbush, 1988. "Educational Applications of Hierarchical Linear Models: A Review," Journal of Educational and Behavioral Statistics, vol. 13(2), pages 85-116, June.
    4. Ian Plewis, 2011. "Contextual variations in ethnic group differences in educational attainments," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 174(2), pages 419-437, April.
    5. Masci, Chiara & Ieva, Francesca & Agasisti, Tommaso & Paganoni, Anna Maria, 2016. "Does class matter more than school? Evidence from a multilevel statistical analysis on Italian junior secondary school students," Socio-Economic Planning Sciences, Elsevier, vol. 54(C), pages 47-57.
    6. Hal R. Varian, 2014. "Big Data: New Tricks for Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 3-28, Spring.
    7. Tommaso Agasisti & Francesca Ieva & Anna Maria Paganoni, 2017. "Heterogeneity, school-effects and the North/South achievement gap in Italian secondary education: evidence from a three-level mixed model," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 26(1), pages 157-180, March.
    8. Joshua D. Angrist & Victor Lavy, 1999. "Using Maimonides' Rule to Estimate the Effect of Class Size on Scholastic Achievement," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 114(2), pages 533-575.
    9. Fitzpatrick, Trevor & Mues, Christophe, 2016. "An empirical comparison of classification algorithms for mortgage default prediction: evidence from a distressed mortgage market," European Journal of Operational Research, Elsevier, vol. 249(2), pages 427-439.
    10. C. Masci & F. Ieva & T. Agasisti & A. M. Paganoni, 2017. "Bivariate multilevel models for the analysis of mathematics and reading pupils' achievements," Journal of Applied Statistics, Taylor & Francis Journals, vol. 44(7), pages 1296-1317, May.
    11. Sendhil Mullainathan & Jann Spiess, 2017. "Machine Learning: An Applied Econometric Approach," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 87-106, Spring.
    12. Savona, Roberto, 2014. "Hedge fund systemic risk signals," European Journal of Operational Research, Elsevier, vol. 236(1), pages 282-291.

    Citations



    Cited by:

    1. Bottmer, Lea & Croux, Christophe & Wilms, Ines, 2022. "Sparse regression for large data sets with outliers," European Journal of Operational Research, Elsevier, vol. 297(2), pages 782-794.
    2. Giménez, Víctor & Thieme, Claudio & Prior, Diego & Tortosa-Ausina, Emili, 2022. "Evaluation and determinants of preschool effectiveness in Chile," Socio-Economic Planning Sciences, Elsevier, vol. 81(C).
    3. Tsionas, Mike, 2022. "Efficiency estimation using probabilistic regression trees with an application to Chilean manufacturing industries," International Journal of Production Economics, Elsevier, vol. 249(C).
    4. Van Nguyen, Truong & Zhou, Li & Chong, Alain Yee Loong & Li, Boying & Pu, Xiaodie, 2020. "Predicting customer demand for remanufactured products: A data-mining approach," European Journal of Operational Research, Elsevier, vol. 281(3), pages 543-558.
    5. Camanho, Ana S. & Varriale, Luisa & Barbosa, Flávia & Sobral, Thiago, 2021. "Performance assessment of upper secondary schools in Italian regions using a circular pseudo-Malmquist index," European Journal of Operational Research, Elsevier, vol. 289(3), pages 1188-1208.
    6. Alice Bertoletti & Marta Cannistrà & Melisa Diaz Lema & Chiara Masci & Anna Mergoni & Lidia Rossi & Mara Soncin, 2023. "The Determinants of Mathematics Achievement: A Gender Perspective Using Multilevel Random Forest," Economies, MDPI, vol. 11(2), pages 1-20, January.
    7. Joyce de Souza Zanirato Maia & Ana Paula Arantes Bueno & Joao Ricardo Sato, 2023. "Applications of Artificial Intelligence Models in Educational Analytics and Decision Making: A Systematic Review," World, MDPI, vol. 4(2), pages 1-26, May.
    8. Antonella D’Agostino & Francesco Schirripa Spagnolo & Nicola Salvati, 2022. "Studying the relationship between anxiety and school achievement: evidence from PISA data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 31(1), pages 1-20, March.
    9. Yong Shi & Wei Dai & Wen Long & Bo Li, 2021. "Deep Kernel Gaussian Process Based Financial Market Predictions," Papers 2105.12293, arXiv.org.
    10. Rebai, Sonia & Ben Yahia, Fatma & Essid, Hédi, 2020. "A graphically based machine learning approach to predict secondary schools performance in Tunisia," Socio-Economic Planning Sciences, Elsevier, vol. 70(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chiara Masci & Francesca Ieva & Tommaso Agasisti & Anna Maria Paganoni, 2021. "Evaluating class and school effects on the joint student achievements in different subjects: a bivariate semiparametric model with random coefficients," Computational Statistics, Springer, vol. 36(4), pages 2337-2377, December.
    2. Arthur Charpentier & Emmanuel Flachaire & Antoine Ly, 2017. "Économétrie et Machine Learning," Papers 1708.06992, arXiv.org, revised Mar 2018.
    3. Filmer, Deon P. & Nahata, Vatsal & Sabarwal, Shwetlena, 2021. "Preparation, Practice, and Beliefs: A Machine Learning Approach to Understanding Teacher Effectiveness," Policy Research Working Paper Series 9847, The World Bank.
    4. Bernal, Pedro & Mittag, Nikolas & Qureshi, Javaeria A., 2016. "Estimating effects of school quality using multiple proxies," Labour Economics, Elsevier, vol. 39(C), pages 1-10.
    5. Meghir, Costas & Rivkin, Steven, 2011. "Econometric Methods for Research in Education," Handbook of the Economics of Education, in: Erik Hanushek & Stephen Machin & Ludger Woessmann (ed.), Handbook of the Economics of Education, edition 1, volume 3, chapter 1, pages 1-87, Elsevier.
    6. Chen, Shunqin & Guo, Zhengfeng & Zhao, Xinlei, 2021. "Predicting mortgage early delinquency with machine learning methods," European Journal of Operational Research, Elsevier, vol. 290(1), pages 358-372.
    7. Eric A. Hanushek, "undated". "The Evidence on Class Size," Wallis Working Papers WP10, University of Rochester - Wallis Institute of Political Economy.
    8. Arthur Charpentier & Emmanuel Flachaire & Antoine Ly, 2018. "Économétrie & Machine Learning," Working Papers hal-01568851, HAL.
    9. Dante Contreras & Daniel Hojman & Manuel Matas & Patricio Rodríguez & Nicolás Suárez, 2018. "The impact of commuting time over educational achievement: A machine learning approach," Working Papers wp472, University of Chile, Department of Economics.
    10. Maria Iacovou, 2002. "Class Size in the Early Years: Is Smaller Really Better?," Education Economics, Taylor & Francis Journals, vol. 10(3), pages 261-290.
    11. Michael Bates & Michael Dinerstein & Andrew C. Johnston & Isaac Sorkin, 2022. "Teacher Labor Market Equilibrium and Student Achievement," CESifo Working Paper Series 9551, CESifo.
    12. Erik Heilmann & Janosch Henze & Heike Wetzel, 2021. "Machine learning in energy forecasts with an application to high frequency electricity consumption data," MAGKS Papers on Economics 202135, Philipps-Universität Marburg, Faculty of Business Administration and Economics, Department of Economics (Volkswirtschaftliche Abteilung).
    13. Manuel J. García Rodríguez & Vicente Rodríguez Montequín & Francisco Ortega Fernández & Joaquín M. Villanueva Balsera, 2019. "Public Procurement Announcements in Spain: Regulations, Data Analysis, and Award Price Estimator Using Machine Learning," Complexity, Hindawi, vol. 2019, pages 1-20, November.
    14. Thompson, Paul N., 2021. "Is four less than five? Effects of four-day school weeks on student achievement in Oregon," Journal of Public Economics, Elsevier, vol. 193(C).
    15. Galdo, Virgilio & Li, Yue & Rama, Martin, 2021. "Identifying urban areas by combining human judgment and machine learning: An application to India," Journal of Urban Economics, Elsevier, vol. 125(C).
    16. Joshua D. Angrist & Jörn-Steffen Pischke, 2010. "The Credibility Revolution in Empirical Economics: How Better Research Design Is Taking the Con out of Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 24(2), pages 3-30, Spring.
    17. Madio, Leonardo & Principe, Francesco, 2023. "Who supports liberal policies? A tale of two referendums in Italy," Economics Letters, Elsevier, vol. 232(C).
    18. Gourley, Patrick, 2021. "Back to basics: How reading the text and taking notes improves learning," International Review of Economics Education, Elsevier, vol. 37(C).
    19. Arthur Blouin & Julian Dyer, 2021. "How Cultures Converge: An Empirical Investigation of Trade and Linguistic Exchange," Working Papers tecipa-691, University of Toronto, Department of Economics.
    20. Mona Aghdaee & Bonny Parkinson & Kompal Sinha & Yuanyuan Gu & Rajan Sharma & Emma Olin & Henry Cutler, 2022. "An examination of machine learning to map non‐preference based patient reported outcome measures to health state utility values," Health Economics, John Wiley & Sons, Ltd., vol. 31(8), pages 1525-1557, August.
