IDEAS home Printed from https://ideas.repec.org/a/spr/jcsosc/v7y2024i2d10.1007_s42001-024-00281-8.html
   My bibliography  Save this article

Comparing nine machine learning classifiers for school-dropouts using a revised performance measure

Author

Listed:
  • Sahar Saeed Rezk

    (Cairo University)

  • Kamal Samy Selim

    (Cairo University)

Abstract

Addressing the pervasive issue of school-dropout in Egypt is imperative for advancing the country's educational system and fostering its social and economic progress. Recently, there is a growing interest in leveraging Machine Learning techniques as proactive tools for identifying students at-risk of dropping out so as to carry out timely interventions. This study implements nine supervised Machine Learning algorithms, namely Decision Trees, K-Nearest Neighbours, Logistic Regression, Naïve Bayes, Support Vector Machines, AdaBoost, Bagging, Random Forest, and Stacking, and compares their results to figure out the best performing one for classifying at-risk students in the Egyptian compulsory schools. Utilizing a dataset of a nationally representative sample survey, 52 meticulous classification experiments combining classifiers and resampling techniques are conducted. For the classifiers admitting hyper-parameter optimization, 32 initial parameter settings entailing parameter-space searches, using GridSearch heuristic algorithm, are tried to determine the best performing configuration models for classification. Rather than relying on disparate performance measures for comparing the resulting classifications, such as accuracy and F-score, this research proposes the weighted harmonic mean of several performance measures as a unified evaluation criterion. By resorting to this single criterion for comparisons, the Support Vector Machines classifier, conjoint with Random Under-Sampling and Synthetic Minority Over-sampling Technique for treating class imbalance, is evaluated as the best performing classification model. Because of its ability to provide classification rules in explicit functional forms, Support Vector Machines enables interpreting the embedded features in a similar way like the Logistic Regression classifier. Consequently, the best results reached could guide to develop an early predicting system aiming to support the efforts to eradicate the persisting problem of school-dropouts in Egypt over time.

Suggested Citation

  • Sahar Saeed Rezk & Kamal Samy Selim, 2024. "Comparing nine machine learning classifiers for school-dropouts using a revised performance measure," Journal of Computational Social Science, Springer, vol. 7(2), pages 1555-1597, October.
  • Handle: RePEc:spr:jcsosc:v:7:y:2024:i:2:d:10.1007_s42001-024-00281-8
    DOI: 10.1007/s42001-024-00281-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s42001-024-00281-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s42001-024-00281-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Langsten, Ray & Hassan, Tahra, 2018. "Primary education completion in Egypt: Trends and determinants," International Journal of Educational Development, Elsevier, vol. 59(C), pages 136-145.
    2. Annalina Sarra & Lara Fontanella & Simone Zio, 2019. "Identifying Students at Risk of Academic Failure Within the Educational Data Mining Framework," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 146(1), pages 41-60, November.
    3. Hazal Colak Oz & Çiçek Güven & Gonzalo Nápoles, 2023. "School dropout prediction and feature importance exploration in Malawi using household panel data: machine learning approach," Journal of Computational Social Science, Springer, vol. 6(1), pages 245-287, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. El-Saadani, Somaya & Metwally, Soha, 2019. "Inequality of opportunity linked to disability in school enrollment among youth: Evidence from Egypt," International Journal of Educational Development, Elsevier, vol. 67(C), pages 73-84.
    2. Aragonés-González, María & Rosser-Limiñana, Ana & Gil-González, Diana, 2020. "Coeducation and gender equality in education systems: A scoping review," Children and Youth Services Review, Elsevier, vol. 111(C).
    3. Raghul Gandhi Venkatesan & Bagavandas Mappillairaju, 2024. "Early student dropout detection in Indian secondary education with special reference to selected districts in Tamil Nadu: a machine learning-based survival analysis approach," Journal of Computational Social Science, Springer, vol. 7(3), pages 2309-2331, December.
    4. Malte Sandner & Alexander Patzina & Silke Anger & Sarah Bernhard & Hans Dietrich, 2023. "The COVID-19 pandemic, well-being, and transitions to post-secondary education," Review of Economics of the Household, Springer, vol. 21(2), pages 461-483, June.
    5. Manhisse, Nelson & Ogawa, Keiichi, 2024. "Smallholder households and children’s schooling in primary education in Mozambique," International Journal of Educational Development, Elsevier, vol. 105(C).
    6. Usala, Cristian & Primerano, Ilaria & Santelli, Francesco & Ragozini, Giancarlo, 2024. "The more the better? How degree programs’ variety affects university students’ churn risk," Socio-Economic Planning Sciences, Elsevier, vol. 94(C).
    7. Rafaella L. S. Nascimento & Roberta A. de A. Fagundes & Renata M. C. R. Souza, 2022. "Statistical Learning for Predicting School Dropout in Elementary Education: A Comparative Study," Annals of Data Science, Springer, vol. 9(4), pages 801-828, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jcsosc:v:7:y:2024:i:2:d:10.1007_s42001-024-00281-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.