Predicting University Students’ Academic Success and Major Using Random Forests

Authors

  • Cédric Beaulac (University of Toronto)
  • Jeffrey S. Rosenthal (University of Toronto)

Abstract

This article analyses a large data set containing every course taken by every undergraduate student at a major Canadian university over 10 years. Modern machine learning algorithms can use large data sets to build useful tools for the data provider, in this case the university. Two classifiers are constructed using random forests. The first uses the courses a student completed in their first two semesters to predict whether they will obtain an undergraduate degree. The second, for students who completed a program, predicts their major, again from the first few courses in which they registered. A classification tree is an intuitive and powerful classifier, and building a random forest of trees improves on it. Random forests also allow for reliable variable importance measurements. These measures indicate which variables are useful to the classifiers and can be used to better understand what is statistically related to the students’ situation. The results are two accurate classifiers and a variable importance analysis that provides useful information to university administrations.
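To make the abstract's ingredients concrete, the following is a minimal sketch of a random forest classifier with a variable importance measure, written in plain Python. It is not the authors' implementation. The data are synthetic, the feature names (mean grade, credits, science courses, noise) and the toy rule "degree completed if mean grade is at least 60" are assumptions made purely for illustration, and permutation importance is used as one common importance measure, since the abstract does not say which measure the paper relies on.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(X, y, features):
    """Return the (feature, threshold) that lowers weighted Gini most, or None."""
    best, best_score = None, gini(y)
    for f in features:
        for t in sorted({row[f] for row in X}):
            left = [y[i] for i, row in enumerate(X) if row[f] <= t]
            right = [y[i] for i, row in enumerate(X) if row[f] > t]
            if not left or not right:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
            if score < best_score - 1e-12:
                best, best_score = (f, t), score
    return best

def grow_tree(X, y, rng, n_try, depth=0, max_depth=5):
    """Grow a classification tree, drawing n_try candidate features per node."""
    split = None
    if depth < max_depth and len(set(y)) > 1:
        split = best_split(X, y, rng.sample(range(len(X[0])), n_try))
    if split is None:
        return Counter(y).most_common(1)[0][0]  # leaf: majority class
    f, t = split
    li = [i for i, row in enumerate(X) if row[f] <= t]
    ri = [i for i, row in enumerate(X) if row[f] > t]
    return (f, t,
            grow_tree([X[i] for i in li], [y[i] for i in li], rng, n_try, depth + 1, max_depth),
            grow_tree([X[i] for i in ri], [y[i] for i in ri], rng, n_try, depth + 1, max_depth))

def predict_tree(tree, row):
    """Follow splits until a leaf (a plain label) is reached."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if row[f] <= t else right
    return tree

def fit_forest(X, y, n_trees=25, seed=0):
    """Fit each tree on a bootstrap sample of the training rows."""
    rng = random.Random(seed)
    n_try = max(1, int(len(X[0]) ** 0.5))  # sqrt rule for candidate features
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(X)) for _ in X]
        forest.append(grow_tree([X[i] for i in idx], [y[i] for i in idx], rng, n_try))
    return forest

def predict_forest(forest, row):
    """Majority vote over the trees."""
    return Counter(predict_tree(t, row) for t in forest).most_common(1)[0][0]

def accuracy(forest, X, y):
    return sum(predict_forest(forest, r) == yi for r, yi in zip(X, y)) / len(y)

def permutation_importance(forest, X, y, feature, seed=0):
    """Drop in accuracy after shuffling one feature column."""
    rng = random.Random(seed)
    col = [row[feature] for row in X]
    rng.shuffle(col)
    X_perm = [row[:feature] + [v] + row[feature + 1:] for row, v in zip(X, col)]
    return accuracy(forest, X, y) - accuracy(forest, X_perm, y)

def make_data(n, seed):
    """Synthetic rows: [mean_grade, credits, science_courses, noise] (hypothetical)."""
    rng = random.Random(seed)
    X = [[rng.randint(30, 100), rng.randint(6, 10), rng.randint(0, 5), rng.randint(0, 9)]
         for _ in range(n)]
    y = [1 if row[0] >= 60 else 0 for row in X]  # assumed toy rule, not a finding
    return X, y

X_train, y_train = make_data(150, seed=1)
X_test, y_test = make_data(100, seed=2)
forest = fit_forest(X_train, y_train, n_trees=25, seed=0)
test_accuracy = accuracy(forest, X_test, y_test)
importances = [permutation_importance(forest, X_test, y_test, f) for f in range(4)]
print("test accuracy:", test_accuracy)
print("permutation importances:", importances)
```

In this toy setting only the first feature determines the label, so shuffling it should reduce accuracy far more than shuffling any other column. With real course records the importances would instead suggest which early courses or grades are statistically related to the outcome, which is the kind of reading the abstract describes.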

Suggested Citation

  • Cédric Beaulac & Jeffrey S. Rosenthal, 2019. "Predicting University Students’ Academic Success and Major Using Random Forests," Research in Higher Education, Springer;Association for Institutional Research, vol. 60(7), pages 1048-1064, November.
  • Handle: RePEc:spr:reihed:v:60:y:2019:i:7:d:10.1007_s11162-019-09546-y
    DOI: 10.1007/s11162-019-09546-y

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11162-019-09546-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.


    Citations

    Cited by:

    1. Zhichao Wang & Valentin Zelenyuk, 2021. "Performance Analysis of Hospitals in Australia and its Peers: A Systematic Review," CEPA Working Papers Series WP012021, School of Economics, University of Queensland, Australia.
    2. Ali Bakdur & Fumito Masui & Michal Ptaszynski, 2021. "Predicting Increase in Demand for Public Buses in University Students Daily Life Needs: Case Study Based on a City in Japan," Sustainability, MDPI, vol. 13(9), pages 1-28, May.
    3. Hani Brdesee & Wafaa Alsaggaf, 2021. "Is There a Real Need for the Preparatory Years in Higher Education? An Educational Data Analysis for College and Future Career Readiness," Social Sciences, MDPI, vol. 10(10), pages 1-16, October.
    4. Badiee, Aghdas & Moshtari, Mohammad & Berenguer, Gemma, 2024. "A systematic review of operations research and management science modeling techniques in the study of higher education institutions," Socio-Economic Planning Sciences, Elsevier, vol. 93(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael C Herron & Zachary D Markovich, 2017. "Student sorting and implications for grade inflation," Rationality and Society, vol. 29(3), pages 355-386, August.
    2. Veronica Minaya, 2020. "Do Differential Grading Standards Across Fields Matter for Major Choice? Evidence from a Policy Change in Florida," Research in Higher Education, Springer;Association for Institutional Research, vol. 61(8), pages 943-965, December.
    3. Danilowicz-Gösele, Kamila, 2016. ""A" is the aim?," University of Göttingen Working Papers in Economics 291, University of Goettingen, Department of Economics.
    4. Ralph Stinebrickner & Todd R. Stinebrickner, 2014. "A Major in Science? Initial Beliefs and Final Outcomes for College Major and Dropout," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 81(1), pages 426-472.
    5. Jehiel, Philippe & Leduc, Mathieu V., 2024. "Can affirmative action policies be inefficiently persistent?," European Economic Review, Elsevier, vol. 166(C).
    6. Marcus D. Casey & Jeffrey Cline & Ben Ost & Javaeria A. Qureshi, 2018. "Academic Probation, Student Performance, And Strategic Course‐Taking," Economic Inquiry, Western Economic Association International, vol. 56(3), pages 1646-1677, July.
    7. Keng, Shao-Hsun, 2020. "Gender bias and statistical discrimination against female instructors in student evaluations of teaching," Labour Economics, Elsevier, vol. 66(C).
    8. Martin Gregor, 2021. "Electives Shopping, Grading Policies and Grading Competition," Economica, London School of Economics and Political Science, vol. 88(350), pages 364-398, April.
    9. Hernández-Julián, Rey & Looney, Adam, 2016. "Measuring inflation in grades: An application of price indexing to undergraduate grades," Economics of Education Review, Elsevier, vol. 55(C), pages 220-232.
    10. Shao-Hsun Keng, 2016. "The Effect of a Stricter Academic Dismissal Policy on Course Selection, Student Effort, and Grading Leniency," Education Finance and Policy, MIT Press, vol. 11(2), pages 203-224, Spring.
    11. Griffith, Amanda L. & Sovero, Veronica, 2021. "Under pressure: How faculty gender and contract uncertainty impact students’ grades," Economics of Education Review, Elsevier, vol. 83(C).
    12. Robertas Zubrickas, 2015. "Optimal Grading," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 56(3), pages 751-776, August.
    13. Ost, Ben, 2010. "The role of peers and grades in determining major persistence in the sciences," Economics of Education Review, Elsevier, vol. 29(6), pages 923-934, December.
    14. Ehrenberg, Ronald G., 2010. "Analyzing the factors that influence persistence rates in STEM field majors: Introduction to the symposium," Economics of Education Review, Elsevier, vol. 29(6), pages 888-891, December.
    15. Rebecca Summary & William Weber, 2012. "Grade inflation or productivity growth? An analysis of changing grade distributions at a regional university," Journal of Productivity Analysis, Springer, vol. 38(1), pages 95-107, August.
    16. Nordin, Martin & Heckley, Gawain & Gerdtham, Ulf, 2019. "The impact of grade inflation on higher education enrolment and earnings," Economics of Education Review, Elsevier, vol. 73(C).
    17. Strobl, Carolin & Boulesteix, Anne-Laure & Augustin, Thomas, 2007. "Unbiased split selection for classification trees based on the Gini Index," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 483-501, September.
    18. Pia M. Orrenius & Madeline Zavodny, 2015. "Does Immigration Affect Whether US Natives Major in Science and Engineering?," Journal of Labor Economics, University of Chicago Press, vol. 33(S1), pages 79-108.
    19. Carvajal, Daniel & Franco, Catalina & Isaksson, Siri, 2024. "Will Artificial Intelligence Get in the Way of Achieving Gender Equality?," Discussion Paper Series in Economics 3/2024, Norwegian School of Economics, Department of Economics, revised 31 Oct 2024.
    20. Fernández de Marcos Giménez de los Galanes, Alberto, 2022. "Data-driven stabilizations of goodness-of-fit tests," DES - Working Papers. Statistics and Econometrics. WS 35324, Universidad Carlos III de Madrid. Departamento de Estadística.
