IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v8y2020i10p1756-d426903.html
   My bibliography  Save this article

A Comparative Performance Assessment of Ensemble Learning for Credit Scoring

Author

Listed:
  • Yiheng Li

    (College of Management and Economics, Tianjin University, Tianjin 300072, China)

  • Weidong Chen

    (College of Management and Economics, Tianjin University, Tianjin 300072, China)

Abstract

Extensive research has been performed by organizations and academics on models for credit scoring, an important financial management activity. With novel machine learning models continue to be proposed, ensemble learning has been introduced into the application of credit scoring, several researches have addressed the supremacy of ensemble learning. In this research, we provide a comparative performance evaluation of ensemble algorithms, i.e., random forest, AdaBoost, XGBoost, LightGBM and Stacking, in terms of accuracy (ACC), area under the curve (AUC), Kolmogorov–Smirnov statistic (KS), Brier score (BS), and model operating time in terms of credit scoring. Moreover, five popular baseline classifiers, i.e., neural network (NN), decision tree (DT), logistic regression (LR), Naïve Bayes (NB), and support vector machine (SVM) are considered to be benchmarks. Experimental findings reveal that the performance of ensemble learning is better than individual learners, except for AdaBoost. In addition, random forest has the best performance in terms of five metrics, XGBoost and LightGBM are close challengers. Among five baseline classifiers, logistic regression outperforms the other classifiers over the most of evaluation metrics. Finally, this study also analyzes reasons for the poor performance of some algorithms and give some suggestions on the choice of credit scoring models for financial institutions.

Suggested Citation

  • Yiheng Li & Weidong Chen, 2020. "A Comparative Performance Assessment of Ensemble Learning for Credit Scoring," Mathematics, MDPI, vol. 8(10), pages 1-19, October.
  • Handle: RePEc:gam:jmathe:v:8:y:2020:i:10:p:1756-:d:426903
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/8/10/1756/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/8/10/1756/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Desai, Vijay S. & Crook, Jonathan N. & Overstreet, George A., 1996. "A comparison of neural networks and linear scoring models in the credit union environment," European Journal of Operational Research, Elsevier, vol. 95(1), pages 24-37, November.
    2. D. J. Hand & W. E. Henley, 1997. "Statistical Classification Methods in Consumer Credit Scoring: a Review," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 160(3), pages 523-541, September.
    3. Edward I. Altman, 1968. "The Prediction Of Corporate Bankruptcy: A Discriminant Analysis," Journal of Finance, American Finance Association, vol. 23(1), pages 193-194, March.
    4. K Kennedy & B Mac Namee & S J Delany, 2013. "Using semi-supervised classifiers for credit scoring," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 64(4), pages 513-529, April.
    5. Beaver, Wh, 1966. "Financial Ratios As Predictors Of Failure," Journal of Accounting Research, Wiley Blackwell, vol. 4, pages 71-111.
    6. Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
    7. Orgler, Yair E, 1970. "A Credit Scoring Model for Commercial Loans," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 2(4), pages 435-445, November.
    8. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    9. Weizhang Liang & Suizhi Luo & Guoyan Zhao & Hao Wu, 2020. "Predicting Hard Rock Pillar Stability Using GBDT, XGBoost, and LightGBM Algorithms," Mathematics, MDPI, vol. 8(5), pages 1-17, May.
    10. Eisenbeis, Robert A, 1977. "Pitfalls in the Application of Discriminant Analysis in Business, Finance, and Economics," Journal of Finance, American Finance Association, vol. 32(3), pages 875-900, June.
    11. Mohammed Akour & Izzat Alsmadi & Iyad Alazzam, 2017. "Software fault proneness prediction: a comparative study between bagging, boosting, and stacking ensemble and base learner methods," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 9(1), pages 1-16.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yang Liu & Fei Huang & Lili Ma & Qingguo Zeng & Jiale Shi, 2024. "Credit scoring prediction leveraging interpretable ensemble learning," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(2), pages 286-308, March.
    2. Babaei, Golnoosh & Giudici, Paolo & Raffinetti, Emanuela, 2023. "Explainable FinTech lending," Journal of Economics and Business, Elsevier, vol. 125.
    3. Indy Man Kit Ho & Anthony Weldon & Jason Tze Ho Yong & Candy Tze Tim Lam & Jaime Sampaio, 2023. "Using Machine Learning Algorithms to Pool Data from Meta-Analysis for the Prediction of Countermovement Jump Improvement," IJERPH, MDPI, vol. 20(10), pages 1-15, May.
    4. Mohamed Hamitouche & Jose-Luis Molina, 2022. "A Review of AI Methods for the Prediction of High-Flow Extremal Hydrology," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(10), pages 3859-3876, August.
    5. Guner Altan & Server Demirci, 2022. "Credit Scoring on Cash Flow Table with Machine Learning: XGBoost Approach," Journal of Economic Policy Researches, Istanbul University, Faculty of Economics, vol. 9(2), pages 397-424, July.
    6. Babek Erdebilli & Burcu Devrim-İçtenbaş, 2022. "Ensemble Voting Regression Based on Machine Learning for Predicting Medical Waste: A Case from Turkey," Mathematics, MDPI, vol. 10(14), pages 1-16, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hussein A. Abdou & John Pointon, 2011. "Credit Scoring, Statistical Techniques And Evaluation Criteria: A Review Of The Literature," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 18(2-3), pages 59-88, April.
    2. José Willer Prado & Valderí Castro Alcântara & Francisval Melo Carvalho & Kelly Carvalho Vieira & Luiz Kennedy Cruz Machado & Dany Flávio Tonelli, 2016. "Multivariate analysis of credit risk and bankruptcy research data: a bibliometric study involving different knowledge fields (1968–2014)," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(3), pages 1007-1029, March.
    3. Mabe, Queen Magadi & Lin, Wei, 2018. "Determinants of Corporate Failure: The Case of the Johannesburg Stock Exchange," MPRA Paper 88485, University Library of Munich, Germany.
    4. Enrico Supino & Nicola Piras, 2022. "Le performance dei modelli di credit scoring in contesti di forte instabilit? macroeconomica: il ruolo delle Reti Neurali Artificiali," MANAGEMENT CONTROL, FrancoAngeli Editore, vol. 2022(2), pages 41-61.
    5. Adriana Csikosova & Maria Janoskova & Katarina Culkova, 2020. "Application of Discriminant Analysis for Avoiding the Risk of Quarry Operation Failure," JRFM, MDPI, vol. 13(10), pages 1-14, September.
    6. Youssef Zizi & Mohamed Oudgou & Abdeslam El Moudden, 2020. "Determinants and Predictors of SMEs’ Financial Failure: A Logistic Regression Approach," Risks, MDPI, vol. 8(4), pages 1-21, October.
    7. Rasa Kanapickiene & Renatas Spicas, 2019. "Credit Risk Assessment Model for Small and Micro-Enterprises: The Case of Lithuania," Risks, MDPI, vol. 7(2), pages 1-23, June.
    8. Balcaen, Sofie & Ooghe, Hubert, 2006. "35 years of studies on business failure: an overview of the classic statistical methodologies and their related problems," The British Accounting Review, Elsevier, vol. 38(1), pages 63-93.
    9. Yao-Zhi Xu & Jian-Lin Zhang & Ying Hua & Lin-Yue Wang, 2019. "Dynamic Credit Risk Evaluation Method for E-Commerce Sellers Based on a Hybrid Artificial Intelligence Model," Sustainability, MDPI, vol. 11(19), pages 1-17, October.
    10. Lkhagvadorj Munkhdalai & Tsendsuren Munkhdalai & Oyun-Erdene Namsrai & Jong Yun Lee & Keun Ho Ryu, 2019. "An Empirical Comparison of Machine-Learning Methods on Bank Client Credit Assessments," Sustainability, MDPI, vol. 11(3), pages 1-23, January.
    11. Ha-Thu Nguyen, 2015. "How is credit scoring used to predict default in China?," EconomiX Working Papers 2015-1, University of Paris Nanterre, EconomiX.
    12. Laitinen, Erkki K., 2007. "Classification accuracy and correlation: LDA in failure prediction," European Journal of Operational Research, Elsevier, vol. 183(1), pages 210-225, November.
    13. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    14. Juan Laborda & Seyong Ryoo, 2021. "Feature Selection in a Credit Scoring Model," Mathematics, MDPI, vol. 9(7), pages 1-22, March.
    15. ANDREICA Madalina Ecaterina & ANDREICA Mugurel Ionut & ANDREICA Marin, 2009. "Using financial ratios to identify Romanian distressed companies," Economia. Seria Management, Faculty of Management, Academy of Economic Studies, Bucharest, Romania, vol. 12(1 Special), pages 46-55, July.
    16. Madalina Ecaterina POPESCU & Marin ANDREICA & Ion-Petru POPESCU, 2017. "Decision Support Solution To Business Failure Prediction," Proceedings of the INTERNATIONAL MANAGEMENT CONFERENCE, Faculty of Management, Academy of Economic Studies, Bucharest, Romania, vol. 11(1), pages 99-106, November.
    17. Gestel, Tony Van & Baesens, Bart & Suykens, Johan A.K. & Van den Poel, Dirk & Baestaens, Dirk-Emma & Willekens, Marleen, 2006. "Bayesian kernel based classification for financial distress detection," European Journal of Operational Research, Elsevier, vol. 172(3), pages 979-1003, August.
    18. Elena Ivona DUMITRESCU & Sullivan HUE & Christophe HURLIN & Sessi TOKPAVI, 2020. "Machine Learning or Econometrics for Credit Scoring: Let’s Get the Best of Both Worlds," LEO Working Papers / DR LEO 2839, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.
    19. Carlos Serrano-Cinca, 1997. "Feedforward neural networks in the classification of financial information," The European Journal of Finance, Taylor & Francis Journals, vol. 3(3), pages 183-202.
    20. Scalzer, Rodrigo S. & Rodrigues, Adriano & Macedo, Marcelo Álvaro da S. & Wanke, Peter, 2019. "Financial distress in electricity distributors from the perspective of Brazilian regulation," Energy Policy, Elsevier, vol. 125(C), pages 250-259.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:8:y:2020:i:10:p:1756-:d:426903. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.