
Explaining Deep Learning Models for Credit Scoring with SHAP: A Case Study Using Open Banking Data

Author

Listed:
  • Lars Ole Hjelkrem

    (Department of International Business, Faculty of Economics, Norwegian University of Science and Technology (NTNU), Larsgårdsvegen 2, 6025 Ålesund, Norway)

  • Petter Eilif de Lange

    (Department of International Business, Faculty of Economics, Norwegian University of Science and Technology (NTNU), Larsgårdsvegen 2, 6025 Ålesund, Norway)

Abstract

Predicting creditworthiness is an important task in the banking industry, as it allows banks to make informed lending decisions and manage risk. In this paper, we investigate the performance of two different deep learning credit scoring models developed on the textual descriptions of customer transactions available from open banking APIs. The first model is a deep learning model trained from scratch, while the second model uses transfer learning with a multilingual BERT model. We evaluate the predictive performance of these models using the area under the receiver operating characteristic curve (AUC) and the Brier score. We find that a deep learning model trained from scratch outperforms a BERT transformer model fine-tuned on the same data. Furthermore, we find that SHAP can be used to explain such models, both at a global level and for individual rejections of actual applications.
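
For readers unfamiliar with the two evaluation metrics named in the abstract, the following is a minimal sketch of how AUC and the Brier score can be computed. It is not the authors' code: the function names and the four-row toy data set are assumptions made purely for illustration.

```python
from typing import Sequence


def brier_score(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """Mean squared difference between predicted probability and the 0/1 outcome."""
    if len(y_true) != len(y_prob) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def auc(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """Share of (positive, negative) pairs in which the positive case
    receives the higher score; tied scores count as half a win."""
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data (not from the paper): 1 = default, 0 = no default.
defaults = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]

print(f"AUC:   {auc(defaults, scores):.3f}")
print(f"Brier: {brier_score(defaults, scores):.6f}")
```

A higher AUC indicates better ranking of defaulters above non-defaulters, while a lower Brier score indicates better-calibrated probabilities, so the two metrics capture different aspects of predictive performance.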

Suggested Citation

  • Lars Ole Hjelkrem & Petter Eilif de Lange, 2023. "Explaining Deep Learning Models for Credit Scoring with SHAP: A Case Study Using Open Banking Data," JRFM, MDPI, vol. 16(4), pages 1-19, April.
  • Handle: RePEc:gam:jjrfmx:v:16:y:2023:i:4:p:221-:d:1114264

    Download full text from publisher

    File URL: https://www.mdpi.com/1911-8074/16/4/221/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1911-8074/16/4/221/
    Download Restriction: no

    References listed on IDEAS

    1. Dominique Guegan & Peter Martey Addo & Bertrand Hassani, 2018. "Credit Risk Analysis Using Machine and Deep Learning Models," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-01835164, HAL.
    2. B Baesens & T Van Gestel & S Viaene & M Stepanova & J Suykens & J Vanthienen, 2003. "Benchmarking state-of-the-art classification algorithms for credit scoring," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 54(6), pages 627-635, June.
    3. Desai, Vijay S. & Crook, Jonathan N. & Overstreet, George A., 1996. "A comparison of neural networks and linear scoring models in the credit union environment," European Journal of Operational Research, Elsevier, vol. 95(1), pages 24-37, November.
    4. Dominique Guegan, 2018. "Credit Risk Analysis Using machine and Deep Learning Models," Post-Print halshs-01889154, HAL.
    5. Peter Martey Addo & Dominique Guegan & Bertrand Hassani, 2018. "Credit Risk Analysis using Machine and Deep learning models," Working Papers 2018:08, Department of Economics, University of Venice "Ca' Foscari".
    6. Peter Martey Addo & Dominique Guegan & Bertrand Hassani, 2018. "Credit Risk Analysis using Machine and Deep Learning models," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-01719983, HAL.
    7. Dominique Guegan & Peter Martey Addo & Bertrand Hassani, 2018. "Credit Risk Analysis Using Machine and Deep Learning Models," Post-Print halshs-01835164, HAL.
    8. Dominique Guegan, 2018. "Credit Risk Analysis Using machine and Deep Learning Models," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-01889154, HAL.
    9. Peter Martey Addo & Dominique Guegan & Bertrand Hassani, 2018. "Credit Risk Analysis using Machine and Deep Learning models," Post-Print halshs-01719983, HAL.
    10. Peter Martey Addo & Dominique Guégan & Bertrand Hassani, 2018. "Credit Risk Analysis using Machine and Deep learning models," Documents de travail du Centre d'Economie de la Sorbonne 18003, Université Panthéon-Sorbonne (Paris 1), Centre d'Economie de la Sorbonne.
    11. Mai, Feng & Tian, Shaonan & Lee, Chihoon & Ma, Ling, 2019. "Deep learning models for bankruptcy prediction using textual disclosures," European Journal of Operational Research, Elsevier, vol. 274(2), pages 743-758.
    12. Sebastian Bach & Alexander Binder & Grégoire Montavon & Frederick Klauschen & Klaus-Robert Müller & Wojciech Samek, 2015. "On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation," PLOS ONE, Public Library of Science, vol. 10(7), pages 1-46, July.
    13. Peter Martey Addo & Dominique Guegan & Bertrand Hassani, 2018. "Credit Risk Analysis Using Machine and Deep Learning Models," Risks, MDPI, vol. 6(2), pages 1-20, April.

    Citations


    Cited by:

    1. Tigges, Maximilian & Mestwerdt, Sönke & Tschirner, Sebastian & Mauer, René, 2024. "Who gets the money? A qualitative analysis of fintech lending and credit scoring through the adoption of AI and alternative data," Technological Forecasting and Social Change, Elsevier, vol. 205(C).
    2. Yang Liu & Tianxing Yang & Liwei Tian & Bincheng Huang & Jiaming Yang & Zihan Zeng, 2024. "Ada-XG-CatBoost: A Combined Forecasting Model for Gross Ecosystem Product (GEP) Prediction," Sustainability, MDPI, vol. 16(16), pages 1-19, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chen, Shunqin & Guo, Zhengfeng & Zhao, Xinlei, 2021. "Predicting mortgage early delinquency with machine learning methods," European Journal of Operational Research, Elsevier, vol. 290(1), pages 358-372.
    2. Huei-Wen Teng & Michael Lee, 2019. "Estimation Procedures of Using Five Alternative Machine Learning Methods for Predicting Credit Card Default," Review of Pacific Basin Financial Markets and Policies (RPBFMP), World Scientific Publishing Co. Pte. Ltd., vol. 22(03), pages 1-27, September.
    3. Gunnarsson, Björn Rafn & vanden Broucke, Seppe & Baesens, Bart & Óskarsdóttir, María & Lemahieu, Wilfried, 2021. "Deep learning for credit scoring: Do or don’t?," European Journal of Operational Research, Elsevier, vol. 295(1), pages 292-305.
    4. Salima Smiti & Makram Soui, 2020. "Bankruptcy Prediction Using Deep Learning Approach Based on Borderline SMOTE," Information Systems Frontiers, Springer, vol. 22(5), pages 1067-1083, October.
    5. Hossein Hassani & Xu Huang & Emmanuel Silva & Mansi Ghodsi, 2020. "Deep Learning and Implementations in Banking," Annals of Data Science, Springer, vol. 7(3), pages 433-446, September.
    6. Paritosh Navinchandra Jha & Marco Cucculelli, 2021. "A New Model Averaging Approach in Predicting Credit Risk Default," Risks, MDPI, vol. 9(6), pages 1-15, June.
    7. Dan Wang & Zhi Chen & Ionut Florescu, 2021. "A Sparsity Algorithm with Applications to Corporate Credit Rating," Papers 2107.10306, arXiv.org.
    8. Roy Cerqueti & Francesca Pampurini & Annagiulia Pezzola & Anna Grazia Quaranta, 2022. "Dangerous liasons and hot customers for banks," Review of Quantitative Finance and Accounting, Springer, vol. 59(1), pages 65-89, July.
    9. Keerthana Sivamayil & Elakkiya Rajasekar & Belqasem Aljafari & Srete Nikolovski & Subramaniyaswamy Vairavasundaram & Indragandhi Vairavasundaram, 2023. "A Systematic Study on Reinforcement Learning Based Applications," Energies, MDPI, vol. 16(3), pages 1-23, February.
    10. Amirhosein Mosavi & Yaser Faghan & Pedram Ghamisi & Puhong Duan & Sina Faizollahzadeh Ardabili & Ely Salwana & Shahab S. Band, 2020. "Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics," Mathematics, MDPI, vol. 8(10), pages 1-42, September.
    11. Anastasios Petropoulos & Vasilis Siakoulis & Evaggelos Stavroulakis & Aristotelis Klamargias, 2019. "A robust machine learning approach for credit risk analysis of large loan level datasets using deep learning and extreme gradient boosting," IFC Bulletins chapters, in: Bank for International Settlements (ed.), Are post-crisis statistical initiatives completed?, volume 49, Bank for International Settlements.
    12. Anastasios Petropoulos & Vasilis Siakoulis & Evaggelos Stavroulakis & Aristotelis Klamargias, 2019. "A robust machine learning approach for credit risk analysis of large loan-level datasets using deep learning and extreme gradient boosting," IFC Bulletins chapters, in: Bank for International Settlements (ed.), The use of big data analytics and artificial intelligence in central banking, volume 50, Bank for International Settlements.
    13. Nenad Milojević & Srdjan Redzepagic, 2021. "Prospects of Artificial Intelligence and Machine Learning Application in Banking Risk Management," Journal of Central Banking Theory and Practice, Central bank of Montenegro, vol. 10(3), pages 41-57.
    14. Irving Fisher Committee, 2019. "The use of big data analytics and artificial intelligence in central banking," IFC Bulletins, Bank for International Settlements, number 50.
    15. Yaseen Ghulam & Kamini Dhruva & Sana Naseem & Sophie Hill, 2018. "The Interaction of Borrower and Loan Characteristics in Predicting Risks of Subprime Automobile Loans," Risks, MDPI, vol. 6(3), pages 1-21, September.
    16. Li-Chen Cheng & Wei-Ting Lu & Benjamin Yeo, 2023. "Predicting abnormal trading behavior from internet rumor propagation: a machine learning approach," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 9(1), pages 1-23, December.
    17. Roman P. Bulyga & Alexey A. Sitnov & Liudmila V. Kashirskaya & Irina V. Safonova, 2020. "Transparency of credit institutions," Entrepreneurship and Sustainability Issues, VsI Entrepreneurship and Sustainability Center, vol. 7(4), pages 3158-3172, June.
    18. Revathi Bhuvaneswari & Antonio Segalini, 2020. "Determining Secondary Attributes for Credit Evaluation in P2P Lending," Papers 2006.13921, arXiv.org.
    19. Parisa Golbayani & Ionuț Florescu & Rupak Chatterjee, 2020. "A comparative study of forecasting Corporate Credit Ratings using Neural Networks, Support Vector Machines, and Decision Trees," Papers 2007.06617, arXiv.org.
    20. K. S. Naik, 2021. "Predicting Credit Risk for Unsecured Lending: A Machine Learning Approach," Papers 2110.02206, arXiv.org.
