Enhancing financial risk prediction with symbolic classifiers: addressing class imbalance and the accuracy–interpretability trade–off

My bibliography Save this article

Enhancing financial risk prediction with symbolic classifiers: addressing class imbalance and the accuracy–interpretability trade–off

Author

Listed:

Luis J. Mena
(Universidad Politecnica de Sinaloa)
Vicente García
(Universidad Autonoma de Ciudad Juarez)
Vanessa G. Félix
(Universidad Politecnica de Sinaloa
Universidad Autonoma de Occidente)
Rodolfo Ostos
(Universidad Politecnica de Sinaloa
Universidad Autonoma de Occidente)
Rafael Martínez-Peláez
(Universidad Politecnica de Sinaloa
Universidad Catolica del Norte)
Alberto Ochoa-Brust
(Universidad de Colima)
Pablo Velarde-Alvarado
(Universidad Autonoma de Nayarit)

Registered:

Abstract

Machine learning for financial risk prediction has garnered substantial interest in recent decades. However, the class imbalance problem and the dilemma of accuracy gain by loss interpretability have yet to be widely studied. Symbolic classifiers have emerged as a promising solution for forecasting banking failures and estimating creditworthiness as it addresses class imbalance while maintaining both accuracy and interpretability. This paper aims to evaluate the effectiveness of REMED, a symbolic classifier, in the context of financial risk management, and focuses on its ability to handle class imbalance and provide interpretable decision rules. Through empirical analysis of a real-world imbalanced financial dataset from the Federal Deposit Insurance Corporation, we demonstrate that REMED effectively handles class imbalance, improving performance accuracy metrics while ensuring interpretability through a concise and easily understandable rule system. A comparative analysis is conducted against two well-known rule-generating approaches, J48 and JRip. The findings suggest that, with further development and validation, REMED can be implemented as a competitive approach to improve predictive accuracy on imbalanced financial datasets without compromising model interpretability.

Suggested Citation

Luis J. Mena & Vicente García & Vanessa G. Félix & Rodolfo Ostos & Rafael Martínez-Peláez & Alberto Ochoa-Brust & Pablo Velarde-Alvarado, 2024. "Enhancing financial risk prediction with symbolic classifiers: addressing class imbalance and the accuracy–interpretability trade–off," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-11, December.

Handle: RePEc:pal:palcom:v:11:y:2024:i:1:d:10.1057_s41599-024-04047-5
DOI: 10.1057/s41599-024-04047-5

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Ahmed Almustfa Hussin Adam Khatir & Marco Bee, 2022. "Machine Learning Models and Data-Balancing Techniques for Credit Scoring: What Is the Best Combination?," Risks, MDPI, vol. 10(9), pages 1-22, August.
Cukierman, Alex, 2019. "A retrospective on the subprime crisis and its aftermath ten years after Lehman’s collapse," Economic Systems, Elsevier, vol. 43(3).
Martin Leo & Suneel Sharma & K. Maddulety, 2019. "Machine Learning in Banking Risk Management: A Literature Review," Risks, MDPI, vol. 7(1), pages 1-22, March.
Michael Bücker & Gero Szepannek & Alicja Gosiewska & Przemyslaw Biecek, 2022. "Transparency, auditability, and explainability of machine learning models in credit scoring," Journal of the Operational Research Society, Taylor & Francis Journals, vol. 73(1), pages 70-90, January.
Shen, Feng & Zhao, Xingchao & Li, Zhiyong & Li, Ke & Meng, Zhiyi, 2019. "A novel ensemble classification model based on neural networks and a classifier optimisation technique for imbalanced credit risk evaluation," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 526(C).
Chen, Yujia & Calabrese, Raffaella & Martin-Barragan, Belen, 2024. "Interpretable machine learning for imbalanced credit scoring datasets," European Journal of Operational Research, Elsevier, vol. 312(1), pages 357-372.
Li Shang & Biao Zhou & Jiannan Li & Decai Tang & Valentina Boamah & Zhiwei Pan, 2024. "Evaluating financial fragility: a case study of Chinese banking and finance systems," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-9, December.
Tim Jones & G. Stacy Sirmans, 2019. "Understanding Subprime Mortgage Default," Journal of Real Estate Literature, Taylor & Francis Journals, vol. 27(1), pages 27-52, August.
Hong Wang & Qingsong Xu & Lifeng Zhou, 2015. "Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble," PLOS ONE, Public Library of Science, vol. 10(2), pages 1-20, February.
Martens, David & Baesens, Bart & Van Gestel, Tony & Vanthienen, Jan, 2007. "Comprehensible credit scoring models using rule extraction from support vector machines," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1466-1476, December.
Laura Cristina Lanzarini & Augusto Villa Monte & Aurelio F. Bariviera & Patricia Jimbo Santana, 2017. "Simplifying credit scoring rules using LVQ+PSO," Papers 1704.04450, arXiv.org.
Jing Quan & Xuelian Sun, 2024. "Credit risk assessment using the factorization machine model with feature interactions," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-10, December.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Nadia Ayed & Khemaies Bougatef, 2024. "Performance Assessment of Logistic Regression (LR), Artificial Neural Network (ANN), Fuzzy Inference System (FIS) and Adaptive Neuro-Fuzzy System (ANFIS) in Predicting Default Probability: The Case of," Computational Economics, Springer;Society for Computational Economics, vol. 64(3), pages 1803-1835, September.
Janssens, Bram & Schetgen, Lisa & Bogaert, Matthias & Meire, Matthijs & Van den Poel, Dirk, 2024. "360 Degrees rumor detection: When explanations got some explaining to do," European Journal of Operational Research, Elsevier, vol. 317(2), pages 366-381.
Przemys{l}aw Biecek & Marcin Chlebus & Janusz Gajda & Alicja Gosiewska & Anna Kozak & Dominik Ogonowski & Jakub Sztachelski & Piotr Wojewnik, 2021. "Enabling Machine Learning Algorithms for Credit Scoring -- Explainable Artificial Intelligence (XAI) methods for clear understanding complex predictive models," Papers 2104.06735, arXiv.org.
Bauer, Kevin & Pfeuffer, Nicolas & Abdel-Karim, Benjamin M. & Hinz, Oliver & Kosfeld, Michael, 2020. "The terminator of social welfare? The economic consequences of algorithmic discrimination," SAFE Working Paper Series 287, Leibniz Institute for Financial Research SAFE.
Andrés Alonso & José Manuel Carbó, 2022. "Accuracy of explanations of machine learning models for credit decisions," Working Papers 2222, Banco de España.
John R. J. Thompson & Longlong Feng & R. Mark Reesor & Chuck Grace, 2021. "Know Your Clients’ Behaviours: A Cluster Analysis of Financial Transactions," JRFM, MDPI, vol. 14(2), pages 1-29, January.
- John R. J. Thompson & Longlong Feng & R. Mark Reesor & Chuck Grace, 2020. "Know Your Clients' behaviours: a cluster analysis of financial transactions," Papers 2005.03625, arXiv.org, revised May 2020.
Li, Yibei & Wang, Ximei & Djehiche, Boualem & Hu, Xiaoming, 2020. "Credit scoring by incorporating dynamic networked information," European Journal of Operational Research, Elsevier, vol. 286(3), pages 1103-1112.
- Yibei Li & Ximei Wang & Boualem Djehiche & Xiaoming Hu, 2019. "Credit Scoring by Incorporating Dynamic Networked Information," Papers 1905.11795, arXiv.org, revised Oct 2019.
Loterman, Gert & Brown, Iain & Martens, David & Mues, Christophe & Baesens, Bart, 2012. "Benchmarking regression algorithms for loss given default modeling," International Journal of Forecasting, Elsevier, vol. 28(1), pages 161-170.
Yu, Lean & Wang, Shouyang & Lai, Kin Keung, 2009. "An intelligent-agent-based fuzzy group decision making model for financial multicriteria decision support: The case of credit scoring," European Journal of Operational Research, Elsevier, vol. 195(3), pages 942-959, June.
Shiqi Fang & Zexun Chen & Jake Ansell, 2024. "Peer-induced Fairness: A Causal Approach for Algorithmic Fairness Auditing," Papers 2408.02558, arXiv.org, revised Sep 2024.
Keerthana Sivamayil & Elakkiya Rajasekar & Belqasem Aljafari & Srete Nikolovski & Subramaniyaswamy Vairavasundaram & Indragandhi Vairavasundaram, 2023. "A Systematic Study on Reinforcement Learning Based Applications," Energies, MDPI, vol. 16(3), pages 1-23, February.
Bauer, Kevin & Nofer, Michael & Abdel-Karim, Benjamin M. & Hinz, Oliver, 2022. "The effects of discontinuing machine learning decision support," SAFE Working Paper Series 370, Leibniz Institute for Financial Research SAFE.
Dmytro Kovalenko & Olga Afanasieva & Nani Zabuta & Tetiana Boiko & Rosen Rosenov Baltov, 2021. "Model of Assessing the Overdue Debts in a Commercial Bank Using Neuro-Fuzzy Technologies," JRFM, MDPI, vol. 14(5), pages 1-20, May.
Abdussalam Aljadani & Bshair Alharthi & Mohammed A. Farsi & Hossam Magdy Balaha & Mahmoud Badawy & Mostafa A. Elhosseini, 2023. "Mathematical Modeling and Analysis of Credit Scoring Using the LIME Explainer: A Comprehensive Approach," Mathematics, MDPI, vol. 11(19), pages 1-28, September.
Guansan Du & Frank Elston, 2022. "RETRACTED ARTICLE: Financial risk assessment to improve the accuracy of financial prediction in the internet financial industry using data analytics models," Operations Management Research, Springer, vol. 15(3), pages 925-940, December.
Myvel Nabil, 2024. "Evaluating the Effect of Climate Risk on Financial Fragility in Arab Countries," International Journal of Economics and Finance, Canadian Center of Science and Education, vol. 16(12), pages 104-104, December.
Ni Zhan, 2021. "Where does the Stimulus go? Deep Generative Model for Commercial Banking Deposits," Papers 2101.09230, arXiv.org.
Onder Ozgur & Erdal Tanas Karagol & Fatih Cemil Ozbugday, 2021. "Machine learning approach to drivers of bank lending: evidence from an emerging economy," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 7(1), pages 1-29, December.
Juan Laborda & Seyong Ryoo, 2021. "Feature Selection in a Credit Scoring Model," Mathematics, MDPI, vol. 9(7), pages 1-22, March.
Yang Liu & Fei Huang & Lili Ma & Qingguo Zeng & Jiale Shi, 2024. "Credit scoring prediction leveraging interpretable ensemble learning," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(2), pages 286-308, March.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pal:palcom:v:11:y:2024:i:1:d:10.1057_s41599-024-04047-5. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: https://www.nature.com/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Enhancing financial risk prediction with symbolic classifiers: addressing class imbalance and the accuracy–interpretability trade–off

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data