IDEAS home Printed from https://ideas.repec.org/a/pal/palcom/v11y2024i1d10.1057_s41599-024-04047-5.html
   My bibliography  Save this article

Enhancing financial risk prediction with symbolic classifiers: addressing class imbalance and the accuracy–interpretability trade–off

Author

Listed:
  • Luis J. Mena

    (Universidad Politecnica de Sinaloa)

  • Vicente García

    (Universidad Autonoma de Ciudad Juarez)

  • Vanessa G. Félix

    (Universidad Politecnica de Sinaloa
    Universidad Autonoma de Occidente)

  • Rodolfo Ostos

    (Universidad Politecnica de Sinaloa
    Universidad Autonoma de Occidente)

  • Rafael Martínez-Peláez

    (Universidad Politecnica de Sinaloa
    Universidad Catolica del Norte)

  • Alberto Ochoa-Brust

    (Universidad de Colima)

  • Pablo Velarde-Alvarado

    (Universidad Autonoma de Nayarit)

Abstract

Machine learning for financial risk prediction has garnered substantial interest in recent decades. However, the class imbalance problem and the dilemma of accuracy gain by loss interpretability have yet to be widely studied. Symbolic classifiers have emerged as a promising solution for forecasting banking failures and estimating creditworthiness as it addresses class imbalance while maintaining both accuracy and interpretability. This paper aims to evaluate the effectiveness of REMED, a symbolic classifier, in the context of financial risk management, and focuses on its ability to handle class imbalance and provide interpretable decision rules. Through empirical analysis of a real-world imbalanced financial dataset from the Federal Deposit Insurance Corporation, we demonstrate that REMED effectively handles class imbalance, improving performance accuracy metrics while ensuring interpretability through a concise and easily understandable rule system. A comparative analysis is conducted against two well-known rule-generating approaches, J48 and JRip. The findings suggest that, with further development and validation, REMED can be implemented as a competitive approach to improve predictive accuracy on imbalanced financial datasets without compromising model interpretability.

Suggested Citation

  • Luis J. Mena & Vicente García & Vanessa G. Félix & Rodolfo Ostos & Rafael Martínez-Peláez & Alberto Ochoa-Brust & Pablo Velarde-Alvarado, 2024. "Enhancing financial risk prediction with symbolic classifiers: addressing class imbalance and the accuracy–interpretability trade–off," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-11, December.
  • Handle: RePEc:pal:palcom:v:11:y:2024:i:1:d:10.1057_s41599-024-04047-5
    DOI: 10.1057/s41599-024-04047-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1057/s41599-024-04047-5
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1057/s41599-024-04047-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Tim Jones & G. Stacy Sirmans, 2019. "Understanding Subprime Mortgage Default," Journal of Real Estate Literature, Taylor & Francis Journals, vol. 27(1), pages 27-52, August.
    2. Hong Wang & Qingsong Xu & Lifeng Zhou, 2015. "Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble," PLOS ONE, Public Library of Science, vol. 10(2), pages 1-20, February.
    3. Cukierman, Alex, 2019. "A retrospective on the subprime crisis and its aftermath ten years after Lehman’s collapse," Economic Systems, Elsevier, vol. 43(3).
    4. Martens, David & Baesens, Bart & Van Gestel, Tony & Vanthienen, Jan, 2007. "Comprehensible credit scoring models using rule extraction from support vector machines," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1466-1476, December.
    5. Martin Leo & Suneel Sharma & K. Maddulety, 2019. "Machine Learning in Banking Risk Management: A Literature Review," Risks, MDPI, vol. 7(1), pages 1-22, March.
    6. Laura Cristina Lanzarini & Augusto Villa Monte & Aurelio F. Bariviera & Patricia Jimbo Santana, 2017. "Simplifying credit scoring rules using LVQ+PSO," Papers 1704.04450, arXiv.org.
    7. Michael Bücker & Gero Szepannek & Alicja Gosiewska & Przemyslaw Biecek, 2022. "Transparency, auditability, and explainability of machine learning models in credit scoring," Journal of the Operational Research Society, Taylor & Francis Journals, vol. 73(1), pages 70-90, January.
    8. Ahmed Almustfa Hussin Adam Khatir & Marco Bee, 2022. "Machine Learning Models and Data-Balancing Techniques for Credit Scoring: What Is the Best Combination?," Risks, MDPI, vol. 10(9), pages 1-22, August.
    9. Li Shang & Biao Zhou & Jiannan Li & Decai Tang & Valentina Boamah & Zhiwei Pan, 2024. "Evaluating financial fragility: a case study of Chinese banking and finance systems," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-9, December.
    10. Shen, Feng & Zhao, Xingchao & Li, Zhiyong & Li, Ke & Meng, Zhiyi, 2019. "A novel ensemble classification model based on neural networks and a classifier optimisation technique for imbalanced credit risk evaluation," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 526(C).
    11. Chen, Yujia & Calabrese, Raffaella & Martin-Barragan, Belen, 2024. "Interpretable machine learning for imbalanced credit scoring datasets," European Journal of Operational Research, Elsevier, vol. 312(1), pages 357-372.
    12. Jing Quan & Xuelian Sun, 2024. "Credit risk assessment using the factorization machine model with feature interactions," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-10, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nadia Ayed & Khemaies Bougatef, 2024. "Performance Assessment of Logistic Regression (LR), Artificial Neural Network (ANN), Fuzzy Inference System (FIS) and Adaptive Neuro-Fuzzy System (ANFIS) in Predicting Default Probability: The Case of," Computational Economics, Springer;Society for Computational Economics, vol. 64(3), pages 1803-1835, September.
    2. Janssens, Bram & Schetgen, Lisa & Bogaert, Matthias & Meire, Matthijs & Van den Poel, Dirk, 2024. "360 Degrees rumor detection: When explanations got some explaining to do," European Journal of Operational Research, Elsevier, vol. 317(2), pages 366-381.
    3. Przemys{l}aw Biecek & Marcin Chlebus & Janusz Gajda & Alicja Gosiewska & Anna Kozak & Dominik Ogonowski & Jakub Sztachelski & Piotr Wojewnik, 2021. "Enabling Machine Learning Algorithms for Credit Scoring -- Explainable Artificial Intelligence (XAI) methods for clear understanding complex predictive models," Papers 2104.06735, arXiv.org.
    4. Bauer, Kevin & Pfeuffer, Nicolas & Abdel-Karim, Benjamin M. & Hinz, Oliver & Kosfeld, Michael, 2020. "The terminator of social welfare? The economic consequences of algorithmic discrimination," SAFE Working Paper Series 287, Leibniz Institute for Financial Research SAFE.
    5. Andrés Alonso & José Manuel Carbó, 2022. "Accuracy of explanations of machine learning models for credit decisions," Working Papers 2222, Banco de España.
    6. John R. J. Thompson & Longlong Feng & R. Mark Reesor & Chuck Grace, 2021. "Know Your Clients’ Behaviours: A Cluster Analysis of Financial Transactions," JRFM, MDPI, vol. 14(2), pages 1-29, January.
    7. Li, Yibei & Wang, Ximei & Djehiche, Boualem & Hu, Xiaoming, 2020. "Credit scoring by incorporating dynamic networked information," European Journal of Operational Research, Elsevier, vol. 286(3), pages 1103-1112.
    8. Loterman, Gert & Brown, Iain & Martens, David & Mues, Christophe & Baesens, Bart, 2012. "Benchmarking regression algorithms for loss given default modeling," International Journal of Forecasting, Elsevier, vol. 28(1), pages 161-170.
    9. Yu, Lean & Wang, Shouyang & Lai, Kin Keung, 2009. "An intelligent-agent-based fuzzy group decision making model for financial multicriteria decision support: The case of credit scoring," European Journal of Operational Research, Elsevier, vol. 195(3), pages 942-959, June.
    10. Shiqi Fang & Zexun Chen & Jake Ansell, 2024. "Peer-induced Fairness: A Causal Approach for Algorithmic Fairness Auditing," Papers 2408.02558, arXiv.org, revised Sep 2024.
    11. Keerthana Sivamayil & Elakkiya Rajasekar & Belqasem Aljafari & Srete Nikolovski & Subramaniyaswamy Vairavasundaram & Indragandhi Vairavasundaram, 2023. "A Systematic Study on Reinforcement Learning Based Applications," Energies, MDPI, vol. 16(3), pages 1-23, February.
    12. Bauer, Kevin & Nofer, Michael & Abdel-Karim, Benjamin M. & Hinz, Oliver, 2022. "The effects of discontinuing machine learning decision support," SAFE Working Paper Series 370, Leibniz Institute for Financial Research SAFE.
    13. Dmytro Kovalenko & Olga Afanasieva & Nani Zabuta & Tetiana Boiko & Rosen Rosenov Baltov, 2021. "Model of Assessing the Overdue Debts in a Commercial Bank Using Neuro-Fuzzy Technologies," JRFM, MDPI, vol. 14(5), pages 1-20, May.
    14. Abdussalam Aljadani & Bshair Alharthi & Mohammed A. Farsi & Hossam Magdy Balaha & Mahmoud Badawy & Mostafa A. Elhosseini, 2023. "Mathematical Modeling and Analysis of Credit Scoring Using the LIME Explainer: A Comprehensive Approach," Mathematics, MDPI, vol. 11(19), pages 1-28, September.
    15. Guansan Du & Frank Elston, 2022. "RETRACTED ARTICLE: Financial risk assessment to improve the accuracy of financial prediction in the internet financial industry using data analytics models," Operations Management Research, Springer, vol. 15(3), pages 925-940, December.
    16. Ni Zhan, 2021. "Where does the Stimulus go? Deep Generative Model for Commercial Banking Deposits," Papers 2101.09230, arXiv.org.
    17. Onder Ozgur & Erdal Tanas Karagol & Fatih Cemil Ozbugday, 2021. "Machine learning approach to drivers of bank lending: evidence from an emerging economy," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 7(1), pages 1-29, December.
    18. Juan Laborda & Seyong Ryoo, 2021. "Feature Selection in a Credit Scoring Model," Mathematics, MDPI, vol. 9(7), pages 1-22, March.
    19. Yang Liu & Fei Huang & Lili Ma & Qingguo Zeng & Jiale Shi, 2024. "Credit scoring prediction leveraging interpretable ensemble learning," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(2), pages 286-308, March.
    20. Anil Kumar & Suneel Sharma & Mehregan Mahdavi, 2021. "Machine Learning (ML) Technologies for Digital Credit Scoring in Rural Finance: A Literature Review," Risks, MDPI, vol. 9(11), pages 1-15, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pal:palcom:v:11:y:2024:i:1:d:10.1057_s41599-024-04047-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: https://www.nature.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.