IDEAS home Printed from https://ideas.repec.org/a/pal/palcom/v11y2024i1d10.1057_s41599-024-04047-5.html
   My bibliography  Save this article

Enhancing financial risk prediction with symbolic classifiers: addressing class imbalance and the accuracy–interpretability trade–off

Author

Listed:
  • Luis J. Mena

    (Universidad Politecnica de Sinaloa)

  • Vicente García

    (Universidad Autonoma de Ciudad Juarez)

  • Vanessa G. Félix

    (Universidad Politecnica de Sinaloa
    Universidad Autonoma de Occidente)

  • Rodolfo Ostos

    (Universidad Politecnica de Sinaloa
    Universidad Autonoma de Occidente)

  • Rafael Martínez-Peláez

    (Universidad Politecnica de Sinaloa
    Universidad Catolica del Norte)

  • Alberto Ochoa-Brust

    (Universidad de Colima)

  • Pablo Velarde-Alvarado

    (Universidad Autonoma de Nayarit)

Abstract

Machine learning for financial risk prediction has garnered substantial interest in recent decades. However, the class imbalance problem and the dilemma of accuracy gain by loss interpretability have yet to be widely studied. Symbolic classifiers have emerged as a promising solution for forecasting banking failures and estimating creditworthiness as it addresses class imbalance while maintaining both accuracy and interpretability. This paper aims to evaluate the effectiveness of REMED, a symbolic classifier, in the context of financial risk management, and focuses on its ability to handle class imbalance and provide interpretable decision rules. Through empirical analysis of a real-world imbalanced financial dataset from the Federal Deposit Insurance Corporation, we demonstrate that REMED effectively handles class imbalance, improving performance accuracy metrics while ensuring interpretability through a concise and easily understandable rule system. A comparative analysis is conducted against two well-known rule-generating approaches, J48 and JRip. The findings suggest that, with further development and validation, REMED can be implemented as a competitive approach to improve predictive accuracy on imbalanced financial datasets without compromising model interpretability.

Suggested Citation

  • Luis J. Mena & Vicente García & Vanessa G. Félix & Rodolfo Ostos & Rafael Martínez-Peláez & Alberto Ochoa-Brust & Pablo Velarde-Alvarado, 2024. "Enhancing financial risk prediction with symbolic classifiers: addressing class imbalance and the accuracy–interpretability trade–off," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-11, December.
  • Handle: RePEc:pal:palcom:v:11:y:2024:i:1:d:10.1057_s41599-024-04047-5
    DOI: 10.1057/s41599-024-04047-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1057/s41599-024-04047-5
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1057/s41599-024-04047-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Michael Bücker & Gero Szepannek & Alicja Gosiewska & Przemyslaw Biecek, 2022. "Transparency, auditability, and explainability of machine learning models in credit scoring," Journal of the Operational Research Society, Taylor & Francis Journals, vol. 73(1), pages 70-90, January.
    2. Tim Jones & G. Stacy Sirmans, 2019. "Understanding Subprime Mortgage Default," Journal of Real Estate Literature, Taylor & Francis Journals, vol. 27(1), pages 27-52, August.
    3. Hong Wang & Qingsong Xu & Lifeng Zhou, 2015. "Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble," PLOS ONE, Public Library of Science, vol. 10(2), pages 1-20, February.
    4. Ahmed Almustfa Hussin Adam Khatir & Marco Bee, 2022. "Machine Learning Models and Data-Balancing Techniques for Credit Scoring: What Is the Best Combination?," Risks, MDPI, vol. 10(9), pages 1-22, August.
    5. Li Shang & Biao Zhou & Jiannan Li & Decai Tang & Valentina Boamah & Zhiwei Pan, 2024. "Evaluating financial fragility: a case study of Chinese banking and finance systems," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-9, December.
    6. Cukierman, Alex, 2019. "A retrospective on the subprime crisis and its aftermath ten years after Lehman’s collapse," Economic Systems, Elsevier, vol. 43(3).
    7. Shen, Feng & Zhao, Xingchao & Li, Zhiyong & Li, Ke & Meng, Zhiyi, 2019. "A novel ensemble classification model based on neural networks and a classifier optimisation technique for imbalanced credit risk evaluation," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 526(C).
    8. Chen, Yujia & Calabrese, Raffaella & Martin-Barragan, Belen, 2024. "Interpretable machine learning for imbalanced credit scoring datasets," European Journal of Operational Research, Elsevier, vol. 312(1), pages 357-372.
    9. Martens, David & Baesens, Bart & Van Gestel, Tony & Vanthienen, Jan, 2007. "Comprehensible credit scoring models using rule extraction from support vector machines," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1466-1476, December.
    10. Martin Leo & Suneel Sharma & K. Maddulety, 2019. "Machine Learning in Banking Risk Management: A Literature Review," Risks, MDPI, vol. 7(1), pages 1-22, March.
    11. Jing Quan & Xuelian Sun, 2024. "Credit risk assessment using the factorization machine model with feature interactions," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-10, December.
    12. Laura Cristina Lanzarini & Augusto Villa Monte & Aurelio F. Bariviera & Patricia Jimbo Santana, 2017. "Simplifying credit scoring rules using LVQ+PSO," Papers 1704.04450, arXiv.org.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nadia Ayed & Khemaies Bougatef, 2024. "Performance Assessment of Logistic Regression (LR), Artificial Neural Network (ANN), Fuzzy Inference System (FIS) and Adaptive Neuro-Fuzzy System (ANFIS) in Predicting Default Probability: The Case of," Computational Economics, Springer;Society for Computational Economics, vol. 64(3), pages 1803-1835, September.
    2. Janssens, Bram & Schetgen, Lisa & Bogaert, Matthias & Meire, Matthijs & Van den Poel, Dirk, 2024. "360 Degrees rumor detection: When explanations got some explaining to do," European Journal of Operational Research, Elsevier, vol. 317(2), pages 366-381.
    3. Przemys{l}aw Biecek & Marcin Chlebus & Janusz Gajda & Alicja Gosiewska & Anna Kozak & Dominik Ogonowski & Jakub Sztachelski & Piotr Wojewnik, 2021. "Enabling Machine Learning Algorithms for Credit Scoring -- Explainable Artificial Intelligence (XAI) methods for clear understanding complex predictive models," Papers 2104.06735, arXiv.org.
    4. Bauer, Kevin & Pfeuffer, Nicolas & Abdel-Karim, Benjamin M. & Hinz, Oliver & Kosfeld, Michael, 2020. "The terminator of social welfare? The economic consequences of algorithmic discrimination," SAFE Working Paper Series 287, Leibniz Institute for Financial Research SAFE.
    5. Andrés Alonso & José Manuel Carbó, 2022. "Accuracy of explanations of machine learning models for credit decisions," Working Papers 2222, Banco de España.
    6. John R. J. Thompson & Longlong Feng & R. Mark Reesor & Chuck Grace, 2021. "Know Your Clients’ Behaviours: A Cluster Analysis of Financial Transactions," JRFM, MDPI, vol. 14(2), pages 1-29, January.
    7. Derhami, Shahab & Smith, Alice E., 2017. "An integer programming approach for fuzzy rule-based classification systems," European Journal of Operational Research, Elsevier, vol. 256(3), pages 924-934.
    8. Li, Yibei & Wang, Ximei & Djehiche, Boualem & Hu, Xiaoming, 2020. "Credit scoring by incorporating dynamic networked information," European Journal of Operational Research, Elsevier, vol. 286(3), pages 1103-1112.
    9. Faraz Ahmed & Kehkashan Nizam & Zubair Sajid & Sunain Qamar & Ahsan, 2024. "Striking a Balance: Evaluating Credit Risk with Traditional and Machine Learning Models," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 13(3), pages 30-35.
    10. Loterman, Gert & Brown, Iain & Martens, David & Mues, Christophe & Baesens, Bart, 2012. "Benchmarking regression algorithms for loss given default modeling," International Journal of Forecasting, Elsevier, vol. 28(1), pages 161-170.
    11. Zhang, Zhiwang & Gao, Guangxia & Shi, Yong, 2014. "Credit risk evaluation using multi-criteria optimization classifier with kernel, fuzzification and penalty factors," European Journal of Operational Research, Elsevier, vol. 237(1), pages 335-348.
    12. Yu, Lean & Wang, Shouyang & Lai, Kin Keung, 2009. "An intelligent-agent-based fuzzy group decision making model for financial multicriteria decision support: The case of credit scoring," European Journal of Operational Research, Elsevier, vol. 195(3), pages 942-959, June.
    13. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W., 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," European Journal of Operational Research, Elsevier, vol. 269(2), pages 760-772.
    14. Shiqi Fang & Zexun Chen & Jake Ansell, 2024. "Peer-induced Fairness: A Causal Approach for Algorithmic Fairness Auditing," Papers 2408.02558, arXiv.org, revised Sep 2024.
    15. Ming-Fu Hsu & Ying-Shao Hsin & Fu-Jiing Shiue, 2022. "Business analytics for corporate risk management and performance improvement," Annals of Operations Research, Springer, vol. 315(2), pages 629-669, August.
    16. Yajiao Tang & Junkai Ji & Yulin Zhu & Shangce Gao & Zheng Tang & Yuki Todo, 2019. "A Differential Evolution-Oriented Pruning Neural Network Model for Bankruptcy Prediction," Complexity, Hindawi, vol. 2019, pages 1-21, August.
    17. Keerthana Sivamayil & Elakkiya Rajasekar & Belqasem Aljafari & Srete Nikolovski & Subramaniyaswamy Vairavasundaram & Indragandhi Vairavasundaram, 2023. "A Systematic Study on Reinforcement Learning Based Applications," Energies, MDPI, vol. 16(3), pages 1-23, February.
    18. Bauer, Kevin & Nofer, Michael & Abdel-Karim, Benjamin M. & Hinz, Oliver, 2022. "The effects of discontinuing machine learning decision support," SAFE Working Paper Series 370, Leibniz Institute for Financial Research SAFE.
    19. Pejman Peykani & Mostafa Sargolzaei & Mohammad Hashem Botshekan & Camelia Oprean-Stan & Amir Takaloo, 2023. "Optimization of Asset and Liability Management of Banks with Minimum Possible Changes," Mathematics, MDPI, vol. 11(12), pages 1-24, June.
    20. Doumpos, Michael & Zopounidis, Constantin, 2011. "Preference disaggregation and statistical learning for multicriteria decision support: A review," European Journal of Operational Research, Elsevier, vol. 209(3), pages 203-214, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pal:palcom:v:11:y:2024:i:1:d:10.1057_s41599-024-04047-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: https://www.nature.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.