IDEAS home Printed from https://ideas.repec.org/a/eee/reveco/v76y2021icp40-54.html
   My bibliography  Save this article

Using boosting algorithms to predict bank failure: An untold story

Author

Listed:
  • Pham, Xuan T.T.
  • Ho, Tin H.

Abstract

From a modeling point of view, our work provides a novel approach to better use XGBoost for bank failure prediction, determining the essential technical aspects that can improve the predictive accuracy. Of these technical aspects, the two crucial factors are assigning correct values to target variables and careful predictor selection (through ANOVA, correlation, information value tests, and weight of evidence). We also highlight that bank failure could be predicted four to five quarters earlier when all predictive signals simultaneously appear. Hence, we strongly suggest using quarterly data instead of yearly data. In addition to practical implications, our present work also contributed to the existing literature. We confirm the results of existing studies that emphasized that XGBoost has strong predictive power (Carmona, Climent, and Momparler (2018)). Moreover, we provide evidence that XGBoost outperforms other models in the same boosting family, including gradient boosting and AdaBoost, through an intensive comparison of predictive power. These contributions might facilitate future work on bank failure prediction.

Suggested Citation

  • Pham, Xuan T.T. & Ho, Tin H., 2021. "Using boosting algorithms to predict bank failure: An untold story," International Review of Economics & Finance, Elsevier, vol. 76(C), pages 40-54.
  • Handle: RePEc:eee:reveco:v:76:y:2021:i:c:p:40-54
    DOI: 10.1016/j.iref.2021.05.005
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1059056021001131
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.iref.2021.05.005?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Le, Hong Hanh & Viviani, Jean-Laurent, 2018. "Predicting bank failure: An improvement by implementing a machine-learning approach to classical financial ratios," Research in International Business and Finance, Elsevier, vol. 44(C), pages 16-25.
    2. Gennaioli, Nicola & Martin, Alberto & Rossi, Stefano, 2018. "Banks, government Bonds, and Default: What do the data Say?," Journal of Monetary Economics, Elsevier, vol. 98(C), pages 98-113.
    3. Karlan, Dean & Morduch, Jonathan, 2010. "Access to Finance," Handbook of Development Economics, in: Dani Rodrik & Mark Rosenzweig (ed.), Handbook of Development Economics, edition 1, volume 5, chapter 0, pages 4703-4784, Elsevier.
    4. Chiaramonte, Laura & Croci, Ettore & Poli, Federica, 2015. "Should we trust the Z-score? Evidence from the European Banking Industry," Global Finance Journal, Elsevier, vol. 28(C), pages 111-131.
    5. Kaushik Bhattacharya, 2003. "How good is the BankScope database? A cross-validation exercise with correction factors for market concentration measures," BIS Working Papers 133, Bank for International Settlements.
    6. Climent, Francisco & Momparler, Alexandre & Carmona, Pedro, 2019. "Anticipating bank distress in the Eurozone: An Extreme Gradient Boosting approach," Journal of Business Research, Elsevier, vol. 101(C), pages 885-896.
    7. Ravi Kumar, P. & Ravi, V., 2007. "Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review," European Journal of Operational Research, Elsevier, vol. 180(1), pages 1-28, July.
    8. Friedman, Jerome H., 2002. "Stochastic gradient boosting," Computational Statistics & Data Analysis, Elsevier, vol. 38(4), pages 367-378, February.
    9. Beutel, Johannes & List, Sophia & von Schweinitz, Gregor, 2019. "Does machine learning help us predict banking crises?," Journal of Financial Stability, Elsevier, vol. 45(C).
    10. Canbas, Serpil & Cabuk, Altan & Kilic, Suleyman Bilgin, 2005. "Prediction of commercial bank failure via multivariate statistical analysis of financial structures: The Turkish case," European Journal of Operational Research, Elsevier, vol. 166(2), pages 528-546, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Citterio, Alberto, 2024. "Bank failure prediction models: Review and outlook," Socio-Economic Planning Sciences, Elsevier, vol. 92(C).
    2. Hamed Mirashk & Amir Albadvi & Mehrdad Kargari & Mohammad Ali Rastegar, 2024. "News Sentiment and Liquidity Risk Forecasting: Insights from Iranian Banks," Risks, MDPI, vol. 12(11), pages 1-32, October.
    3. Zhiyong Li & Chen Feng & Ying Tang, 2022. "Bank efficiency and failure prediction: a nonparametric and dynamic model based on data envelopment analysis," Annals of Operations Research, Springer, vol. 315(1), pages 279-315, August.
    4. Chen, Dangxing & Ye, Jiahui & Ye, Weicheng, 2023. "Interpretable selective learning in credit risk," Research in International Business and Finance, Elsevier, vol. 65(C).
    5. Kristóf, Tamás & Virág, Miklós, 2022. "EU-27 bank failure prediction with C5.0 decision trees and deep learning neural networks," Research in International Business and Finance, Elsevier, vol. 61(C).
    6. Jiaming Liu & Chengzhang Li & Peng Ouyang & Jiajia Liu & Chong Wu, 2023. "Interpreting the prediction results of the tree‐based gradient boosting models for financial distress prediction with an explainable machine learning approach," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(5), pages 1112-1137, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Manthoulis, Georgios & Doumpos, Michalis & Zopounidis, Constantin & Galariotis, Emilios, 2020. "An ordinal classification framework for bank failure prediction: Methodology and empirical evidence for US banks," European Journal of Operational Research, Elsevier, vol. 282(2), pages 786-801.
    2. Citterio, Alberto, 2024. "Bank failure prediction models: Review and outlook," Socio-Economic Planning Sciences, Elsevier, vol. 92(C).
    3. Li Xian Liu & Shuangzhe Liu & Milind Sathye, 2021. "Predicting Bank Failures: A Synthesis of Literature and Directions for Future Research," JRFM, MDPI, vol. 14(10), pages 1-24, October.
    4. Le, Hong Hanh & Viviani, Jean-Laurent, 2018. "Predicting bank failure: An improvement by implementing a machine-learning approach to classical financial ratios," Research in International Business and Finance, Elsevier, vol. 44(C), pages 16-25.
    5. Carmona, Pedro & Dwekat, Aladdin & Mardawi, Zeena, 2022. "No more black boxes! Explaining the predictions of a machine learning XGBoost classifier algorithm in business failure," Research in International Business and Finance, Elsevier, vol. 61(C).
    6. Kristóf, Tamás & Virág, Miklós, 2022. "EU-27 bank failure prediction with C5.0 decision trees and deep learning neural networks," Research in International Business and Finance, Elsevier, vol. 61(C).
    7. Jabeur, Sami Ben & Gharib, Cheima & Mefteh-Wali, Salma & Arfi, Wissal Ben, 2021. "CatBoost model and artificial intelligence techniques for corporate failure prediction," Technological Forecasting and Social Change, Elsevier, vol. 166(C).
    8. Fethi, Meryem Duygun & Pasiouras, Fotios, 2010. "Assessing bank efficiency and performance with operational research and artificial intelligence techniques: A survey," European Journal of Operational Research, Elsevier, vol. 204(2), pages 189-198, July.
    9. repec:zbw:bofrdp:2009_035 is not listed on IDEAS
    10. Demyanyk, Yuliya & Hasan, Iftekhar, 2009. "Financial crises and bank failures: a review of prediction methods," Bank of Finland Research Discussion Papers 35/2009, Bank of Finland.
    11. Buckmann, Marcus & Gallego Marquez, Paula & Gimpelewicz, Mariana & Kapadia, Sujit & Rismanchi, Katie, 2021. "The more the merrier? Evidence from the global financial crisis on the value of multiple requirements in bank regulation," Bank of England working papers 905, Bank of England.
    12. Zeineb Affes & Rania Hentati-Kaffel, 2016. "Predicting US banks bankruptcy: logit versus Canonical Discriminant analysis," Post-Print halshs-01281948, HAL.
    13. José Alejandro Fernández Fernández & Virginia Bejarano Vázquez & Juan Antonio Vicente Virseda, 2019. "Evaluación de riesgos con Data Mining: el sistema financiero español," Remef - Revista Mexicana de Economía y Finanzas Nueva Época REMEF (The Mexican Journal of Economics and Finance), Instituto Mexicano de Ejecutivos de Finanzas, IMEF, vol. 14(3), pages 309-328, Julio - S.
    14. Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
    15. Zeineb Affes & Rania Hentati-Kaffel, 2016. "Predicting US banks bankruptcy: logit versus Canonical Discriminant analysis," Documents de travail du Centre d'Economie de la Sorbonne 16016, Université Panthéon-Sorbonne (Paris 1), Centre d'Economie de la Sorbonne.
    16. Vadlamani Ravi & Vadlamani Madhav, 2020. "Optimizing the reliability of a bank with Logistic Regression and Particle Swarm Optimization," Papers 2004.11122, arXiv.org.
    17. Ioannidis, Christos & Pasiouras, Fotios & Zopounidis, Constantin, 2010. "Assessing bank soundness with classification techniques," Omega, Elsevier, vol. 38(5), pages 345-357, October.
    18. Tyler Pike & Horacio Sapriza & Tom Zimmermann, 2019. "Bottom-up Leading Macroeconomic Indicators: An Application to Non-Financial Corporate Defaults using Machine Learning," Finance and Economics Discussion Series 2019-070, Board of Governors of the Federal Reserve System (U.S.).
    19. Papanikolaou, Nikolaos I., 2018. "To be bailed out or to be left to fail? A dynamic competing risks hazard analysis," Journal of Financial Stability, Elsevier, vol. 34(C), pages 61-85.
    20. fernández, María t. Tascón & gutiérrez, Francisco J. Castaño, 2012. "Variables y Modelos Para La Identificación y Predicción Del Fracaso Empresarial: Revisión de La Investigación Empírica Reciente," Revista de Contabilidad - Spanish Accounting Review, Elsevier, vol. 15(1), pages 7-58.
    21. Suss, Joel & Treitel, Henry, 2019. "Predicting bank distress in the UK with machine learning," Bank of England working papers 831, Bank of England.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:reveco:v:76:y:2021:i:c:p:40-54. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/620165 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.