IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2411.01928.html
   My bibliography  Save this paper

Datasets for Advanced Bankruptcy Prediction: A survey and Taxonomy

Author

Listed:
  • Xinlin Wang
  • Zs'ofia Kraussl
  • Mats Brorsson

Abstract

Bankruptcy prediction is an important research area that heavily relies on data science. It aims to help investors, managers, and regulators better understand the operational status of corporations and predict potential financial risks in advance. To improve prediction, researchers and practitioners have begun to utilize a variety of different types of data, ranging from traditional financial indicators to unstructured data, to aid in the construction and optimization of bankruptcy forecasting models. Over time, not only instrumentalized data improved, but also instrumentalized methodology for data structuring, cleaning, and analysis. With the aid of advanced analytical techniques that deploy machine learning and deep learning algorithms, bankruptcy assessment became more accurate over time. However, due to the sensitivity of financial data, the scarcity of valid public datasets remains a key bottleneck for the rapid modeling and evaluation of machine learning algorithms for targeted tasks. This study therefore introduces a taxonomy of datasets for bankruptcy research, and summarizes their characteristics. This paper also proposes a set of metrics to measure the quality and the informativeness of public datasets The taxonomy, coupled with the informativeness measure, thus aims at providing valuable insights to better assist researchers and practitioners in developing potential applications for various aspects of credit assessment and decision making by pointing at appropriate datasets for their studies.

Suggested Citation

  • Xinlin Wang & Zs'ofia Kraussl & Mats Brorsson, 2024. "Datasets for Advanced Bankruptcy Prediction: A survey and Taxonomy," Papers 2411.01928, arXiv.org.
  • Handle: RePEc:arx:papers:2411.01928
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2411.01928
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Róbert Štefko & Jarmila Horváthová & Martina Mokrišová, 2021. "The Application of Graphic Methods and the DEA in Predicting the Risk of Bankruptcy," JRFM, MDPI, vol. 14(5), pages 1-19, May.
    2. Gianfranco Lombardo & Mattia Pellegrino & George Adosoglou & Stefano Cagnoni & Panos M. Pardalos & Agostino Poggi, 2022. "Machine Learning for Bankruptcy Prediction in the American Stock Market: Dataset and Benchmarks," Future Internet, MDPI, vol. 14(8), pages 1-23, August.
    3. Ravi Kumar, P. & Ravi, V., 2007. "Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review," European Journal of Operational Research, Elsevier, vol. 180(1), pages 1-28, July.
    4. Mai, Feng & Tian, Shaonan & Lee, Chihoon & Ma, Ling, 2019. "Deep learning models for bankruptcy prediction using textual disclosures," European Journal of Operational Research, Elsevier, vol. 274(2), pages 743-758.
    5. Róbert Štefko & Jarmila Horváthová & Martina Mokrišová, 2020. "Bankruptcy Prediction with the Use of Data Envelopment Analysis: An Empirical Study of Slovak Businesses," JRFM, MDPI, vol. 13(9), pages 1-15, September.
    6. Yu Zhao & Shaopeng Wei & Yu Guo & Qing Yang & Xingyan Chen & Qing Li & Fuzhen Zhuang & Ji Liu & Gang Kou, 2022. "Combining Intra-Risk and Contagion Risk for Enterprise Bankruptcy Prediction Using Graph Neural Networks," Papers 2202.03874, arXiv.org, revised Jul 2022.
    7. Wu, Y. & Gaunt, C. & Gray, S., 2010. "A comparison of alternative bankruptcy prediction models," Journal of Contemporary Accounting and Economics, Elsevier, vol. 6(1), pages 34-45.
    8. Altman, Edward I. & Marco, Giancarlo & Varetto, Franco, 1994. "Corporate distress diagnosis: Comparisons using linear discriminant analysis and neural networks (the Italian experience)," Journal of Banking & Finance, Elsevier, vol. 18(3), pages 505-529, May.
    9. Ciampi, Francesco, 2015. "Corporate governance characteristics and default prediction modeling for small enterprises. An empirical analysis of Italian firms," Journal of Business Research, Elsevier, vol. 68(5), pages 1012-1025.
    10. Trivedi, Shrawan Kumar, 2020. "A study on credit scoring modeling with different feature selection and machine learning approaches," Technology in Society, Elsevier, vol. 63(C).
    11. Aneta Ptak-Chmielewska, 2019. "Predicting Micro-Enterprise Failures Using Data Mining Techniques," JRFM, MDPI, vol. 12(1), pages 1-17, February.
    12. Platt, Harlan D. & Platt, Marjorie B., 2006. "Understanding Differences Between Financial Distress and Bankruptcy," Review of Applied Economics, Lincoln University, Department of Financial and Business Systems, vol. 2(2), pages 1-17.
    13. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    14. repec:eme:ijlma0:ijlma-04-2014-0032 is not listed on IDEAS
    15. Juan Laborda & Seyong Ryoo, 2021. "Feature Selection in a Credit Scoring Model," Mathematics, MDPI, vol. 9(7), pages 1-22, March.
    16. Maria Kovacova & Tomas Kliestik & Katarina Valaskova & Pavol Durana & Zuzana Juhaszova, 2019. "Systematic review of variables applied in bankruptcy prediction models of Visegrad group countries," Oeconomia Copernicana, Institute of Economic Research, vol. 10(4), pages 743-772, December.
    17. Fernandes, Guilherme Barreto & Artes, Rinaldo, 2016. "Spatial dependence in credit risk and its improvement in credit scoring," European Journal of Operational Research, Elsevier, vol. 249(2), pages 517-524.
    18. Stewart Jones, 2017. "Corporate bankruptcy prediction: a high dimensional analysis," Review of Accounting Studies, Springer, vol. 22(3), pages 1366-1422, September.
    19. Stewart Jones, 2023. "A literature survey of corporate failure prediction models," Journal of Accounting Literature, Emerald Group Publishing Limited, vol. 45(2), pages 364-405, March.
    20. Oliver Lukason & Art Andresson, 2019. "Tax Arrears Versus Financial Ratios in Bankruptcy Prediction," JRFM, MDPI, vol. 12(4), pages 1-13, December.
    21. du Jardin, Philippe, 2015. "Bankruptcy prediction using terminal failure processes," European Journal of Operational Research, Elsevier, vol. 242(1), pages 286-303.
    22. Liang, Deron & Lu, Chia-Chi & Tsai, Chih-Fong & Shih, Guan-An, 2016. "Financial ratios and corporate governance indicators in bankruptcy prediction: A comprehensive study," European Journal of Operational Research, Elsevier, vol. 252(2), pages 561-572.
    23. Petropoulos, Anastasios & Siakoulis, Vasilis & Stavroulakis, Evangelos & Vlachogiannakis, Nikolaos E., 2020. "Predicting bank insolvencies using machine learning techniques," International Journal of Forecasting, Elsevier, vol. 36(3), pages 1092-1113.
    24. Hyeongjun Kim & Hoon Cho & Doojin Ryu, 2022. "Corporate Bankruptcy Prediction Using Machine Learning Methodologies with a Focus on Sequential Data," Computational Economics, Springer;Society for Computational Economics, vol. 59(3), pages 1231-1249, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yu Zhao & Huaming Du & Qing Li & Fuzhen Zhuang & Ji Liu & Gang Kou, 2022. "A Comprehensive Survey on Enterprise Financial Risk Analysis from Big Data Perspective," Papers 2211.14997, arXiv.org, revised May 2023.
    2. Sami Ben Jabeur & Nicolae Stef & Pedro Carmona, 2023. "Bankruptcy Prediction using the XGBoost Algorithm and Variable Importance Feature Engineering," Computational Economics, Springer;Society for Computational Economics, vol. 61(2), pages 715-741, February.
    3. Elena Gregova & Katarina Valaskova & Peter Adamko & Milos Tumpach & Jaroslav Jaros, 2020. "Predicting Financial Distress of Slovak Enterprises: Comparison of Selected Traditional and Learning Algorithms Methods," Sustainability, MDPI, vol. 12(10), pages 1-17, May.
    4. Oliver Lukason & Art Andresson, 2019. "Tax Arrears Versus Financial Ratios in Bankruptcy Prediction," JRFM, MDPI, vol. 12(4), pages 1-13, December.
    5. Tomasz Korol, 2019. "Dynamic Bankruptcy Prediction Models for European Enterprises," JRFM, MDPI, vol. 12(4), pages 1-15, December.
    6. Veganzones, David & Séverin, Eric & Chlibi, Souhir, 2023. "Influence of earnings management on forecasting corporate failure," International Journal of Forecasting, Elsevier, vol. 39(1), pages 123-143.
    7. Borchert, Philipp & Coussement, Kristof & De Caigny, Arno & De Weerdt, Jochen, 2023. "Extending business failure prediction models with textual website content using deep learning," European Journal of Operational Research, Elsevier, vol. 306(1), pages 348-357.
    8. Ben Jabeur, Sami & Serret, Vanessa, 2023. "Bankruptcy prediction using fuzzy convolutional neural networks," Research in International Business and Finance, Elsevier, vol. 64(C).
    9. Mohammad Mahdi Mousavi & Jamal Ouenniche, 2018. "Multi-criteria ranking of corporate distress prediction models: empirical evaluation and methodological contributions," Annals of Operations Research, Springer, vol. 271(2), pages 853-886, December.
    10. Jabeur, Sami Ben & Gharib, Cheima & Mefteh-Wali, Salma & Arfi, Wissal Ben, 2021. "CatBoost model and artificial intelligence techniques for corporate failure prediction," Technological Forecasting and Social Change, Elsevier, vol. 166(C).
    11. Zhou, Fanyin & Fu, Lijun & Li, Zhiyong & Xu, Jiawei, 2022. "The recurrence of financial distress: A survival analysis," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1100-1115.
    12. Serrano-Cinca, Carlos & Gutiérrez-Nieto, Begoña & Bernate-Valbuena, Martha, 2019. "The use of accounting anomalies indicators to predict business failure," European Management Journal, Elsevier, vol. 37(3), pages 353-375.
    13. Noora Alzayed & Rasol Eskandari & Hassan Yazdifar, 2023. "Bank failure prediction: corporate governance and financial indicators," Review of Quantitative Finance and Accounting, Springer, vol. 61(2), pages 601-631, August.
    14. Jarmila Horváthová & Martina Mokrišová & Martin Bača, 2023. "Bankruptcy Prediction for Sustainability of Businesses: The Application of Graph Theoretical Modeling," Mathematics, MDPI, vol. 11(24), pages 1-20, December.
    15. Vadlamani Ravi & Vadlamani Madhav, 2020. "Optimizing the reliability of a bank with Logistic Regression and Particle Swarm Optimization," Papers 2004.11122, arXiv.org.
    16. Sarbjit Singh Oberoi & Sayan Banerjee, 2023. "Bankruptcy Prediction of Indian Banks Using Advanced Analytics," Economic Studies journal, Bulgarian Academy of Sciences - Economic Research Institute, issue 4, pages 22-41.
    17. Yi Cao & Xiaoquan Liu & Jia Zhai & Shan Hua, 2022. "A two‐stage Bayesian network model for corporate bankruptcy prediction," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 27(1), pages 455-472, January.
    18. Keijo Kohv & Oliver Lukason, 2021. "What Best Predicts Corporate Bank Loan Defaults? An Analysis of Three Different Variable Domains," Risks, MDPI, vol. 9(2), pages 1-19, January.
    19. Theodore Metaxas & Athanasios Romanopoulos, 2023. "A Literature Review on the Financial Determinants of Hotel Default," JRFM, MDPI, vol. 16(7), pages 1-19, July.
    20. Youssef Zizi & Amine Jamali-Alaoui & Badreddine El Goumi & Mohamed Oudgou & Abdeslam El Moudden, 2021. "An Optimal Model of Financial Distress Prediction: A Comparative Study between Neural Networks and Logistic Regression," Risks, MDPI, vol. 9(11), pages 1-24, November.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2411.01928. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.