IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2208.11334.html
   My bibliography  Save this paper

Next-Year Bankruptcy Prediction from Textual Data: Benchmark and Baselines

Author

Listed:
  • Henri Arno
  • Klaas Mulier
  • Joke Baeck
  • Thomas Demeester

Abstract

Models for bankruptcy prediction are useful in several real-world scenarios, and multiple research contributions have been devoted to the task, based on structured (numerical) as well as unstructured (textual) data. However, the lack of a common benchmark dataset and evaluation strategy impedes the objective comparison between models. This paper introduces such a benchmark for the unstructured data scenario, based on novel and established datasets, in order to stimulate further research into the task. We describe and evaluate several classical and neural baseline models, and discuss benefits and flaws of different strategies. In particular, we find that a lightweight bag-of-words model based on static in-domain word representations obtains surprisingly good results, especially when taking textual data from several years into account. These results are critically assessed, and discussed in light of particular aspects of the data and the task. All code to replicate the data and experimental results will be released.

Suggested Citation

  • Henri Arno & Klaas Mulier & Joke Baeck & Thomas Demeester, 2022. "Next-Year Bankruptcy Prediction from Textual Data: Benchmark and Baselines," Papers 2208.11334, arXiv.org.
  • Handle: RePEc:arx:papers:2208.11334
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2208.11334
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Balcaen, Sofie & Ooghe, Hubert, 2006. "35 years of studies on business failure: an overview of the classic statistical methodologies and their related problems," The British Accounting Review, Elsevier, vol. 38(1), pages 63-93.
    2. Mai, Feng & Tian, Shaonan & Lee, Chihoon & Ma, Ling, 2019. "Deep learning models for bankruptcy prediction using textual disclosures," European Journal of Operational Research, Elsevier, vol. 274(2), pages 743-758.
    3. Beaver, Wh, 1966. "Financial Ratios As Predictors Of Failure," Journal of Accounting Research, Wiley Blackwell, vol. 4, pages 71-111.
    4. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    5. Bernanke, Ben S, 1981. "Bankruptcy, Liquidity, and Recession," American Economic Review, American Economic Association, vol. 71(2), pages 155-159, May.
    6. Shumway, Tyler, 2001. "Forecasting Bankruptcy More Accurately: A Simple Hazard Model," The Journal of Business, University of Chicago Press, vol. 74(1), pages 101-124, January.
    7. Edward I. Altman, 1968. "The Prediction Of Corporate Bankruptcy: A Discriminant Analysis," Journal of Finance, American Finance Association, vol. 23(1), pages 193-194, March.
    8. Ohlson, Ja, 1980. "Financial Ratios And The Probabilistic Prediction Of Bankruptcy," Journal of Accounting Research, Wiley Blackwell, vol. 18(1), pages 109-131.
    9. Beaver, Wh, 1966. "Financial Ratios As Predictors Of Failure - Reply," Journal of Accounting Research, Wiley Blackwell, vol. 4, pages 123-127.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhou, Fanyin & Fu, Lijun & Li, Zhiyong & Xu, Jiawei, 2022. "The recurrence of financial distress: A survival analysis," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1100-1115.
    2. Adriana Csikosova & Maria Janoskova & Katarina Culkova, 2020. "Application of Discriminant Analysis for Avoiding the Risk of Quarry Operation Failure," JRFM, MDPI, vol. 13(10), pages 1-14, September.
    3. Serrano-Cinca, Carlos & Gutiérrez-Nieto, Begoña & Bernate-Valbuena, Martha, 2019. "The use of accounting anomalies indicators to predict business failure," European Management Journal, Elsevier, vol. 37(3), pages 353-375.
    4. Kumar, Rahul & Deb, Soumya Guha & Mukherjee, Shubhadeep, 2020. "Do words reveal the latent truth? Identifying communication patterns of corporate losers," Journal of Behavioral and Experimental Finance, Elsevier, vol. 26(C).
    5. Mohammad Mahdi Mousavi & Jamal Ouenniche & Kaoru Tone, 2023. "A dynamic performance evaluation of distress prediction models," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(4), pages 756-784, July.
    6. Scalzer, Rodrigo S. & Rodrigues, Adriano & Macedo, Marcelo Álvaro da S. & Wanke, Peter, 2019. "Financial distress in electricity distributors from the perspective of Brazilian regulation," Energy Policy, Elsevier, vol. 125(C), pages 250-259.
    7. Xavier Brédart & Eric Séverin & David Veganzones, 2021. "Human resources and corporate failure prediction modeling: Evidence from Belgium," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 40(7), pages 1325-1341, November.
    8. Frank Ranganai Matenda & Mabutho Sibanda & Eriyoti Chikodza & Victor Gumbo, 2022. "Bankruptcy prediction for private firms in developing economies: a scoping review and guidance for future research," Management Review Quarterly, Springer, vol. 72(4), pages 927-966, December.
    9. Francesco Ciampi & Valentina Cillo & Fabio Fiano, 2020. "Combining Kohonen maps and prior payment behavior for small enterprise default prediction," Small Business Economics, Springer, vol. 54(4), pages 1007-1039, April.
    10. Gila Burde, 2018. "Improved Methods for Predicting the Financial Vulnerability of Nonprofit Organizations," Administrative Sciences, MDPI, vol. 8(1), pages 1-8, February.
    11. Andrzej Geise & Magdalena Kuczmarska & Jarosław Pawlowski, 2021. "Corporate Failure Prediction of Construction Companies in Poland: Evidence from Logit Model," European Research Studies Journal, European Research Studies Journal, vol. 0(1), pages 99-116.
    12. Christian Lohmann & Thorsten Ohliger, 2020. "Bankruptcy prediction and the discriminatory power of annual reports: empirical evidence from financially distressed German companies," Journal of Business Economics, Springer, vol. 90(1), pages 137-172, February.
    13. Li, Chunyu & Lou, Chenxin & Luo, Dan & Xing, Kai, 2021. "Chinese corporate distress prediction using LASSO: The role of earnings management," International Review of Financial Analysis, Elsevier, vol. 76(C).
    14. Haoming Wang & Xiangdong Liu, 2021. "Undersampling bankruptcy prediction: Taiwan bankruptcy data," PLOS ONE, Public Library of Science, vol. 16(7), pages 1-17, July.
    15. Ahsan Habib & Mabel D' Costa & Hedy Jiaying Huang & Md. Borhan Uddin Bhuiyan & Li Sun, 2020. "Determinants and consequences of financial distress: review of the empirical literature," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 60(S1), pages 1023-1075, April.
    16. Hamid Waqas & Rohani Md-Rus, 2018. "Predicting financial distress: Applicability of O-score model for Pakistani firms," Business and Economic Horizons (BEH), Prague Development Center, vol. 14(2), pages 389-401, April.
    17. Giesecke, Kay & Longstaff, Francis A. & Schaefer, Stephen & Strebulaev, Ilya, 2011. "Corporate bond default risk: A 150-year perspective," Journal of Financial Economics, Elsevier, vol. 102(2), pages 233-250.
    18. Mousavi, Mohammad M. & Ouenniche, Jamal & Xu, Bing, 2015. "Performance evaluation of bankruptcy prediction models: An orientation-free super-efficiency DEA-based framework," International Review of Financial Analysis, Elsevier, vol. 42(C), pages 64-75.
    19. David Veganzones, 2022. "Corporate failure prediction using threshold‐based models," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(5), pages 956-979, August.
    20. Ilyes Abid & Farid Mkaouar & Olfa Kaabia, 2018. "Dynamic analysis of the forecasting bankruptcy under presence of unobserved heterogeneity," Annals of Operations Research, Springer, vol. 262(2), pages 241-256, March.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2208.11334. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.