IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2401.12652.html
   My bibliography  Save this paper

From Numbers to Words: Multi-Modal Bankruptcy Prediction Using the ECL Dataset

Author

Listed:
  • Henri Arno
  • Klaas Mulier
  • Joke Baeck
  • Thomas Demeester

Abstract

In this paper, we present ECL, a novel multi-modal dataset containing the textual and numerical data from corporate 10K filings and associated binary bankruptcy labels. Furthermore, we develop and critically evaluate several classical and neural bankruptcy prediction models using this dataset. Our findings suggest that the information contained in each data modality is complementary for bankruptcy prediction. We also see that the binary bankruptcy prediction target does not enable our models to distinguish next year bankruptcy from an unhealthy financial situation resulting in bankruptcy in later years. Finally, we explore the use of LLMs in the context of our task. We show how GPT-based models can be used to extract meaningful summaries from the textual data but zero-shot bankruptcy prediction results are poor. All resources required to access and update the dataset or replicate our experiments are available on github.com/henriarnoUG/ECL.

Suggested Citation

  • Henri Arno & Klaas Mulier & Joke Baeck & Thomas Demeester, 2024. "From Numbers to Words: Multi-Modal Bankruptcy Prediction Using the ECL Dataset," Papers 2401.12652, arXiv.org.
  • Handle: RePEc:arx:papers:2401.12652
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2401.12652
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Mai, Feng & Tian, Shaonan & Lee, Chihoon & Ma, Ling, 2019. "Deep learning models for bankruptcy prediction using textual disclosures," European Journal of Operational Research, Elsevier, vol. 274(2), pages 743-758.
    2. Beaver, Wh, 1966. "Financial Ratios As Predictors Of Failure," Journal of Accounting Research, Wiley Blackwell, vol. 4, pages 71-111.
    3. Bernanke, Ben S, 1981. "Bankruptcy, Liquidity, and Recession," American Economic Review, American Economic Association, vol. 71(2), pages 155-159, May.
    4. Ohlson, Ja, 1980. "Financial Ratios And The Probabilistic Prediction Of Bankruptcy," Journal of Accounting Research, Wiley Blackwell, vol. 18(1), pages 109-131.
    5. Beaver, Wh, 1966. "Financial Ratios As Predictors Of Failure - Reply," Journal of Accounting Research, Wiley Blackwell, vol. 4, pages 123-127.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Henri Arno & Klaas Mulier & Joke Baeck & Thomas Demeester, 2022. "Next-Year Bankruptcy Prediction from Textual Data: Benchmark and Baselines," Papers 2208.11334, arXiv.org.
    2. Zhao, Qi & Xu, Weijun & Ji, Yucheng, 2023. "Predicting financial distress of Chinese listed companies using machine learning: To what extent does textual disclosure matter?," International Review of Financial Analysis, Elsevier, vol. 89(C).
    3. Zhou, Fanyin & Fu, Lijun & Li, Zhiyong & Xu, Jiawei, 2022. "The recurrence of financial distress: A survival analysis," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1100-1115.
    4. Vladislav V. Afanasev & Yulia A. Tarasova, 2022. "Default Prediction for Housing and Utilities Management Firms Using Non-Financial Data," Finansovyj žhurnal — Financial Journal, Financial Research Institute, Moscow 125375, Russia, issue 6, pages 91-110, December.
    5. Jiaming Liu & Chengzhang Li & Peng Ouyang & Jiajia Liu & Chong Wu, 2023. "Interpreting the prediction results of the tree‐based gradient boosting models for financial distress prediction with an explainable machine learning approach," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(5), pages 1112-1137, August.
    6. Ben Jabeur, Sami & Serret, Vanessa, 2023. "Bankruptcy prediction using fuzzy convolutional neural networks," Research in International Business and Finance, Elsevier, vol. 64(C).
    7. Elsayed, Mohamed & Elshandidy, Tamer, 2020. "Do narrative-related disclosures predict corporate failure? Evidence from UK non-financial publicly quoted firms," International Review of Financial Analysis, Elsevier, vol. 71(C).
    8. Andrzej Geise & Magdalena Kuczmarska & Jarosław Pawlowski, 2021. "Corporate Failure Prediction of Construction Companies in Poland: Evidence from Logit Model," European Research Studies Journal, European Research Studies Journal, vol. 0(1), pages 99-116.
    9. Antonio Davila & George Foster & Xiaobin He & Carlos Shimizu, 2015. "The rise and fall of startups: Creation and destruction of revenue and jobs by young companies," Australian Journal of Management, Australian School of Business, vol. 40(1), pages 6-35, February.
    10. Li, Chunyu & Lou, Chenxin & Luo, Dan & Xing, Kai, 2021. "Chinese corporate distress prediction using LASSO: The role of earnings management," International Review of Financial Analysis, Elsevier, vol. 76(C).
    11. Guido Max Mantovani & Gregory Gadzinski, 2022. "How to Rate the Financial Performance of Private Companies? A Tailored Integrated Rating Methodology Applied to North-Eastern Italian Districts," JRFM, MDPI, vol. 15(11), pages 1-18, October.
    12. Enrico Supino & Nicola Piras, 2022. "Le performance dei modelli di credit scoring in contesti di forte instabilit? macroeconomica: il ruolo delle Reti Neurali Artificiali," MANAGEMENT CONTROL, FrancoAngeli Editore, vol. 2022(2), pages 41-61.
    13. Adriana Csikosova & Maria Janoskova & Katarina Culkova, 2020. "Application of Discriminant Analysis for Avoiding the Risk of Quarry Operation Failure," JRFM, MDPI, vol. 13(10), pages 1-14, September.
    14. Haoming Wang & Xiangdong Liu, 2021. "Undersampling bankruptcy prediction: Taiwan bankruptcy data," PLOS ONE, Public Library of Science, vol. 16(7), pages 1-17, July.
    15. Trueck, Stefan & Rachev, Svetlozar T., 2008. "Rating Based Modeling of Credit Risk," Elsevier Monographs, Elsevier, edition 1, number 9780123736833.
    16. Le, Hong Hanh & Viviani, Jean-Laurent, 2018. "Predicting bank failure: An improvement by implementing a machine-learning approach to classical financial ratios," Research in International Business and Finance, Elsevier, vol. 44(C), pages 16-25.
    17. Arati Kale & Devendra Kale & Sriram Villupuram, 2024. "Decomposition of risk for small size and low book-to-market stocks," Journal of Asset Management, Palgrave Macmillan, vol. 25(1), pages 96-112, February.
    18. Ahsan Habib & Mabel D' Costa & Hedy Jiaying Huang & Md. Borhan Uddin Bhuiyan & Li Sun, 2020. "Determinants and consequences of financial distress: review of the empirical literature," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 60(S1), pages 1023-1075, April.
    19. Serrano-Cinca, Carlos & Gutiérrez-Nieto, Begoña & Bernate-Valbuena, Martha, 2019. "The use of accounting anomalies indicators to predict business failure," European Management Journal, Elsevier, vol. 37(3), pages 353-375.
    20. Buchetti, Bruno & Miquel-Flores, Ixart & Perdichizzi, Salvatore & Reghezza, Alessio & Lin, Luca X., 2024. "Loan guarantee and portfolio greening: evidence from European credit registers," Working Paper Series 2916, European Central Bank.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2401.12652. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.