IDEAS home Printed from https://ideas.repec.org/a/wsi/ijfexx/v08y2021i03ns2424786321410048.html
   My bibliography  Save this article

The extraction of early warning features for predicting financial distress based on XGBoost model and shap framework

Author

Listed:
  • He Yang

    (School of Math. & Stats., Zhengzhou University, Zhengzhou 450001, P. R. China)

  • Emma Li

    (#x2020;Henan Experimental High School, Zhengzhou 450001, P. R. China)

  • Yi Fang Cai

    (School of Math. & Stats., Zhengzhou University, Zhengzhou 450001, P. R. China)

  • Jiapei Li

    (#x2021;Henan Key Lab of Financial Engineering, Zhengzhou University, Zhengzhou 450001, P. R. China)

  • George X. Yuan

    (#xA7;Business School, Guangxi University, Nanning 530004, P. R. China¶Business School, Sun Yat-Sen University, Guangzhou 510275, P. R. China∥Business School, Chengdu University, Chengdu 610106, P. R. China**BBD Technology Co., Ltd., No. 966-#9 Building, Tianfu Avenue, Chengdu 610093, P. R. China)

Abstract

The purpose of this paper is to establish a framework for the extraction of early warning risk features for the predicting financial distress based on XGBoost model and SHAP. It is well known that the way to construct early warning risk features to predict financial distress of companies is very important, and by comparing with the traditional statistical methods, though the data-driven machine learning for the financial early warning, modelling has a better performance in terms of prediction accuracy, but it also brings the difficulty such as the one the corresponding model may be not explained well. Recently, eXtreme Gradient Boosting (XGBoost), an ensemble learning algorithm based on extreme gradient boosting, has become a hot topic in the area of machine learning research field due to its strong nonlinear information recognition ability and high prediction accuracy in the practice. In this study, the XGBoost algorithm is used to extract early warning features for the predicting financial distress for listed companies, with 76 financial risk features from seven categories of aspects, and 14 non-financial risk features from four categories of aspects, which are collected to establish an early warning system for the predication of financial distress. With applications, we conduct the empirical testing respect to AUC, KS and Kappa, the numerical results show that by comparing with the Logistic model, our method based on XGBoost model established in this paper has much better ability to predict the financial distress risk of listed companies. Moreover, under the framework of SHAP (SHAPley Additive exPlanations), we are able to give a reasonable explanation for important risk features and influencing ways affecting the financial distress visibly. The results given by this paper show that the XGBoost approach to model early warning features for financial distress does not only preform a better prediction accuracy, but also is explainable, which is significant for the identification of early warning to the financial distress risk for listed companies in the practice.

Suggested Citation

  • He Yang & Emma Li & Yi Fang Cai & Jiapei Li & George X. Yuan, 2021. "The extraction of early warning features for predicting financial distress based on XGBoost model and shap framework," International Journal of Financial Engineering (IJFE), World Scientific Publishing Co. Pte. Ltd., vol. 8(03), pages 1-24, September.
  • Handle: RePEc:wsi:ijfexx:v:08:y:2021:i:03:n:s2424786321410048
    DOI: 10.1142/S2424786321410048
    as

    Download full text from publisher

    File URL: http://www.worldscientific.com/doi/abs/10.1142/S2424786321410048
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1142/S2424786321410048?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:ijfexx:v:08:y:2021:i:03:n:s2424786321410048. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscientific.com/worldscinet/ijfe .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.