IDEAS home Printed from https://ideas.repec.org/a/eee/jbfina/v102y2019icp177-192.html
   My bibliography  Save this article

Grabit: Gradient tree-boosted Tobit models for default prediction

Author

Listed:
  • Sigrist, Fabio
  • Hirnschall, Christoph

Abstract

A frequent problem in binary classification is class imbalance between a minority and a majority class such as defaults and non-defaults in default prediction. In this article, we introduce a novel binary classification model, the Grabit model, which is obtained by applying gradient tree boosting to the Tobit model. We show how this model can leverage auxiliary data to obtain increased predictive accuracy for imbalanced data. We apply the Grabit model to predicting defaults on loans made to Swiss small and medium-sized enterprises (SME) and obtain a large and significant improvement in predictive performance compared to other state-of-the-art approaches.

Suggested Citation

  • Sigrist, Fabio & Hirnschall, Christoph, 2019. "Grabit: Gradient tree-boosted Tobit models for default prediction," Journal of Banking & Finance, Elsevier, vol. 102(C), pages 177-192.
  • Handle: RePEc:eee:jbfina:v:102:y:2019:i:c:p:177-192
    DOI: 10.1016/j.jbankfin.2019.03.004
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378426619300573
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jbankfin.2019.03.004?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Charles L. Merwin, 1942. "Financing Small Corporations in Five Manufacturing Industries, 1926–36," NBER Books, National Bureau of Economic Research, Inc, number merw42-1.
    2. Koopman, Siem Jan & Lucas, Andre & Monteiro, Andre, 2008. "The multi-state latent factor intensity model for credit rating transitions," Journal of Econometrics, Elsevier, vol. 142(1), pages 399-424, January.
    3. Bauer, Julian & Agarwal, Vineet, 2014. "Are hazard models superior to traditional bankruptcy prediction approaches? A comprehensive test," Journal of Banking & Finance, Elsevier, vol. 40(C), pages 432-442.
    4. Edward I. Altman, 1968. "The Prediction Of Corporate Bankruptcy: A Discriminant Analysis," Journal of Finance, American Finance Association, vol. 23(1), pages 193-194, March.
    5. A. Adam Ding & Shaonan Tian & Yan Yu & Hui Guo, 2012. "A Class of Discrete Transformation Survival Models With Application to Default Probability Prediction," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(499), pages 990-1003, September.
    6. Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
    7. Djeundje, Viani Biatat & Crook, Jonathan, 2018. "Incorporating heterogeneity and macroeconomic variables into multi-state delinquency models for credit cards," European Journal of Operational Research, Elsevier, vol. 271(2), pages 697-709.
    8. Maddala,G. S., 1986. "Limited-Dependent and Qualitative Variables in Econometrics," Cambridge Books, Cambridge University Press, number 9780521338257, September.
    9. Papke, Leslie E. & Wooldridge, Jeffrey M., 2008. "Panel data methods for fractional response variables with an application to test pass rates," Journal of Econometrics, Elsevier, vol. 145(1-2), pages 121-133, July.
    10. Buhlmann P. & Yu B., 2003. "Boosting With the L2 Loss: Regression and Classification," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 324-339, January.
    11. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    12. Edward I. Altman & Gabriele Sabato, 2013. "MODELING CREDIT RISK FOR SMEs: EVIDENCE FROM THE US MARKET," World Scientific Book Chapters, in: Oliviero Roggi & Edward I Altman (ed.), Managing and Measuring Risk Emerging Global Standards and Regulations After the Financial Crisis, chapter 9, pages 251-279, World Scientific Publishing Co. Pte. Ltd..
    13. Zmijewski, Me, 1984. "Methodological Issues Related To The Estimation Of Financial Distress Prediction Models," Journal of Accounting Research, Wiley Blackwell, vol. 22, pages 59-82.
    14. Jones, Stewart & Johnstone, David & Wilson, Roy, 2015. "An empirical evaluation of the performance of binary classifiers in the prediction of credit ratings changes," Journal of Banking & Finance, Elsevier, vol. 56(C), pages 72-85.
    15. Rosett, Richard N & Nelson, Forrest D, 1975. "Estimation of the Two-Limit Probit Regression Model," Econometrica, Econometric Society, vol. 43(1), pages 141-146, January.
    16. Bruno Sansó & Lelys Guenni, 1999. "Venezuelan Rainfall Data Analysed by Using a Bayesian Space–time Model," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 48(3), pages 345-362.
    17. Papke, Leslie E & Wooldridge, Jeffrey M, 1996. "Econometric Methods for Fractional Response Variables with an Application to 401(K) Plan Participation Rates," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 11(6), pages 619-632, Nov.-Dec..
    18. Stewart Jones & David Johnstone & Roy Wilson, 2017. "Predicting Corporate Bankruptcy: An Evaluation of Alternative Statistical Frameworks," Journal of Business Finance & Accounting, Wiley Blackwell, vol. 44(1-2), pages 3-34, January.
    19. Sigrist, Fabio & Stahel, Werner A., 2011. "Using the Censored Gamma Distribution for Modeling Fractional Response Variables with an Application to Loss Given Default," ASTIN Bulletin, Cambridge University Press, vol. 41(2), pages 673-710, November.
    20. Shumway, Tyler, 2001. "Forecasting Bankruptcy More Accurately: A Simple Hazard Model," The Journal of Business, University of Chicago Press, vol. 74(1), pages 101-124, January.
    21. Tian, Shaonan & Yu, Yan & Guo, Hui, 2015. "Variable selection and corporate bankruptcy forecasts," Journal of Banking & Finance, Elsevier, vol. 52(C), pages 89-100.
    22. P G Moffatt, 2005. "Hurdle models of loan default," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 56(9), pages 1063-1071, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Pascal Kundig & Fabio Sigrist, 2024. "A Spatio-Temporal Machine Learning Model for Mortgage Credit Risk: Default Probabilities and Loan Portfolios," Papers 2410.02846, arXiv.org.
    2. Francesco Ciampi & Alessandro Giannozzi & Giacomo Marzi & Edward I. Altman, 2021. "Rethinking SME default prediction: a systematic literature review and future perspectives," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(3), pages 2141-2188, March.
    3. Königstorfer, Florian & Thalmann, Stefan, 2020. "Applications of Artificial Intelligence in commercial banks – A research agenda for behavioral finance," Journal of Behavioral and Experimental Finance, Elsevier, vol. 27(C).
    4. Nicoleta Bărbuță-Mișu & Mara Madaleno, 2020. "Assessment of Bankruptcy Risk of Large Companies: European Countries Evolution Analysis," JRFM, MDPI, vol. 13(3), pages 1-28, March.
    5. He, Yunwen, 2021. "Using your regular contacts as collateral: The information value of call logs," The North American Journal of Economics and Finance, Elsevier, vol. 58(C).
    6. Sigrist, Fabio & Leuenberger, Nicola, 2023. "Machine learning for corporate default risk: Multi-period prediction, frailty correlation, loan portfolios, and tail probabilities," European Journal of Operational Research, Elsevier, vol. 305(3), pages 1390-1406.
    7. Paritosh Navinchandra Jha & Marco Cucculelli, 2021. "A New Model Averaging Approach in Predicting Credit Risk Default," Risks, MDPI, vol. 9(6), pages 1-15, June.
    8. Longyue Liang & Bo Liu & Zhi Su & Xuanye Cai, 2024. "Forecasting corporate financial performance with deep learning and interpretable ALE method: Evidence from China," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(7), pages 2540-2571, November.
    9. Jingjing Long & Cuiqing Jiang & Stanko Dimitrov & Zhao Wang, 2022. "Clues from networks: quantifying relational risk for credit risk evaluation of SMEs," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-41, December.
    10. Li, Huan & Wu, Weixing, 2024. "Loan default predictability with explainable machine learning," Finance Research Letters, Elsevier, vol. 60(C).
    11. Hoang Hiep Nguyen & Jean-Laurent Viviani & Sami Ben Jabeur, 2023. "Bankruptcy prediction using machine learning and Shapley additive explanations," Post-Print hal-04223161, HAL.
    12. Hoang, Daniel & Wiegratz, Kevin, 2022. "Machine learning methods in finance: Recent applications and prospects," Working Paper Series in Economics 158, Karlsruhe Institute of Technology (KIT), Department of Economics and Management.
    13. García-Céspedes, Rubén & Moreno, Manuel, 2022. "The generalized Vasicek credit risk model: A Machine Learning approach," Finance Research Letters, Elsevier, vol. 47(PA).
    14. Lisa Crosato & Caterina Liberati & Marco Repetto, 2021. "Look Who's Talking: Interpretable Machine Learning for Assessing Italian SMEs Credit Default," Papers 2108.13914, arXiv.org, revised Sep 2021.
    15. Chen, Dangxing & Ye, Jiahui & Ye, Weicheng, 2023. "Interpretable selective learning in credit risk," Research in International Business and Finance, Elsevier, vol. 65(C).
    16. Jan Svanberg & Tohid Ardeshiri & Isak Samsten & Peter Öhman & Presha E. Neidermeyer & Tarek Rana & Natalia Semenova & Mats Danielson, 2022. "Corporate governance performance ratings with machine learning," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 29(1), pages 50-68, January.
    17. Xiaoshan Su & Manying Bai, 2020. "Stochastic gradient boosting frequency-severity model of insurance claims," PLOS ONE, Public Library of Science, vol. 15(8), pages 1-24, August.
    18. Alonso-Robisco, Andrés & Carbó, José Manuel, 2022. "Can machine learning models save capital for banks? Evidence from a Spanish credit portfolio," International Review of Financial Analysis, Elsevier, vol. 84(C).
    19. Jabeur, Sami Ben & Gharib, Cheima & Mefteh-Wali, Salma & Arfi, Wissal Ben, 2021. "CatBoost model and artificial intelligence techniques for corporate failure prediction," Technological Forecasting and Social Change, Elsevier, vol. 166(C).
    20. Brezigar-Masten, Arjana & Masten, Igor & Volk, Matjaž, 2021. "Modelin-g credit risk with a Tobit model of days past due," Journal of Banking & Finance, Elsevier, vol. 122(C).
    21. Andrés Alonso Robisco & José Manuel Carbó Martínez, 2022. "Measuring the model risk-adjusted performance of machine learning algorithms in credit default prediction," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-35, December.
    22. Kellner, Ralf & Nagl, Maximilian & Rösch, Daniel, 2022. "Opening the black box – Quantile neural networks for loss given default prediction," Journal of Banking & Finance, Elsevier, vol. 134(C).
    23. Cedric H. A. Koffi & Viani Biatat Djeundje & Olivier Menoukeu Pamen, 2024. "Impact of social factors on loan delinquency in microfinance," Papers 2410.13100, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sigrist, Fabio & Leuenberger, Nicola, 2023. "Machine learning for corporate default risk: Multi-period prediction, frailty correlation, loan portfolios, and tail probabilities," European Journal of Operational Research, Elsevier, vol. 305(3), pages 1390-1406.
    2. Serrano-Cinca, Carlos & Gutiérrez-Nieto, Begoña & Bernate-Valbuena, Martha, 2019. "The use of accounting anomalies indicators to predict business failure," European Management Journal, Elsevier, vol. 37(3), pages 353-375.
    3. Alam, Nurul & Gao, Junbin & Jones, Stewart, 2021. "Corporate failure prediction: An evaluation of deep learning vs discrete hazard models," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 75(C).
    4. Tian, Shaonan & Yu, Yan, 2017. "Financial ratios and bankruptcy predictions: An international evidence," International Review of Economics & Finance, Elsevier, vol. 51(C), pages 510-526.
    5. Ugur, Mehmet & Solomon, Edna & Zeynalov, Ayaz, 2022. "Leverage, competition and financial distress hazard: Implications for capital structure in the presence of agency costs," Economic Modelling, Elsevier, vol. 108(C).
    6. Sumaira Ashraf & Elisabete G. S. Félix & Zélia Serrasqueiro, 2019. "Do Traditional Financial Distress Prediction Models Predict the Early Warning Signs of Financial Distress?," JRFM, MDPI, vol. 12(2), pages 1-17, April.
    7. Bátiz-Zuk Enrique & Mohamed Abdulkadir & Sánchez-Cajal Fátima, 2021. "Exploring the sources of loan default clustering using survival analysis with frailty," Working Papers 2021-14, Banco de México.
    8. Yi Cao & Xiaoquan Liu & Jia Zhai & Shan Hua, 2022. "A two‐stage Bayesian network model for corporate bankruptcy prediction," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 27(1), pages 455-472, January.
    9. John Nkwoma Inekwe, 2016. "Financial Distress, Employees’ Welfare and Entrepreneurship Among SMEs," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 129(3), pages 1135-1153, December.
    10. Stewart Jones, 2017. "Corporate bankruptcy prediction: a high dimensional analysis," Review of Accounting Studies, Springer, vol. 22(3), pages 1366-1422, September.
    11. Bai, Qing & Tian, Shaonan, 2020. "Innovate or die: Corporate innovation and bankruptcy forecasts," Journal of Empirical Finance, Elsevier, vol. 59(C), pages 88-108.
    12. Alessandro Bitetto & Stefano Filomeni & Michele Modina, 2021. "Understanding corporate default using Random Forest: The role of accounting and market information," DEM Working Papers Series 205, University of Pavia, Department of Economics and Management.
    13. Dong, Manh Cuong & Tian, Shaonan & Chen, Cathy W.S., 2018. "Predicting failure risk using financial ratios: Quantile hazard model approach," The North American Journal of Economics and Finance, Elsevier, vol. 44(C), pages 204-220.
    14. Ashraf, Sumaira & Félix, Elisabete G.S. & Serrasqueiro, Zélia, 2020. "Development and testing of an augmented distress prediction model: A comparative study on a developed and an emerging market," Journal of Multinational Financial Management, Elsevier, vol. 57.
    15. Katarina Valaskova & Dominika Gajdosikova & Jaroslav Belas, 2023. "Bankruptcy prediction in the post-pandemic period: A case study of Visegrad Group countries," Oeconomia Copernicana, Institute of Economic Research, vol. 14(1), pages 253-293, March.
    16. Evangelos C. Charalambakis & Ian Garrett, 2019. "On corporate financial distress prediction: What can we learn from private firms in a developing economy? Evidence from Greece," Review of Quantitative Finance and Accounting, Springer, vol. 52(2), pages 467-491, February.
    17. Alberto Tron & Maurizio Dallocchio & Salvatore Ferri & Federico Colantoni, 2023. "Corporate governance and financial distress: lessons learned from an unconventional approach," Journal of Management & Governance, Springer;Accademia Italiana di Economia Aziendale (AIDEA), vol. 27(2), pages 425-456, June.
    18. Jones, Stewart & Wang, Tim, 2019. "Predicting private company failure: A multi-class analysis," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 61(C), pages 161-188.
    19. Mohammad Mahdi Mousavi & Jamal Ouenniche, 2018. "Multi-criteria ranking of corporate distress prediction models: empirical evaluation and methodological contributions," Annals of Operations Research, Springer, vol. 271(2), pages 853-886, December.
    20. Cathcart, Lara & Dufour, Alfonso & Rossi, Ludovico & Varotto, Simone, 2020. "The differential impact of leverage on the default risk of small and large firms," Journal of Corporate Finance, Elsevier, vol. 60(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jbfina:v:102:y:2019:i:c:p:177-192. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jbf .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.