IDEAS home Printed from https://ideas.repec.org/a/vrs/eaiada/v22y2018i2p54-73n5.html
   My bibliography  Save this article

Predicting the Default Risk of Companies. Comparison of Credit Scoring Models: Logit Vs Support Vector Machines

Author

Listed:
  • Nehrebecka Natalia

    (National Bank of Poland, University of Warsaw, Warsaw, Poland; nnehrebecka@wne.uw.edu.pl)

Abstract

The aim of the article is to compare models on a train and validation sample, which will be created using logistic regression and Support Vector Machine (SVM) and will be used to assess the credit risk of non-financial enterprises. When creating models, the variables will be subjected to the transformation of the Weight of Evidence (WoE), the number of potential predictions will be reduced based on the Information Value (IV) statistics. The quality of the models will be assessed according to the most popular criteria such as GINI statistics, Kolmogorov-Smirnov (K-S) and Area Under Receiver Operating Characteristic (AUROC). Based on the results, it was found that there are significant differences between the logistic regression model of discriminatory character and the SVM for the model sample. In the case of a validation sample, logistic regression has the best prognostic capability. These analyses can be used to reduce the risk of negative effects on the financial sector.

Suggested Citation

  • Nehrebecka Natalia, 2018. "Predicting the Default Risk of Companies. Comparison of Credit Scoring Models: Logit Vs Support Vector Machines," Econometrics. Advances in Applied Data Analysis, Sciendo, vol. 22(2), pages 54-73, June.
  • Handle: RePEc:vrs:eaiada:v:22:y:2018:i:2:p:54-73:n:5
    DOI: 10.15611/eada.2018.2.05
    as

    Download full text from publisher

    File URL: https://doi.org/10.15611/eada.2018.2.05
    Download Restriction: no

    File URL: https://libkey.io/10.15611/eada.2018.2.05?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jankowitsch, Rainer & Pichler, Stefan & Schwaiger, Walter S.A., 2007. "Modelling the economic value of credit rating systems," Journal of Banking & Finance, Elsevier, vol. 31(1), pages 181-198, January.
    2. Rossi, Stefania P.S. & Schwaiger, Markus S. & Winkler, Gerhard, 2009. "How loan portfolio diversification affects risk, efficiency and capitalization: A managerial behavior model for Austrian banks," Journal of Banking & Finance, Elsevier, vol. 33(12), pages 2218-2226, December.
    3. Natalia Nehrebecka, 2016. "Approach to the assessment of credit risk for non-financial corporations. Evidence from Poland," IFC Bulletins chapters, in: Bank for International Settlements (ed.), Combining micro and macro data for financial stability analysis, volume 41, Bank for International Settlements.
    4. Wolfgang Härdle & Yuh-Jye Lee & Dorothea Schäfer & Yi-Ren Yeh, 2009. "Variable selection and oversampling in the use of smooth support vector machines for predicting the default risk of companies," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 28(6), pages 512-534.
    5. Wolfgang Härdle & Yuh-Jye Lee & Dorothea Schäfer & Yi-Ren Yeh, 2007. "The Default Risk of Firms Examined with Smooth Support Vector Machines," Discussion Papers of DIW Berlin 757, DIW Berlin, German Institute for Economic Research.
    6. Crone, Sven F. & Finlay, Steven, 2012. "Instance sampling in credit scoring: An empirical study of sample size and balancing," International Journal of Forecasting, Elsevier, vol. 28(1), pages 224-238.
    7. Laura Auria & Rouslan A. Moro, 2008. "Support Vector Machines (SVM) as a Technique for Solvency Analysis," Discussion Papers of DIW Berlin 811, DIW Berlin, German Institute for Economic Research.
    8. Abildgren, Kim & Vølund Buchholst, Birgitte & Staghøj, Jonas, 2013. "Bank-firm relationships and the survival of non-financial firms during the financial crisis 2008-2009," Working Paper Series 1516, European Central Bank.
    9. Natalia Nehrebecka, 2011. "Wykorzystanie łańcuchów Markowa do prognozowania zmian w strukturze polskich przedsiębiorstw," Gospodarka Narodowa. The Polish Journal of Economics, Warsaw School of Economics, issue 10, pages 59-98.
    10. Härdle, Wolfgang Karl & Moro, Rouslan A. & Schäfer, Dorothea, 2007. "Estimating probabilities of default with support vector machines," Discussion Paper Series 2: Banking and Financial Studies 2007,18, Deutsche Bundesbank.
    11. Anderson, Raymond, 2007. "The Credit Scoring Toolkit: Theory and Practice for Retail Credit Risk Management and Decision Automation," OUP Catalogue, Oxford University Press, number 9780199226405.
    12. Marina-Eliza Spaliara & Holger Görg, 2009. "Financial health, exports, and firm survival: A comparison of British and French firms," Discussion Paper Series 2009_18, Department of Economics, Loughborough University.
    13. Russ A. Moro, 2008. "Analysis of the Predictors of Default for Portuguese Firms," Working Papers w200822, Banco de Portugal, Economics and Research Department.
    14. K B Schebesch & R Stecking, 2005. "Support vector machines for classifying and describing credit applicants: detecting typical and critical regions," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 56(9), pages 1082-1088, September.
    15. Gladys D. C. Barriga & Vicente G. Cancho & Francisco Louzada, 2015. "A non‐default rate regression model for credit scoring," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 31(6), pages 846-861, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lukasz Prorokowski, 2022. "New definition of default," Bank i Kredyt, Narodowy Bank Polski, vol. 53(5), pages 523-564.
    2. Guner Altan & Server Demirci, 2022. "Credit Scoring on Cash Flow Table with Machine Learning: XGBoost Approach," Journal of Economic Policy Researches, Istanbul University, Faculty of Economics, vol. 9(2), pages 397-424, July.
    3. Lisa Crosato & Caterina Liberati & Marco Repetto, 2021. "Look Who's Talking: Interpretable Machine Learning for Assessing Italian SMEs Credit Default," Papers 2108.13914, arXiv.org, revised Sep 2021.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Natalia Nehrebecka, 2021. "Internal Credit Risk Models and Digital Transformation: What to Prepare for? An Application to Poland," European Research Studies Journal, European Research Studies Journal, vol. 0(Special 3), pages 719-736.
    2. Natalia Nehrebecka, 2016. "Approach to the assessment of credit risk for non-financial corporations. Evidence from Poland," IFC Bulletins chapters, in: Bank for International Settlements (ed.), Combining micro and macro data for financial stability analysis, volume 41, Bank for International Settlements.
    3. Marian Nehrebecki, 2023. "Zombification in Poland in particular during COVID-19 pandemic and low interest rates," Bank i Kredyt, Narodowy Bank Polski, vol. 54(2), pages 153-190.
    4. Dimitrios Nikolaidis & Michalis Doumpos, 2022. "Credit Scoring with Drift Adaptation Using Local Regions of Competence," SN Operations Research Forum, Springer, vol. 3(4), pages 1-28, December.
    5. Zhang, Junni L. & Härdle, Wolfgang Karl, 2008. "The bayesian additive classification tree applied to credit risk modelling," SFB 649 Discussion Papers 2008-003, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
    6. Natalia Nehrebecka, 2018. "Sectoral risk assessment with particular emphasis on export enterprises in Poland," Zbornik radova Ekonomskog fakulteta u Rijeci/Proceedings of Rijeka Faculty of Economics, University of Rijeka, Faculty of Economics and Business, vol. 36(2), pages 677-700.
    7. Swati Anand & Kushendra Mishra, 2022. "Identifying potential millennial customers for financial institutions using SVM," Journal of Financial Services Marketing, Palgrave Macmillan, vol. 27(4), pages 335-345, December.
    8. Gero Szepannek, 2022. "An Overview on the Landscape of R Packages for Open Source Scorecard Modelling," Risks, MDPI, vol. 10(3), pages 1-33, March.
    9. Lean Yu & Xinxie Li & Ling Tang & Zongyi Zhang & Gang Kou, 2015. "Social credit: a comprehensive literature review," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 1(1), pages 1-18, December.
    10. Li, Chunyu & Lou, Chenxin & Luo, Dan & Xing, Kai, 2021. "Chinese corporate distress prediction using LASSO: The role of earnings management," International Review of Financial Analysis, Elsevier, vol. 76(C).
    11. Kurt Hornik & Rainer Jankowitsch & Manuel Lingo & Stefan Pichler & Gerhard Winkler, 2010. "Determinants of heterogeneity in European credit ratings," Financial Markets and Portfolio Management, Springer;Swiss Society for Financial Market Research, vol. 24(3), pages 271-287, September.
    12. Baah Aye Kusi & Lydia Adzobu & Alex Kwame Abasi & Kwadjo Ansah-Adu, 2020. "Sectoral Loan Portfolio Concentration and Bank Stability: Evidence from an Emerging Economy," Journal of Emerging Market Finance, Institute for Financial Management and Research, vol. 19(1), pages 66-99, April.
    13. Hakenes, Hendrik & Schnabel, Isabel, 2011. "Bank size and risk-taking under Basel II," Journal of Banking & Finance, Elsevier, vol. 35(6), pages 1436-1449, June.
    14. Simoens, Mathieu & Vander Vennet, Rudi, 2022. "Does diversification protect European banks’ market valuations in a pandemic?," Finance Research Letters, Elsevier, vol. 44(C).
    15. Oguzhan Ece & Bulent Diclehan Cadirci, 2022. "The Effect of Loan Portfolio Concentration Level on Financial Stability and Performance: A Comparatıve Analysis in Dual Banking System," Journal of Economic Policy Researches, Istanbul University, Faculty of Economics, vol. 9(2), pages 523-556, July.
    16. Vaneet Bhatia & Sankarshan Basu & Subrata Kumar Mitra & Pradyumna Dash, 2018. "A review of bank efficiency and productivity," OPSEARCH, Springer;Operational Research Society of India, vol. 55(3), pages 557-600, November.
    17. Marcin Chlebus, 2014. "One-day prediction of state of turbulence for financial instrument based on models for binary dependent variable," Ekonomia journal, Faculty of Economic Sciences, University of Warsaw, vol. 37.
    18. Filipe Silva & Carlos Carreira, 2011. "Financial Constraints and Exports: An Analysis of Portuguese Firms During the European Monetary Integration," Notas Económicas, Faculty of Economics, University of Coimbra, issue 34, pages 35-56, December.
    19. Joachim Wagner & John Philipp Weche Gelübcke, 2012. "Foreign Ownership and Firm Survival: First Evidence for Enterprises in Germany," International Economics, CEPII research center, issue 132, pages 117-139.
    20. Goran Calic & Moren Lévesque & Anton Shevchenko, 2024. "On why women-owned businesses take more time to secure microloans," Small Business Economics, Springer, vol. 63(3), pages 917-938, October.

    More about this item

    Keywords

    Basel III; Internal Rating Based System; credit scoring; Support Vector Machines; logistic regression;
    All these keywords.

    JEL classification:

    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • G33 - Financial Economics - - Corporate Finance and Governance - - - Bankruptcy; Liquidation
    • C45 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Neural Networks and Related Topics

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:vrs:eaiada:v:22:y:2018:i:2:p:54-73:n:5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.sciendo.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.