IDEAS home Printed from https://ideas.repec.org/p/hal/wpaper/hal-03286760.html
   My bibliography  Save this paper

Credit scoring using neural networks and SURE posterior probability calibration

Author

Listed:
  • Matthieu Garcin

    (ESILV - École Supérieure d'Ingénierie Léonard de Vinci)

  • Samuel Stéphan

    (ESILV - École Supérieure d'Ingénierie Léonard de Vinci, SAMM - Statistique, Analyse et Modélisation Multidisciplinaire (SAmos-Marin Mersenne) - UP1 - Université Paris 1 Panthéon-Sorbonne)

Abstract

In this article we compare the performances of a logistic regression and a feed forward neural network for credit scoring purposes. Our results show that the logistic regression gives quite good results on the dataset and the neural network can improve a little the performance. We also consider different sets of features in order to assess their importance in terms of prediction accuracy. We find that temporal features (i.e. repeated measures over time) can be an important source of information resulting in an increase in the overall model accuracy. Finally, we introduce a new technique for the calibration of predicted probabilities based on Stein's unbiased risk estimate (SURE). This calibration technique can be applied to very general calibration functions. In particular, we detail this method for the sigmoid function as well as for the Kumaraswamy function, which includes the identity as a particular case. We show that the SURE calibration technique is able to calibrate the predicted probabilities as well as the classical Platt method.

Suggested Citation

  • Matthieu Garcin & Samuel Stéphan, 2023. "Credit scoring using neural networks and SURE posterior probability calibration," Working Papers hal-03286760, HAL.
  • Handle: RePEc:hal:wpaper:hal-03286760
    Note: View the original document on HAL open archive server: https://hal.science/hal-03286760v2
    as

    Download full text from publisher

    File URL: https://hal.science/hal-03286760v2/document
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Takaya Saito & Marc Rehmsmeier, 2015. "The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-21, March.
    2. Anderson, Raymond, 2007. "The Credit Scoring Toolkit: Theory and Practice for Retail Credit Risk Management and Decision Automation," OUP Catalogue, Oxford University Press, number 9780199226405.
    3. Steenackers, A. & Goovaerts, M. J., 1989. "A credit scoring model for personal loans," Insurance: Mathematics and Economics, Elsevier, vol. 8(1), pages 31-34, March.
    4. Artem Bequé & Kristof Coussement & Ross Gayler & Stefan Lessmann, 2017. "Approaches for credit scorecard calibration: An empirical analysis," Post-Print hal-01745262, HAL.
    5. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    6. Elena Ivona DUMITRESCU & Sullivan HUE & Christophe HURLIN & Sessi TOKPAVI, 2020. "Machine Learning or Econometrics for Credit Scoring: Let’s Get the Best of Both Worlds," LEO Working Papers / DR LEO 2839, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.
    7. Finlay, Steven, 2011. "Multiple classifier architectures and their application to credit risk assessment," European Journal of Operational Research, Elsevier, vol. 210(2), pages 368-378, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matthieu Garcin & Samuel St'ephan, 2021. "Credit scoring using neural networks and SURE posterior probability calibration," Papers 2107.07206, arXiv.org.
    2. Hussein A. Abdou & John Pointon, 2011. "Credit Scoring, Statistical Techniques And Evaluation Criteria: A Review Of The Literature," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 18(2-3), pages 59-88, April.
    3. Elena Ivona DUMITRESCU & Sullivan HUE & Christophe HURLIN & Sessi TOKPAVI, 2020. "Machine Learning or Econometrics for Credit Scoring: Let’s Get the Best of Both Worlds," LEO Working Papers / DR LEO 2839, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.
    4. Maria Rocha Sousa & João Gama & Elísio Brandão, 2013. "Introducing time-changing economics into credit scoring," FEP Working Papers 513, Universidade do Porto, Faculdade de Economia do Porto.
    5. Le, Hong Hanh & Viviani, Jean-Laurent, 2018. "Predicting bank failure: An improvement by implementing a machine-learning approach to classical financial ratios," Research in International Business and Finance, Elsevier, vol. 44(C), pages 16-25.
    6. Enrique Batiz‐Zuk & Fabrizio López‐Gallo & Abdulkadir Mohamed & Fátima Sánchez‐Cajal, 2022. "Determinants of loan survival rates for small and medium‐sized enterprises: Evidence from an emerging economy," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 27(4), pages 4741-4755, October.
    7. A?da Kammoun & Imen Triki, 2016. "Credit Scoring Models for a Tunisian Microfinance Institution: Comparison between Artificial Neural Network and Logistic Regression," Review of Economics & Finance, Better Advances Press, Canada, vol. 6, pages 61-78, February.
    8. Tsukahara, Fábio Yasuhiro & Kimura, Herbert & Sobreiro, Vinicius Amorim & Zambrano, Juan Carlos Arismendi, 2016. "Validation of default probability models: A stress testing approach," International Review of Financial Analysis, Elsevier, vol. 47(C), pages 70-85.
    9. Cao Son Tran & Dan Nicolau & Richi Nayak & Peter Verhoeven, 2021. "Modeling Credit Risk: A Category Theory Perspective," JRFM, MDPI, vol. 14(7), pages 1-21, July.
    10. George Xianzhi Yuan & Huiqi Wang, 2019. "The general dynamic risk assessment for the enterprise by the hologram approach in financial technology," International Journal of Financial Engineering (IJFE), World Scientific Publishing Co. Pte. Ltd., vol. 6(01), pages 1-48, March.
    11. Ha-Thu Nguyen, 2015. "How is credit scoring used to predict default in China?," EconomiX Working Papers 2015-1, University of Paris Nanterre, EconomiX.
    12. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    13. Ha-Thu Nguyen, 2014. "Default Predictors in Credit Scoring - Evidence from France’s Retail Banking Institution," EconomiX Working Papers 2014-26, University of Paris Nanterre, EconomiX.
    14. Agustin Pérez-Martín & Agustin Pérez-Torregrosa & Alejandro Rabasa & Marta Vaca, 2020. "Feature Selection to Optimize Credit Banking Risk Evaluation Decisions for the Example of Home Equity Loans," Mathematics, MDPI, vol. 8(11), pages 1-16, November.
    15. Falco J. Bargagli-Dtoffi & Massimo Riccaboni & Armando Rungi, 2020. "Machine Learning for Zombie Hunting. Firms Failures and Financial Constraints," Working Papers 01/2020, IMT School for Advanced Studies Lucca, revised Jun 2020.
    16. Jairaj Gupta & Andros Gregoriou & Jerome Healy, 2015. "Forecasting bankruptcy for SMEs using hazard function: To what extent does size matter?," Review of Quantitative Finance and Accounting, Springer, vol. 45(4), pages 845-869, November.
    17. Mestiri, Sami & Farhat, Abdejelil, 2018. "Credit Risk Prediction based on Bayesian estimation of logistic regression model with random effects," MPRA Paper 119960, University Library of Munich, Germany.
    18. Rais Ahmad Itoo & A. Selvarasu & José António Filipe, 2015. "Loan Products and Credit Scoring by Commercial Banks (India)," International Journal of Finance, Insurance and Risk Management, International Journal of Finance, Insurance and Risk Management, vol. 5(1), pages 851-851.
    19. Bátiz-Zuk Enrique & Mohamed Abdulkadir & Sánchez-Cajal Fátima, 2021. "Exploring the sources of loan default clustering using survival analysis with frailty," Working Papers 2021-14, Banco de México.
    20. Jairaj Gupta & Nicholas Wilson & Andros Gregoriou & Jerome Healy, 2014. "The value of operating cash flow in modelling credit risk for SMEs," Applied Financial Economics, Taylor & Francis Journals, vol. 24(9), pages 649-660, May.

    More about this item

    Keywords

    Deep learning; credit scoring; calibration; SURE;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:wpaper:hal-03286760. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.