Printed from https://ideas.repec.org/a/eee/ejores/v277y2019i1p366-376.html

Identifying hidden patterns in credit risk survival data using Generalised Additive Models

Author

Listed:
  • Djeundje, Viani Biatat
  • Crook, Jonathan

Abstract

Modelling patterns in credit risk using survival analysis techniques has received considerable and increasing attention over the past decade. In these models, the predictor of the hazard of default is often expressed as a simple linear combination of the risk factors. In this work, we discuss how these models can be enhanced using Generalised Additive Models (GAMs). In the GAM framework, the predictor is formulated as a combination of flexible univariate functions of the risk factors. In this paper, we parametrise GAMs for credit risk data in terms of penalised splines, outline the implementation via frequentist and Bayesian MCMC methods, apply them to a large portfolio of credit card accounts, and show how GAMs can be used to improve not only the application, behavioural and macro-economic components of survival models for credit risk data at individual account level, but also the accuracy of predictions. From a practitioner's point of view, this work highlights that some accounts may actually become more (less) attractive to the lender if flexible smooth functions are used, whereas the same applicant may be denied (accepted) a loan if the linearity assumption is forced.
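
The contrast the abstract draws between a linear predictor and a smooth one can be illustrated with a small sketch. It is not the authors' implementation. It assumes a discrete-time hazard model on simulated account-month records, a single hypothetical risk factor x with a U-shaped effect on the logit of the default hazard, a truncated-linear spline basis with a ridge penalty on the knot coefficients as one simple form of penalised spline, and a smoothing parameter (lam) fixed by hand rather than estimated by the frequentist or Bayesian MCMC methods the paper describes. All names and numbers below are made up for illustration.

```python
import math
import random

def basis(x, knots):
    # Truncated-linear spline basis: 1, x, (x - k)_+ for each knot k.
    return [1.0, x] + [max(0.0, x - k) for k in knots]

def sigmoid(t):
    # Numerically stable logistic function.
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)

def solve(a, b):
    # Gaussian elimination with partial pivoting for a small dense system.
    n = len(b)
    m = [a[i][:] + [b[i]] for i in range(n)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            for j in range(c, n + 1):
                m[r][j] -= f * m[c][j]
    z = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][j] * z[j] for j in range(r + 1, n))
        z[r] = (m[r][n] - s) / m[r][r]
    return z

def hazard(beta, row):
    # Default probability for one account-month given its basis row.
    return sigmoid(sum(b * v for b, v in zip(beta, row)))

def fit(rows, y, pen, iters=15):
    # Penalised Newton-Raphson for a logistic hazard model.
    # pen[j] is the ridge weight on coefficient j (0 means unpenalised).
    p = len(rows[0])
    beta = [0.0] * p
    for _ in range(iters):
        grad = [-pen[j] * beta[j] for j in range(p)]
        hess = [[pen[j] if i == j else 0.0 for j in range(p)] for i in range(p)]
        for row, yi in zip(rows, y):
            mu = hazard(beta, row)
            w = mu * (1.0 - mu)
            for i in range(p):
                grad[i] += (yi - mu) * row[i]
                wi = w * row[i]
                for j in range(p):
                    hess[i][j] += wi * row[j]
        step = solve(hess, grad)
        beta = [b + s for b, s in zip(beta, step)]
    return beta

def deviance(rows, y, beta):
    # Minus twice the Bernoulli log-likelihood.
    total = 0.0
    for row, yi in zip(rows, y):
        mu = hazard(beta, row)
        total -= 2.0 * math.log(mu if yi else 1.0 - mu)
    return total

# Simulated account-month records (assumption: U-shaped true effect of x).
random.seed(1)
n = 2000
x = [random.random() for _ in range(n)]
y = [1 if random.random() < sigmoid(-3.5 + 8.0 * (xi - 0.5) ** 2) else 0
     for xi in x]

knots = [i / 6 for i in range(1, 6)]
lam = 0.1  # hand-picked smoothing parameter, not estimated

lin_rows = [[1.0, xi] for xi in x]
gam_rows = [basis(xi, knots) for xi in x]
beta_lin = fit(lin_rows, y, [0.0, 0.0])
beta_gam = fit(gam_rows, y, [0.0, 0.0] + [lam] * len(knots))

dev_lin = deviance(lin_rows, y, beta_lin)
dev_gam = deviance(gam_rows, y, beta_gam)
h_lin_mid = hazard(beta_lin, [1.0, 0.5])
h_gam_lo = hazard(beta_gam, basis(0.05, knots))
h_gam_mid = hazard(beta_gam, basis(0.5, knots))
h_gam_hi = hazard(beta_gam, basis(0.95, knots))

print("deviance, linear predictor:", round(dev_lin, 1))
print("deviance, spline predictor:", round(dev_gam, 1))
print("hazard at x = 0.5, linear vs spline:", h_lin_mid, h_gam_mid)
```

In this toy setting an account with x near 0.5 receives a lower estimated hazard from the spline predictor than from the linear one, so a fixed hazard cutoff could reverse the accept/reject decision for it. In practice the smoothing parameter would be estimated, for example by REML or within an MCMC sampler, rather than fixed.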

Suggested Citation

  • Djeundje, Viani Biatat & Crook, Jonathan, 2019. "Identifying hidden patterns in credit risk survival data using Generalised Additive Models," European Journal of Operational Research, Elsevier, vol. 277(1), pages 366-376.
  • Handle: RePEc:eee:ejores:v:277:y:2019:i:1:p:366-376
    DOI: 10.1016/j.ejor.2019.02.006

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221719301171
    Download Restriction: Full text for ScienceDirect subscribers only


    References listed on IDEAS

    1. Crainiceanu, Ciprian M. & Ruppert, David & Wand, Matthew P., 2005. "Bayesian Analysis for Penalized Spline Regression Using WinBUGS," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 14(i14).
    2. G Andreeva, 2006. "European generic scoring models using survival analysis," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 57(10), pages 1180-1187, October.
    3. Simon N. Wood, 2011. "Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(1), pages 3-36, January.
    4. Simon N. Wood & Yannig Goude & Simon Shaw, 2015. "Generalized additive models for large data sets," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 64(1), pages 139-155, January.
    5. S. N. Wood, 2000. "Modelling and smoothing parameter estimation with multiple quadratic penalties," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 62(2), pages 413-428.
    6. Djeundje, Viani Biatat & Crook, Jonathan, 2019. "Dynamic survival models with varying coefficients for credit risks," European Journal of Operational Research, Elsevier, vol. 275(1), pages 319-333.
    7. Ludwig Fahrmeir & Stefan Lang, 2001. "Bayesian inference for generalized additive mixed models based on Markov random field priors," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 50(2), pages 201-220.
    8. Brezger, Andreas & Kneib, Thomas & Lang, Stefan, 2005. "BayesX: Analyzing Bayesian Structural Additive Regression Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 14(i11).
    9. Rada Dakovic & Claudia Czado & Daniel Berg, 2010. "Bankruptcy prediction in Norway: a comparison study," Applied Economics Letters, Taylor & Francis Journals, vol. 17(17), pages 1739-1746.
    10. Maria Stepanova & Lyn Thomas, 2002. "Survival Analysis Methods for Personal Loan Data," Operations Research, INFORMS, vol. 50(2), pages 277-289, April.
    11. Bellotti, Tony & Crook, Jonathan, 2013. "Forecasting and stress testing credit card default using dynamic models," International Journal of Forecasting, Elsevier, vol. 29(4), pages 563-574.
    12. Simon N. Wood, 2008. "Fast stable direct fitting and smoothness selection for generalized additive models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(3), pages 495-518, July.
    13. Brezger, Andreas & Lang, Stefan, 2006. "Generalized structured additive regression based on Bayesian P-splines," Computational Statistics & Data Analysis, Elsevier, vol. 50(4), pages 967-991, February.

    Citations



    Cited by:

    1. Calabrese, Raffaella & Dombrowski, Timothy & Mandel, Antoine & Pace, R. Kelley & Zanin, Luca, 2024. "Impacts of extreme weather events on mortgage risks and their evolution under climate change: A case study on Florida," European Journal of Operational Research, Elsevier, vol. 314(1), pages 377-392.
    2. Koen W. de Bock & Kristof Coussement & Arno De Caigny & Roman Slowiński & Bart Baesens & Robert N Boute & Tsan-Ming Choi & Dursun Delen & Mathias Kraus & Stefan Lessmann & Sebastián Maldonado & David , 2023. "Explainable AI for Operational Research: A Defining Framework, Methods, Applications, and a Research Agenda," Post-Print hal-04219546, HAL.
    3. Jiří Witzany & Anastasiia Kozina, 2022. "Recovery process optimization using survival regression," Operational Research, Springer, vol. 22(5), pages 5269-5296, November.
    4. Medina-Olivares, Victor & Lindgren, Finn & Calabrese, Raffaella & Crook, Jonathan, 2023. "Joint models of multivariate longitudinal outcomes and discrete survival data with INLA: An application to credit repayment behaviour," European Journal of Operational Research, Elsevier, vol. 310(2), pages 860-873.
    5. Hang Thi Thanh Vu & Jeonghan Ko, 2024. "Effective Modeling of CO2 Emissions for Light-Duty Vehicles: Linear and Non-Linear Models with Feature Selection," Energies, MDPI, vol. 17(7), pages 1-23, March.
    6. Oliver Blümke, 2022. "Multiperiod default probability forecasting," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(4), pages 677-696, July.
    7. Johari, Maryam & Hosseini-Motlagh, Seyyed-Mahdi, 2022. "Evolutionary behaviors regarding pricing and payment-convenience strategies with uncertain risk," European Journal of Operational Research, Elsevier, vol. 297(2), pages 600-614.
    8. De Bock, Koen W. & Coussement, Kristof & Caigny, Arno De & Słowiński, Roman & Baesens, Bart & Boute, Robert N. & Choi, Tsan-Ming & Delen, Dursun & Kraus, Mathias & Lessmann, Stefan & Maldonado, Sebast, 2024. "Explainable AI for Operational Research: A defining framework, methods, applications, and a research agenda," European Journal of Operational Research, Elsevier, vol. 317(2), pages 249-272.
    9. Lohmann, Christian & Ohliger, Thorsten, 2024. "Predicting the cure of a defaulted company: Nonlinear relationships between loan-related variables and the cure probability," Research in International Business and Finance, Elsevier, vol. 70(PB).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Strasak, Alexander M. & Umlauf, Nikolaus & Pfeiffer, Ruth M. & Lang, Stefan, 2011. "Comparing penalized splines and fractional polynomials for flexible modelling of the effects of continuous predictor variables," Computational Statistics & Data Analysis, Elsevier, vol. 55(4), pages 1540-1551, April.
    2. Belitz, Christiane & Lang, Stefan, 2008. "Simultaneous selection of variables and smoothing parameters in structured additive regression models," Computational Statistics & Data Analysis, Elsevier, vol. 53(1), pages 61-81, September.
    3. Umlauf, Nikolaus & Adler, Daniel & Kneib, Thomas & Lang, Stefan & Zeileis, Achim, 2015. "Structured Additive Regression Models: An R Interface to BayesX," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 63(i21).
    4. Nadja Klein & Michel Denuit & Stefan Lang & Thomas Kneib, 2013. "Nonlife Ratemaking and Risk Management with Bayesian Additive Models for Location, Scale and Shape," Working Papers 2013-24, Faculty of Economics and Statistics, Universität Innsbruck.
    5. Simon N. Wood, 2020. "Inference and computation with generalized additive models and their extensions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(2), pages 307-339, June.
    6. Klein, Nadja & Denuit, Michel & Lang, Stefan & Kneib, Thomas, 2013. "Nonlife Ratemaking and Risk Management with Bayesian Additive Models for Location, Scale and Shape," LIDAM Discussion Papers ISBA 2013045, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    7. Schmidt, Paul & Mühlau, Mark & Schmid, Volker, 2017. "Fitting large-scale structured additive regression models using Krylov subspace methods," Computational Statistics & Data Analysis, Elsevier, vol. 105(C), pages 59-75.
    8. Longhi, Christian & Musolesi, Antonio & Baumont, Catherine, 2014. "Modeling structural change in the European metropolitan areas during the process of economic integration," Economic Modelling, Elsevier, vol. 37(C), pages 395-407.
    9. Simon N. Wood & Natalya Pya & Benjamin Säfken, 2016. "Smoothing Parameter and Model Selection for General Smooth Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1548-1563, October.
    10. Musolesi Antonio & Mazzanti Massimiliano, 2014. "Nonlinearity, heterogeneity and unobserved effects in the carbon dioxide emissions-economic development relation for advanced countries," Studies in Nonlinear Dynamics & Econometrics, De Gruyter, vol. 18(5), pages 521-541, December.
    11. Thi Mai Luong, 2020. "Selection Effects of Lender and Borrower Choices on Risk Measurement, Management and Prudential Regulation," PhD Thesis, Finance Discipline Group, UTS Business School, University of Technology, Sydney, number 3-2020, January-A.
    12. Djeundje, Viani Biatat & Crook, Jonathan, 2019. "Dynamic survival models with varying coefficients for credit risks," European Journal of Operational Research, Elsevier, vol. 275(1), pages 319-333.
    13. Medina-Olivares, Victor & Calabrese, Raffaella & Crook, Jonathan & Lindgren, Finn, 2023. "Joint models for longitudinal and discrete survival data in credit scoring," European Journal of Operational Research, Elsevier, vol. 307(3), pages 1457-1473.
    14. Oliver Blümke, 2022. "Multiperiod default probability forecasting," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(4), pages 677-696, July.
    15. Li, Aimin & Li, Zhiyong & Bellotti, Anthony, 2023. "Predicting loss given default of unsecured consumer loans with time-varying survival scores," Pacific-Basin Finance Journal, Elsevier, vol. 78(C).
    16. Costa, M.J. & Shaw, J.E.H., 2009. "Parametrization and penalties in spline models with an application to survival analysis," Computational Statistics & Data Analysis, Elsevier, vol. 53(3), pages 657-670, January.
    17. François Freddy Ateba & Issaka Sagara & Nafomon Sogoba & Mahamoudou Touré & Drissa Konaté & Sory Ibrahim Diawara & Séidina Aboubacar Samba Diakité & Ayouba Diarra & Mamadou D. Coulibaly & Mathias Dolo, 2020. "Spatio-Temporal Dynamic of Malaria Incidence: A Comparison of Two Ecological Zones in Mali," IJERPH, MDPI, vol. 17(13), pages 1-21, June.
    18. Mazzanti, Massimiliano & Musolesi, Antonio, 2013. "Nonlinearity, Heterogeneity and Unobserved Effects in the CO2-income Relation for Advanced Countries," Climate Change and Sustainable Development 162374, Fondazione Eni Enrico Mattei (FEEM).
    19. Simon N. Wood, 2011. "Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(1), pages 3-36, January.
    20. Roberto Basile & Luigi Benfratello & Davide Castellani, 2012. "Geoadditive models for regional count data: an application to industrial location," ERSA conference papers ersa12p83, European Regional Science Association.
