Machine learning techniques in joint default assessment

My bibliography Save this paper

Machine learning techniques in joint default assessment

Author

Listed:

Margherita Doria
Elisa Luciano
Patrizia Semeraro

Registered:

Elisa Luciano

Abstract

This paper studies the consequences of capturing non-linear dependence among the covariates that drive the default of different obligors and the overall riskiness of their credit portfolio. Joint default modeling is, without loss of generality, the classical Bernoulli mixture model. Using an application to a credit card dataset we show that, even when Machine Learning techniques perform only slightly better than Logistic Regression in classifying individual defaults as a function of the covariates, they do outperform it at the portfolio level. This happens because they capture linear and non-linear dependence among the covariates, whereas Logistic Regression only captures linear dependence. The ability of Machine Learning methods to capture non-linear dependence among the covariates produces higher default correlation compared with Logistic Regression. As a consequence, on our data, Logistic Regression underestimates the riskiness of the credit portfolio.

Suggested Citation

Margherita Doria & Elisa Luciano & Patrizia Semeraro, 2022. "Machine learning techniques in joint default assessment," Papers 2205.01524, arXiv.org, revised Sep 2023.

Handle: RePEc:arx:papers:2205.01524

Download full text from publisher

Other versions of this item:

Edoardo Fadda & Elisa Luciano & Patrizia Semeraro, 2024. "Machine Learning techniques in joint default assessment," Carlo Alberto Notebooks 723 JEL Classification: G, Collegio Carlo Alberto.

References listed on IDEAS

Chen, Shunqin & Guo, Zhengfeng & Zhao, Xinlei, 2021. "Predicting mortgage early delinquency with machine learning methods," European Journal of Operational Research, Elsevier, vol. 290(1), pages 358-372.
Fitzpatrick, Trevor & Mues, Christophe, 2016. "An empirical comparison of classification algorithms for mortgage default prediction: evidence from a distressed mortgage market," European Journal of Operational Research, Elsevier, vol. 249(2), pages 427-439.
İsmail Başoğlu & Wolfgang Hörmann & Halis Sak, 2018. "Efficient simulations for a Bernoulli mixture model of portfolio credit risk," Annals of Operations Research, Springer, vol. 260(1), pages 113-128, January.
Roberto Fontana & Elisa Luciano & Patrizia Semeraro, 2021. "Model risk in credit risk," Mathematical Finance, Wiley Blackwell, vol. 31(1), pages 176-202, January.
- Roberto Fontana & Elisa Luciano & Patrizia Semeraro, 2019. "Model Risk in Credit Risk," Papers 1906.06164, arXiv.org.
Dhaene, Jan & Denuit, Michel, 1999. "The safest dependence structure among risks," Insurance: Mathematics and Economics, Elsevier, vol. 25(1), pages 11-21, September.
Foster D.P. & Stine R.A., 2004. "Variable Selection in Data Mining: Building a Predictive Model for Bankruptcy," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 303-313, January.
Desai, Vijay S. & Crook, Jonathan N. & Overstreet, George A., 1996. "A comparison of neural networks and linear scoring models in the credit union environment," European Journal of Operational Research, Elsevier, vol. 95(1), pages 24-37, November.
Apaar Sadhwani & Kay Giesecke & Justin Sirignano, 2021. "Deep Learning for Mortgage Risk [The Subprime Virus]," Journal of Financial Econometrics, Oxford University Press, vol. 19(2), pages 313-368.
Khandani, Amir E. & Kim, Adlar J. & Lo, Andrew W., 2010. "Consumer credit-risk models via machine-learning algorithms," Journal of Banking & Finance, Elsevier, vol. 34(11), pages 2767-2787, November.
Embrechts, Paul & Puccetti, Giovanni & Rüschendorf, Ludger, 2013. "Model uncertainty and VaR aggregation," Journal of Banking & Finance, Elsevier, vol. 37(8), pages 2750-2764.
Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
Kaas, Rob & Dhaene, Jan & Goovaerts, Marc J., 2000. "Upper and lower bounds for sums of random variables," Insurance: Mathematics and Economics, Elsevier, vol. 27(2), pages 151-168, October.
Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
- Elena Ivona Dumitrescu & Sullivan Hué & Christophe Hurlin & Sessi Tokpavi, 2022. "Machine Learning for Credit Scoring: Improving Logistic Regression with Non Linear Decision Tree Effects," Post-Print hal-03331114, HAL.
Barrieu, Pauline & Scandolo, Giacomo, 2015. "Assessing financial model risk," European Journal of Operational Research, Elsevier, vol. 242(2), pages 546-556.
- Pauline Barrieu & Giacomo Scandolo, 2013. "Assessing Financial Model Risk," Papers 1307.0684, arXiv.org, revised Jul 2013.
Carole Bernard & Ludger Rüschendorf & Steven Vanduffel & Jing Yao, 2017. "How robust is the value-at-risk of credit risk portfolios?," The European Journal of Finance, Taylor & Francis Journals, vol. 23(6), pages 507-534, May.
Carole Bernard & Corrado De Vecchi & Steven Vanduffel, 2023. "The impact of correlation on (Range) Value-at-Risk," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2023(6), pages 531-564, July.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Roberto Fontana & Patrizia Semeraro, 2023. "Measuring distribution risk in discrete models," Papers 2302.08838, arXiv.org.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Kriebel, Johannes & Stitz, Lennart, 2022. "Credit default prediction from user-generated text in peer-to-peer lending using deep learning," European Journal of Operational Research, Elsevier, vol. 302(1), pages 309-323.
Chen, Shunqin & Guo, Zhengfeng & Zhao, Xinlei, 2021. "Predicting mortgage early delinquency with machine learning methods," European Journal of Operational Research, Elsevier, vol. 290(1), pages 358-372.
Carole Bernard & Ludger Rüschendorf & Steven Vanduffel & Ruodu Wang, 2017. "Risk bounds for factor models," Finance and Stochastics, Springer, vol. 21(3), pages 631-659, July.
Corrado De Vecchi & Max Nendel & Jan Streicher, 2024. "Upper Comonotonicity and Risk Aggregation under Dependence Uncertainty," Papers 2406.19242, arXiv.org.
Claußen, Arndt & Rösch, Daniel & Schmelzle, Martin, 2019. "Hedging parameter risk," Journal of Banking & Finance, Elsevier, vol. 100(C), pages 111-121.
Bernard, Carole & Kazzi, Rodrigue & Vanduffel, Steven, 2020. "Range Value-at-Risk bounds for unimodal distributions under partial information," Insurance: Mathematics and Economics, Elsevier, vol. 94(C), pages 9-24.
Emmanuel Flachaire & Sullivan Hué & Sébastien Laurent & Gilles Hacheme, 2024. "Interpretable Machine Learning Using Partial Linear Models," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 86(3), pages 519-540, June.
- Emmanuel Flachaire & Sullivan Hué & Sébastien Laurent & Gilles Hacheme, 2023. "Interpretable Machine Learning Using Partial Linear Models," Post-Print hal-04529011, HAL.
Chen, Yujia & Calabrese, Raffaella & Martin-Barragan, Belen, 2024. "Interpretable machine learning for imbalanced credit scoring datasets," European Journal of Operational Research, Elsevier, vol. 312(1), pages 357-372.
Chen, Dangxing & Ye, Jiahui & Ye, Weicheng, 2023. "Interpretable selective learning in credit risk," Research in International Business and Finance, Elsevier, vol. 65(C).
Sullivan Hué, 2022. "GAM(L)A: An econometric model for interpretable machine learning," French Stata Users' Group Meetings 2022 19, Stata Users Group.
Emmanuel Flachaire & Gilles Hacheme & Sullivan Hu'e & S'ebastien Laurent, 2022. "GAM(L)A: An econometric model for interpretable Machine Learning," Papers 2203.11691, arXiv.org.
Luca Barbaglia & Sebastiano Manzan & Elisa Tosetti, 2023. "Forecasting Loan Default in Europe with Machine Learning," Journal of Financial Econometrics, Oxford University Press, vol. 21(2), pages 569-596.
Tigges, Maximilian & Mestwerdt, Sönke & Tschirner, Sebastian & Mauer, René, 2024. "Who gets the money? A qualitative analysis of fintech lending and credit scoring through the adoption of AI and alternative data," Technological Forecasting and Social Change, Elsevier, vol. 205(C).
Carole Bernard & Ludger Rüschendorf & Steven Vanduffel, 2017. "Value-at-Risk Bounds With Variance Constraints," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 84(3), pages 923-959, September.
Dangxing Chen & Weicheng Ye & Jiahui Ye, 2022. "Interpretable Selective Learning in Credit Risk," Papers 2209.10127, arXiv.org.
Antonella Campana, 2007. "On Tail Value-at-Risk for sums of non-independent random variables with a generalized Pareto distribution," The Geneva Papers on Risk and Insurance Theory, Springer;International Association for the Study of Insurance Economics (The Geneva Association), vol. 32(2), pages 169-180, December.
Thibaut Lux & Antonis Papapantoleon, 2016. "Model-free bounds on Value-at-Risk using extreme value information and statistical distances," Papers 1610.09734, arXiv.org, revised Nov 2018.
Mai Jan-Frederik & Schenk Steffen & Scherer Matthias, 2015. "Analyzing model robustness via a distortion of the stochastic root: A Dirichlet prior approach," Statistics & Risk Modeling, De Gruyter, vol. 32(3-4), pages 177-195, December.
Hofert Marius & Memartoluie Amir & Saunders David & Wirjanto Tony, 2017. "Improved algorithms for computing worst Value-at-Risk," Statistics & Risk Modeling, De Gruyter, vol. 34(1-2), pages 13-31, June.
Michael Bucker & Gero Szepannek & Alicja Gosiewska & Przemyslaw Biecek, 2020. "Transparency, Auditability and eXplainability of Machine Learning Models in Credit Scoring," Papers 2009.13384, arXiv.org.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2022-06-20 (Big Data)
NEP-CMP-2022-06-20 (Computational Economics)
NEP-RMG-2022-06-20 (Risk Management)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2205.01524. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Machine learning techniques in joint default assessment

Author

Abstract

Suggested Citation

Download full text from publisher

Other versions of this item:

References listed on IDEAS

Citations

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data