IDEAS home Printed from https://ideas.repec.org/a/oup/jfinec/v21y2023i2p569-596..html
   My bibliography  Save this article

Forecasting Loan Default in Europe with Machine Learning

Author

Listed:
  • Luca Barbaglia
  • Sebastiano Manzan
  • Elisa Tosetti

Abstract

We use a dataset of 12 million residential mortgages to investigate the loan default behavior in several European countries. We model the default occurrence as a function of borrower characteristics, loan-specific variables, and local economic conditions. We compare the performance of a set of machine learning algorithms relative to the logistic regression, finding that they perform significantly better in providing predictions. The most important variables in explaining loan default are the interest rate and the local economic characteristics. The existence of relevant geographical heterogeneity in the variable importance points at the need for regionally tailored risk-assessment policies in Europe.

Suggested Citation

  • Luca Barbaglia & Sebastiano Manzan & Elisa Tosetti, 2023. "Forecasting Loan Default in Europe with Machine Learning," Journal of Financial Econometrics, Oxford University Press, vol. 21(2), pages 569-596.
  • Handle: RePEc:oup:jfinec:v:21:y:2023:i:2:p:569-596.
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1093/jjfinec/nbab010
    Download Restriction: Access to full text is restricted to subscribers.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Marine Carrasco & Barbara Rossi, 2016. "In-Sample Inference and Forecasting in Misspecified Factor Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(3), pages 313-338, July.
    2. Andrii Babii & Eric Ghysels & Jonas Striaukas, 2022. "Machine Learning Time Series Regressions With an Application to Nowcasting," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(3), pages 1094-1106, June.
    3. Aron, Janine & Muellbauer, John, 2016. "“Modelling and forecasting mortgage delinquency and foreclosure in the UK.”," Journal of Urban Economics, Elsevier, vol. 94(C), pages 32-53.
    4. Desai, Vijay S. & Crook, Jonathan N. & Overstreet, George A., 1996. "A comparison of neural networks and linear scoring models in the credit union environment," European Journal of Operational Research, Elsevier, vol. 95(1), pages 24-37, November.
    5. Atif Mian & Amir Sufi & Emil Verner, 2017. "Household Debt and Business Cycles Worldwide," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 132(4), pages 1755-1817.
    6. John Y. Campbell & João F. Cocco, 2015. "A Model of Mortgage Default," Journal of Finance, American Finance Association, vol. 70(4), pages 1495-1554, August.
    7. Stefania Albanesi & Domonkos F. Vamossy, 2019. "Predicting Consumer Default: A Deep Learning Approach," NBER Working Papers 26165, National Bureau of Economic Research, Inc.
    8. David Feldman & Shulamith Gross, 2005. "Mortgage Default: Classification Trees Analysis," The Journal of Real Estate Finance and Economics, Springer, vol. 30(4), pages 369-396, June.
    9. Tullio Jappelli & Marco Pagano & Marco Di Maggio, 2013. "Households' indebtedness and financial fragility," Journal of Financial Management, Markets and Institutions, Società editrice il Mulino, issue 1, pages 23-46, January.
    10. King, Gary & Zeng, Langche, 2001. "Logistic Regression in Rare Events Data," Political Analysis, Cambridge University Press, vol. 9(2), pages 137-163, January.
    11. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    12. Lore Dirick & Tony Bellotti & Gerda Claeskens & Bart Baesens, 2019. "Macro-Economic Factors in Credit Risk Calculations: Including Time-Varying Covariates in Mixture Cure Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 37(1), pages 40-53, January.
    13. Deng, Yongheng, 1997. "Mortgage Termination: An Empirical Hazard Model with a Stochastic Term Structure," The Journal of Real Estate Finance and Economics, Springer, vol. 14(3), pages 309-331, May.
    14. Ampudia, Miguel & van Vlokhoven, Has & Żochowski, Dawid, 2016. "Financial fragility of euro area households," Journal of Financial Stability, Elsevier, vol. 27(C), pages 250-262.
    15. Patrick Bajari & Chenghuan Sean Chu & Minjung Park, 2008. "An Empirical Model of Subprime Mortgage Default From 2000 to 2007," NBER Working Papers 14625, National Bureau of Economic Research, Inc.
    16. Aytekin Ertan & Maria Loumioti & Regina Wittenberg‐Moerman, 2017. "Enhancing Loan Quality Through Transparency: Evidence from the European Central Bank Loan Level Reporting Initiative," Journal of Accounting Research, Wiley Blackwell, vol. 55(4), pages 877-918, September.
    17. Andreas Fuster & Paul S. Willen, 2017. "Payment Size, Negative Equity, and Mortgage Default," American Economic Journal: Economic Policy, American Economic Association, vol. 9(4), pages 167-191, November.
    18. Fitzpatrick, Trevor & Mues, Christophe, 2016. "An empirical comparison of classification algorithms for mortgage default prediction: evidence from a distressed mortgage market," European Journal of Operational Research, Elsevier, vol. 249(2), pages 427-439.
    19. Diaz-Serrano, Luis, 2004. "Income Volatility and Residential Mortgage Delinquency: Evidence from 12 EU Countries," IZA Discussion Papers 1396, Institute of Labor Economics (IZA).
    20. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    21. Diebold, Francis X & Mariano, Roberto S, 2002. "Comparing Predictive Accuracy," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(1), pages 134-144, January.
    22. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    23. Atif Mian & Amir Sufi, 2009. "The Consequences of Mortgage Credit Expansion: Evidence from the U.S. Mortgage Default Crisis," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 124(4), pages 1449-1496.
    24. Khandani, Amir E. & Kim, Adlar J. & Lo, Andrew W., 2010. "Consumer credit-risk models via machine-learning algorithms," Journal of Banking & Finance, Elsevier, vol. 34(11), pages 2767-2787, November.
    25. Babii, Andrii & Chen, Xi & Ghysels, Eric, 2019. "Commercial and Residential Mortgage Defaults: Spatial Dependence with Frailty," Journal of Econometrics, Elsevier, vol. 212(1), pages 47-77.
    26. Catherine D'Hondt & Rudy De Winne & Eric Ghysels & Steve Raymond, 2019. "Artificial Intelligence Alter Egos: Who benefits from Robo-investing?," Papers 1907.03370, arXiv.org.
    27. Manuel Adelino & Antoinette Schoar & Felipe Severino, 2016. "Editor's Choice Loan Originations and Defaults in the Mortgage Crisis: The Role of the Middle Class," The Review of Financial Studies, Society for Financial Studies, vol. 29(7), pages 1635-1670.
    28. Fritz Schiltz & Chiara Masci & Tommaso Agasisti & Daniel Horn, 2018. "Using regression tree ensembles to model interaction effects: a graphical approach," Applied Economics, Taylor & Francis Journals, vol. 50(58), pages 6341-6354, December.
    29. Cunningham, Donald F & Capone, Charles A, Jr, 1990. "The Relative Termination Experience of Adjustable to Fixed-Rate Mortgages," Journal of Finance, American Finance Association, vol. 45(5), pages 1687-1703, December.
    30. Sendhil Mullainathan & Jann Spiess, 2017. "Machine Learning: An Applied Econometric Approach," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 87-106, Spring.
    31. M. Teresa Sánchez-Martínez & Jose Sanchez-Campillo & Dolores Moreno-Herrero, 2016. "Mortgage debt and household vulnerability," International Journal of Housing Markets and Analysis, Emerald Group Publishing Limited, vol. 9(3), pages 400-420, August.
    32. Lambrecht, Bart M & Perraudin, William R M & Satchell, Steven, 2003. "Mortgage Default and Possession under Recourse: A Competing Hazards Approach," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 35(3), pages 425-442, June.
    33. Marcelo C. Medeiros & Gabriel F. R. Vasconcelos & Álvaro Veiga & Eduardo Zilberman, 2021. "Forecasting Inflation in a Data-Rich Environment: The Benefits of Machine Learning Methods," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 39(1), pages 98-119, January.
    34. Diaz-Serrano, Luis, 2005. "Income volatility and residential mortgage delinquency across the EU," Journal of Housing Economics, Elsevier, vol. 14(3), pages 153-177, September.
    35. Burcu Duygan-Bump & Charles Grant, 2009. "Household debt repayment behaviour: what role do institutions play? [‘Welfare implications of the Bankruptcy Reform Act of 1999’]," Economic Policy, CEPR, CESifo, Sciences Po;CES;MSH, vol. 24(57), pages 108-140.
    36. Aller, Carlos & Grant, Charles, 2018. "The effect of the financial crisis on default by Spanish households," Journal of Financial Stability, Elsevier, vol. 36(C), pages 39-52.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    2. Mocetti, Sauro & Viviano, Eliana, 2017. "Looking behind mortgage delinquencies," Journal of Banking & Finance, Elsevier, vol. 75(C), pages 53-63.
    3. Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
    4. O’Toole, Conor & Slaymaker, Rachel, 2021. "Repayment capacity, debt service ratios and mortgage default: An exploration in crisis and non-crisis periods," Journal of Banking & Finance, Elsevier, vol. 133(C).
    5. Pan, Shuiyang & Long, Suwan(Cheng) & Wang, Yiming & Xie, Ying, 2023. "Nonlinear asset pricing in Chinese stock market: A deep learning approach," International Review of Financial Analysis, Elsevier, vol. 87(C).
    6. Philippe Goulet Coulombe, 2021. "To Bag is to Prune," Working Papers 21-03, Chair in macroeconomics and forecasting, University of Quebec in Montreal's School of Management, revised Jun 2021.
    7. Grzegorz Wałęga & Agnieszka Wałęga, 2021. "Over-indebted Households in Poland: Classification Tree Analysis," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 153(2), pages 561-584, January.
    8. Zhang, Qin & Ni, He & Xu, Hao, 2023. "Nowcasting Chinese GDP in a data-rich environment: Lessons from machine learning algorithms," Economic Modelling, Elsevier, vol. 122(C).
    9. Hoang, Daniel & Wiegratz, Kevin, 2022. "Machine learning methods in finance: Recent applications and prospects," Working Paper Series in Economics 158, Karlsruhe Institute of Technology (KIT), Department of Economics and Management.
    10. Tobias Götze & Marc Gürtler & Eileen Witowski, 2020. "Improving CAT bond pricing models via machine learning," Journal of Asset Management, Palgrave Macmillan, vol. 21(5), pages 428-446, September.
    11. Georgarakos, Dimitris & Lojschova, Adriana & Ward-Warmedinger, Melanie E., 2009. "Mortgage Indebtedness and Household Financial Distress," IZA Discussion Papers 4631, Institute of Labor Economics (IZA).
    12. Bryan T. Kelly & Asaf Manela & Alan Moreira, 2019. "Text Selection," NBER Working Papers 26517, National Bureau of Economic Research, Inc.
    13. Borup, Daniel & Christensen, Bent Jesper & Mühlbach, Nicolaj Søndergaard & Nielsen, Mikkel Slot, 2023. "Targeting predictors in random forest regression," International Journal of Forecasting, Elsevier, vol. 39(2), pages 841-868.
    14. Daniel Borup & Philippe Goulet Coulombe & Erik Christian Montes Schütte & David E. Rapach & Sander Schwenk-Nebbe, 2022. "The Anatomy of Out-of-Sample Forecasting Accuracy," FRB Atlanta Working Paper 2022-16, Federal Reserve Bank of Atlanta.
    15. Gene Amromin & Jennifer Huang & Clemens Sialm & Edward Zhong, 2018. "Complex Mortgages [Why don’t lenders renegotiate more home mortgages? Redefaults, self-cures, and securitization]," Review of Finance, European Finance Association, vol. 22(6), pages 1975-2007.
    16. Ziwei Mei & Zhentao Shi, 2022. "On LASSO for High Dimensional Predictive Regression," Papers 2212.07052, arXiv.org, revised Jan 2024.
    17. Stanga, Irina & Vlahu, Razvan & de Haan, Jakob, 2020. "Mortgage arrears, regulation and institutions: Cross-country evidence," Journal of Banking & Finance, Elsevier, vol. 118(C).
    18. Paul Geertsema & Helen Lu, 2023. "Relative Valuation with Machine Learning," Journal of Accounting Research, Wiley Blackwell, vol. 61(1), pages 329-376, March.
    19. Lenarčič, Črt, 2022. "Drivers of household arrears: an euro area country panel data analysis," MPRA Paper 114558, University Library of Munich, Germany.
    20. Andrés Alonso Robisco & José Manuel Carbó Martínez, 2022. "Measuring the model risk-adjusted performance of machine learning algorithms in credit default prediction," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-35, December.

    More about this item

    Keywords

    big data; credit risk; loan default; machine learning; regional analysis;
    All these keywords.

    JEL classification:

    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • D14 - Microeconomics - - Household Behavior - - - Household Saving; Personal Finance
    • R11 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General Regional Economics - - - Regional Economic Activity: Growth, Development, Environmental Issues, and Changes

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:oup:jfinec:v:21:y:2023:i:2:p:569-596.. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Oxford University Press (email available below). General contact details of provider: https://edirc.repec.org/data/sofieea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.