IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v249y2016i2p427-439.html
   My bibliography  Save this article

An empirical comparison of classification algorithms for mortgage default prediction: evidence from a distressed mortgage market

Author

Listed:
  • Fitzpatrick, Trevor
  • Mues, Christophe

Abstract

This paper evaluates the performance of a number of modelling approaches for future mortgage default status. Boosted regression trees, random forests, penalised linear and semi-parametric logistic regression models are applied to four portfolios of over 300,000 Irish owner-occupier mortgages. The main findings are that the selected approaches have varying degrees of predictive power and that boosted regression trees significantly outperform logistic regression. This suggests that boosted regression trees can be a useful addition to the current toolkit for mortgage credit risk assessment by banks and regulators.

Suggested Citation

  • Fitzpatrick, Trevor & Mues, Christophe, 2016. "An empirical comparison of classification algorithms for mortgage default prediction: evidence from a distressed mortgage market," European Journal of Operational Research, Elsevier, vol. 249(2), pages 427-439.
  • Handle: RePEc:eee:ejores:v:249:y:2016:i:2:p:427-439
    DOI: 10.1016/j.ejor.2015.09.014
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221715008383
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2015.09.014?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hand, David J., 2009. "Mining the past to determine the future: Problems and possibilities," International Journal of Forecasting, Elsevier, vol. 25(3), pages 441-451, July.
    2. Horton, Nicholas J. & Kleinman, Ken P., 2007. "Much Ado About Nothing: A Comparison of Missing Data Methods and Software to Fit Incomplete Data Regression Models," The American Statistician, American Statistical Association, vol. 61, pages 79-90, February.
    3. Kennedy, Gerard & McIndoe-Calder, Tara, 2012. "The Irish Mortgage Market: Stylised Facts, Negative Equity and Arrears," Quarterly Bulletin Articles, Central Bank of Ireland, pages 85-108, February.
    4. Kuhn, Max, 2008. "Building Predictive Models in R Using the caret Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 28(i05).
    5. De Bock, Koen W. & Coussement, Kristof & Van den Poel, Dirk, 2010. "Ensemble classification based on generalized additive models," Computational Statistics & Data Analysis, Elsevier, vol. 54(6), pages 1535-1546, June.
    6. Daniel Berg, 2007. "Bankruptcy prediction by generalized additive models," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 23(2), pages 129-143, March.
    7. David Feldman & Shulamith Gross, 2005. "Mortgage Default: Classification Trees Analysis," The Journal of Real Estate Finance and Economics, Springer, vol. 30(4), pages 369-396, June.
    8. T Bellotti & J Crook, 2009. "Credit scoring with macroeconomic variables using survival analysis," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 60(12), pages 1699-1707, December.
    9. K Kennedy & B Mac Namee & S J Delany, 2013. "Using semi-supervised classifiers for credit scoring," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 64(4), pages 513-529, April.
    10. Foote, Christopher L. & Gerardi, Kristopher & Willen, Paul S., 2008. "Negative equity and foreclosure: Theory and evidence," Journal of Urban Economics, Elsevier, vol. 64(2), pages 234-245, September.
    11. Friedman, Jerome H., 2002. "Stochastic gradient boosting," Computational Statistics & Data Analysis, Elsevier, vol. 38(4), pages 367-378, February.
    12. Haughwout, Andrew & Peach, Richard & Tracy, Joseph, 2008. "Juvenile delinquent mortgages: Bad credit or bad economy?," Journal of Urban Economics, Elsevier, vol. 64(2), pages 246-257, September.
    13. Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
    14. Yongheng Deng & John M. Quigley & Robert Van Order, 2000. "Mortgage Terminations, Heterogeneity and the Exercise of Mortgage Options," Econometrica, Econometric Society, vol. 68(2), pages 275-308, March.
    15. Crook, Jonathan N. & Edelman, David B. & Thomas, Lyn C., 2007. "Recent developments in consumer credit risk assessment," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1447-1465, December.
    16. Galindo, J & Tamayo, P, 2000. "Credit Risk Assessment Using Statistical and Machine Learning: Basic Methodology and Risk Modeling Applications," Computational Economics, Springer;Society for Computational Economics, vol. 15(1-2), pages 107-143, April.
    17. B Baesens & T Van Gestel & S Viaene & M Stepanova & J Suykens & J Vanthienen, 2003. "Benchmarking state-of-the-art classification algorithms for credit scoring," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 54(6), pages 627-635, June.
    18. Martens, David & Baesens, Bart & Van Gestel, Tony & Vanthienen, Jan, 2007. "Comprehensible credit scoring models using rule extraction from support vector machines," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1466-1476, December.
    19. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    20. Khandani, Amir E. & Kim, Adlar J. & Lo, Andrew W., 2010. "Consumer credit-risk models via machine-learning algorithms," Journal of Banking & Finance, Elsevier, vol. 34(11), pages 2767-2787, November.
    21. Bastos, Joao, 2007. "Credit scoring with boosted decision trees," MPRA Paper 8034, University Library of Munich, Germany.
    22. Medema, Lydian & Koning, Ruud H. & Lensink, Robert, 2009. "A practical approach to validating a PD model," Journal of Banking & Finance, Elsevier, vol. 33(4), pages 701-708, April.
    23. Das, Sanjiv R., 2012. "The Principal Principle," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 47(6), pages 1215-1246, December.
    24. Das, Sanjiv R. & Meadows, Ray, 2013. "Strategic loan modification: An options-based response to strategic default," Journal of Banking & Finance, Elsevier, vol. 37(2), pages 636-647.
    25. Hand, David J., 2009. "Mining the past to determine the future: Rejoinder," International Journal of Forecasting, Elsevier, vol. 25(3), pages 461-462, July.
    26. Clifford M. Hurvich & Jeffrey S. Simonoff & Chih‐Ling Tsai, 1998. "Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(2), pages 271-293.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chen, Shunqin & Guo, Zhengfeng & Zhao, Xinlei, 2021. "Predicting mortgage early delinquency with machine learning methods," European Journal of Operational Research, Elsevier, vol. 290(1), pages 358-372.
    2. Richard Chamboko & Jorge Miguel Bravo, 2020. "A Multi-State Approach to Modelling Intermediate Events and Multiple Mortgage Loan Outcomes," Risks, MDPI, vol. 8(2), pages 1-29, June.
    3. Crone, Sven F. & Finlay, Steven, 2012. "Instance sampling in credit scoring: An empirical study of sample size and balancing," International Journal of Forecasting, Elsevier, vol. 28(1), pages 224-238.
    4. Stefan Lessmann & Stefan Voß, 2010. "Customer-Centric Decision Support," Business & Information Systems Engineering: The International Journal of WIRTSCHAFTSINFORMATIK, Springer;Gesellschaft für Informatik e.V. (GI), vol. 2(2), pages 79-93, April.
    5. Dimitris Andriosopoulos & Michalis Doumpos & Panos M. Pardalos & Constantin Zopounidis, 2019. "Computational approaches and data analytics in financial services: A literature review," Journal of the Operational Research Society, Taylor & Francis Journals, vol. 70(10), pages 1581-1599, October.
    6. TOBBACK, Ellen & MARTENS, David, 2017. "Retail credit scoring using fine-grained payment data," Working Papers 2017011, University of Antwerp, Faculty of Business and Economics.
    7. Reamonn Lyndon & Yvonne McCarthy, 2013. "What Lies Beneath? Understanding Recent Trends in Irish Mortgage Arrears," The Economic and Social Review, Economic and Social Studies, vol. 44(1), pages 117-150.
    8. Asish Saha & Hock-Eam Lim & Goh-Yeok Siew, 2021. "Housing Loan Repayment Behaviour in Malaysia: An Analytical Insight," International Journal of Business and Economics, School of Management Development, Feng Chia University, Taichung, Taiwan, vol. 20(2), pages 1-19, September.
    9. Goodstein, Ryan & Hanouna, Paul & Ramirez, Carlos D. & Stahel, Christof W., 2017. "Contagion effects in strategic mortgage defaults," Journal of Financial Intermediation, Elsevier, vol. 30(C), pages 50-60.
    10. Matthias Bogaert & Lex Delaere, 2023. "Ensemble Methods in Customer Churn Prediction: A Comparative Analysis of the State-of-the-Art," Mathematics, MDPI, vol. 11(5), pages 1-28, February.
    11. Michael Bucker & Gero Szepannek & Alicja Gosiewska & Przemyslaw Biecek, 2020. "Transparency, Auditability and eXplainability of Machine Learning Models in Credit Scoring," Papers 2009.13384, arXiv.org.
    12. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    13. Juan Laborda & Seyong Ryoo, 2021. "Feature Selection in a Credit Scoring Model," Mathematics, MDPI, vol. 9(7), pages 1-22, March.
    14. Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
    15. Mocetti, Sauro & Viviano, Eliana, 2017. "Looking behind mortgage delinquencies," Journal of Banking & Finance, Elsevier, vol. 75(C), pages 53-63.
    16. K. W. De Bock & D. Van Den Poel, 2012. "Reconciling Performance and Interpretability in Customer Churn Prediction using Ensemble Learning based on Generalized Additive Models," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 12/805, Ghent University, Faculty of Economics and Business Administration.
    17. Olson, Luke M. & Qi, Min & Zhang, Xiaofei & Zhao, Xinlei, 2021. "Machine learning loss given default for corporate debt," Journal of Empirical Finance, Elsevier, vol. 64(C), pages 144-159.
    18. Koen W. de Bock, 2017. "The best of two worlds: Balancing model strength and comprehensibility in business failure prediction using spline-rule ensembles," Post-Print hal-01588059, HAL.
    19. Kristopher Gerardi & Kyle F. Herkenhoff & Lee E. Ohanian & Paul S. Willen, 2018. "Can’t Pay or Won’t Pay? Unemployment, Negative Equity, and Strategic Default," The Review of Financial Studies, Society for Financial Studies, vol. 31(3), pages 1098-1131.
    20. Chen, Dangxing & Ye, Jiahui & Ye, Weicheng, 2023. "Interpretable selective learning in credit risk," Research in International Business and Finance, Elsevier, vol. 65(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:249:y:2016:i:2:p:427-439. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.