IDEAS home Printed from https://ideas.repec.org/a/eee/intfor/v37y2021i4p1590-1613.html
   My bibliography  Save this article

Forecasting loss given default for peer-to-peer loans via heterogeneous stacking ensemble approach

Author

Listed:
  • Xia, Yufei
  • Zhao, Junhao
  • He, Lingyun
  • Li, Yinguo
  • Yang, Xiaoli

Abstract

Peer-to-peer (P2P) lending is an emerging field in FinTech and is an alternative source of personal loans. However, P2P lending faces severe credit risk due to high information asymmetry and insufficient collateral. We develop a novel heterogeneous stacking ensemble (HSE) approach by using two real-world datasets to improve the loss given default (LGD) forecasting in the P2P lending domain. Some special data in P2P lending and macroeconomic variables are employed as supplementary data sources to further enhance the model performance. Our proposal is compared with several popular models, including parametric and non-parametric ones, in terms of predictive accuracy and capital requirement. Our finding reveals that special data in P2P lending (e.g., number of investors and loan description) and macroeconomic variables are powerful predictors of LGD in P2P lending. The proposed HSE model outperforms the benchmark models in most cases and significantly achieves optimal average ranks across all the evaluation metrics. The results remain robust under several validations.

Suggested Citation

  • Xia, Yufei & Zhao, Junhao & He, Lingyun & Li, Yinguo & Yang, Xiaoli, 2021. "Forecasting loss given default for peer-to-peer loans via heterogeneous stacking ensemble approach," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1590-1613.
  • Handle: RePEc:eee:intfor:v:37:y:2021:i:4:p:1590-1613
    DOI: 10.1016/j.ijforecast.2021.03.002
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0169207021000534
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ijforecast.2021.03.002?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Herzenstein, Michal & Dholakia, Utpal M. & Andrews, Rick L., 2011. "Strategic Herding Behavior in Peer-to-Peer Loan Auctions," Journal of Interactive Marketing, Elsevier, vol. 25(1), pages 27-36.
    2. Sydney C. Ludvigson, 2004. "Consumer Confidence and Consumer Spending," Journal of Economic Perspectives, American Economic Association, vol. 18(2), pages 29-50, Spring.
    3. Guo, Yanhong & Zhou, Wenjun & Luo, Chunyu & Liu, Chuanren & Xiong, Hui, 2016. "Instance-based credit risk assessment for investment decisions in P2P lending," European Journal of Operational Research, Elsevier, vol. 249(2), pages 417-426.
    4. repec:bla:ecnote:v:33:y:2004:i:2:p:183-208 is not listed on IDEAS
    5. Katarzyna Bijak & Lyn C Thomas, 2015. "Modelling LGD for unsecured retail loans using Bayesian methods," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 66(2), pages 342-352, February.
    6. Lean Yu & Zebin Yang & Ling Tang, 2016. "A novel multistage deep belief network based extreme learning machine ensemble learning paradigm for credit risk assessment," Flexible Services and Manufacturing Journal, Springer, vol. 28(4), pages 576-592, December.
    7. João Bastos, 2014. "Ensemble Predictions of Recovery Rates," Journal of Financial Services Research, Springer;Western Finance Association, vol. 46(2), pages 177-193, October.
    8. Frontczak, Robert & Rostek, Stefan, 2015. "Modeling loss given default with stochastic collateral," Economic Modelling, Elsevier, vol. 44(C), pages 162-170.
    9. Tanoue, Yuta & Kawada, Akihiro & Yamashita, Satoshi, 2017. "Forecasting loss given default of bank loans with multi-stage model," International Journal of Forecasting, Elsevier, vol. 33(2), pages 513-522.
    10. Carlos Serrano-Cinca & Begoña Gutiérrez-Nieto & Luz López-Palacios, 2015. "Determinants of Default in P2P Lending," PLOS ONE, Public Library of Science, vol. 10(10), pages 1-22, October.
    11. Jagtiani, Julapa & Lemieux, Catharine, 2018. "Do fintech lenders penetrate areas that are underserved by traditional banks?," Journal of Economics and Business, Elsevier, vol. 100(C), pages 43-54.
    12. Garry Bruton & Susanna Khavul & Donald Siegel & Mike Wright, 2015. "New Financial Alternatives in Seeding Entrepreneurship: Microfinance, Crowdfunding, and Peer–to–Peer Innovations," Entrepreneurship Theory and Practice, , vol. 39(1), pages 9-26, January.
    13. Do, Hung Xuan & Rösch, Daniel & Scheule, Harald, 2018. "Predicting loss severities for residential mortgage loans: A three-step selection approach," European Journal of Operational Research, Elsevier, vol. 270(1), pages 246-259.
    14. Tong, Edward N.C. & Mues, Christophe & Thomas, Lyn, 2013. "A zero-adjusted gamma model for mortgage loan loss given default," International Journal of Forecasting, Elsevier, vol. 29(4), pages 548-562.
    15. A Matuszyk & C Mues & L C Thomas, 2010. "Modelling LGD for unsecured personal loans: decision tree approach," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(3), pages 393-398, March.
    16. Guangyou Zhou & Yijia Zhang & Sumei Luo, 2018. "P2P Network Lending, Loss Given Default and Credit Risks," Sustainability, MDPI, vol. 10(4), pages 1-15, March.
    17. Bastos, João A., 2010. "Forecasting bank loans loss-given-default," Journal of Banking & Finance, Elsevier, vol. 34(10), pages 2510-2517, October.
    18. Freedman, Seth & Jin, Ginger Zhe, 2017. "The information value of online social networks: Lessons from peer-to-peer lending," International Journal of Industrial Organization, Elsevier, vol. 51(C), pages 185-222.
    19. Qi, Min & Zhao, Xinlei, 2011. "Comparison of modeling methods for Loss Given Default," Journal of Banking & Finance, Elsevier, vol. 35(11), pages 2842-2855, November.
    20. Edward I. Altman & Brooks Brady & Andrea Resti & Andrea Sironi, 2005. "The Link between Default and Recovery Rates: Theory, Empirical Evidence, and Implications," The Journal of Business, University of Chicago Press, vol. 78(6), pages 2203-2228, November.
    21. Yufei Xia & Lingyun He & Yinguo Li & Nana Liu & Yanlin Ding, 2020. "Predicting loan default in peer‐to‐peer lending using narrative data," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(2), pages 260-280, March.
    22. Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
    23. Dorfleitner, Gregor & Priberny, Christopher & Schuster, Stephanie & Stoiber, Johannes & Weber, Martina & de Castro, Ivan & Kammler, Julia, 2016. "Description-text related soft information in peer-to-peer lending – Evidence from two leading European platforms," Journal of Banking & Finance, Elsevier, vol. 64(C), pages 169-187.
    24. Yao, Xiao & Crook, Jonathan & Andreeva, Galina, 2015. "Support vector regression for loss given default modelling," European Journal of Operational Research, Elsevier, vol. 240(2), pages 528-538.
    25. Esa Jokivuolle & Samu Peura, 2003. "Incorporating Collateral Value Uncertainty in Loss Given Default Estimates and Loan‐to‐value Ratios," European Financial Management, European Financial Management Association, vol. 9(3), pages 299-314, September.
    26. Yao, Xiao & Crook, Jonathan & Andreeva, Galina, 2017. "Enhancing two-stage modelling methodology for loss given default with support vector machines," European Journal of Operational Research, Elsevier, vol. 263(2), pages 679-689.
    27. Loterman, Gert & Brown, Iain & Martens, David & Mues, Christophe & Baesens, Bart, 2012. "Benchmarking regression algorithms for loss given default modeling," International Journal of Forecasting, Elsevier, vol. 28(1), pages 161-170.
    28. Nazemi, Abdolreza & Fatemi Pour, Farnoosh & Heidenreich, Konstantin & Fabozzi, Frank J., 2017. "Fuzzy decision fusion approach for loss-given-default modeling," European Journal of Operational Research, Elsevier, vol. 262(2), pages 780-791.
    29. Mindy Leow & Christophe Mues & Lyn Thomas, 2014. "The economy and loss given default: evidence from two UK retail lending data sets," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 65(3), pages 363-375, March.
    30. Riza Emekter & Yanbin Tu & Benjamas Jirasakuldech & Min Lu, 2015. "Evaluating credit risk and loan performance in online Peer-to-Peer (P2P) lending," Applied Economics, Taylor & Francis Journals, vol. 47(1), pages 54-70, January.
    31. Mingfeng Lin & Nagpurnanand R. Prabhala & Siva Viswanathan, 2013. "Judging Borrowers by the Company They Keep: Friendship Networks and Information Asymmetry in Online Peer-to-Peer Lending," Management Science, INFORMS, vol. 59(1), pages 17-35, August.
    32. Hartmann-Wendels, Thomas & Miller, Patrick & Töws, Eugen, 2014. "Loss given default for leasing: Parametric and nonparametric estimations," Journal of Banking & Finance, Elsevier, vol. 40(C), pages 364-375.
    33. T Bellotti & J Crook, 2009. "Credit scoring with macroeconomic variables using survival analysis," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 60(12), pages 1699-1707, December.
    34. Juanjuan Zhang & Peng Liu, 2012. "Rational Herding in Microloan Markets," Management Science, INFORMS, vol. 58(5), pages 892-912, May.
    35. Leow, Mindy & Mues, Christophe, 2012. "Predicting loss given default (LGD) for residential mortgage loans: A two-stage model and empirical evidence for UK bank data," International Journal of Forecasting, Elsevier, vol. 28(1), pages 183-195.
    36. Ellen Tobback & David Martens & Tony Van Gestel & Bart Baesens, 2014. "Forecasting Loss Given Default models: impact of account characteristics and the macroeconomic state," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 65(3), pages 376-392, March.
    37. Bellotti, Tony & Crook, Jonathan, 2012. "Loss given default models incorporating macroeconomic variables for credit cards," International Journal of Forecasting, Elsevier, vol. 28(1), pages 171-182.
    38. Zhang, Jie & Thomas, Lyn C., 2012. "Comparisons of linear regression and survival analysis using single and mixture distributions approaches in modelling LGD," International Journal of Forecasting, Elsevier, vol. 28(1), pages 204-215.
    39. Dermine, J. & de Carvalho, C. Neto, 2006. "Bank loan losses-given-default: A case study," Journal of Banking & Finance, Elsevier, vol. 30(4), pages 1219-1243, April.
    40. Hurlin, Christophe & Leymarie, Jérémy & Patin, Antoine, 2018. "Loss functions for Loss Given Default model comparison," European Journal of Operational Research, Elsevier, vol. 268(1), pages 348-360.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Shi, Tao & Li, Chongyang & Wanyan, Hong & Xu, Ying & Zhang, Wei, 2022. "The lending risk predicting of the folk informal financial organization from big data using the deep learning hybrid model," Finance Research Letters, Elsevier, vol. 50(C).
    2. Li, Aimin & Li, Zhiyong & Bellotti, Anthony, 2023. "Predicting loss given default of unsecured consumer loans with time-varying survival scores," Pacific-Basin Finance Journal, Elsevier, vol. 78(C).
    3. Liu, Wanan & Fan, Hong & Xia, Meng, 2023. "Tree-based heterogeneous cascade ensemble model for credit scoring," International Journal of Forecasting, Elsevier, vol. 39(4), pages 1593-1614.
    4. Yufei Xia & Xinyi Guo & Yinguo Li & Lingyun He & Xueyuan Chen, 2022. "Deep learning meets decision trees: An application of a heterogeneous deep forest approach in credit scoring for online consumer lending," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(8), pages 1669-1690, December.
    5. Li, Zhiyong & Li, Aimin & Bellotti, Anthony & Yao, Xiao, 2023. "The profitability of online loans: A competing risks analysis on default and prepayment," European Journal of Operational Research, Elsevier, vol. 306(2), pages 968-985.
    6. Choudhary, Priya & Thenmozhi, M., 2024. "Fintech and financial sector: ADO analysis and future research agenda," International Review of Financial Analysis, Elsevier, vol. 93(C).
    7. Bastos, João A. & Matos, Sara M., 2022. "Explainable models of credit losses," European Journal of Operational Research, Elsevier, vol. 301(1), pages 386-394.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Li, Aimin & Li, Zhiyong & Bellotti, Anthony, 2023. "Predicting loss given default of unsecured consumer loans with time-varying survival scores," Pacific-Basin Finance Journal, Elsevier, vol. 78(C).
    2. Hurlin, Christophe & Leymarie, Jérémy & Patin, Antoine, 2018. "Loss functions for Loss Given Default model comparison," European Journal of Operational Research, Elsevier, vol. 268(1), pages 348-360.
    3. Dimitris Andriosopoulos & Michalis Doumpos & Panos M. Pardalos & Constantin Zopounidis, 2019. "Computational approaches and data analytics in financial services: A literature review," Journal of the Operational Research Society, Taylor & Francis Journals, vol. 70(10), pages 1581-1599, October.
    4. Yuta Tanoue & Satoshi Yamashita & Hideaki Nagahata, 2020. "Comparison study of two-step LGD estimation model with probability machines," Risk Management, Palgrave Macmillan, vol. 22(3), pages 155-177, September.
    5. Li, Zhiyong & Li, Aimin & Bellotti, Anthony & Yao, Xiao, 2023. "The profitability of online loans: A competing risks analysis on default and prepayment," European Journal of Operational Research, Elsevier, vol. 306(2), pages 968-985.
    6. Nazemi, Abdolreza & Fatemi Pour, Farnoosh & Heidenreich, Konstantin & Fabozzi, Frank J., 2017. "Fuzzy decision fusion approach for loss-given-default modeling," European Journal of Operational Research, Elsevier, vol. 262(2), pages 780-791.
    7. Kaposty, Florian & Kriebel, Johannes & Löderbusch, Matthias, 2020. "Predicting loss given default in leasing: A closer look at models and variable selection," International Journal of Forecasting, Elsevier, vol. 36(2), pages 248-266.
    8. Marc Gürtler & Marvin Zöllner, 2023. "Heterogeneities among credit risk parameter distributions: the modality defines the best estimation method," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 45(1), pages 251-287, March.
    9. Chen, Xiaowei & Wang, Gang & Zhang, Xiangting, 2019. "Modeling recovery rate for leveraged loans," Economic Modelling, Elsevier, vol. 81(C), pages 231-241.
    10. Emily Johnston Ross & Lynn Shibut, 2021. "Loss Given Default, Loan Seasoning and Financial Fragility: Evidence from Commercial Real Estate Loans at Failed Banks," The Journal of Real Estate Finance and Economics, Springer, vol. 63(4), pages 630-661, November.
    11. Betz, Jennifer & Kellner, Ralf & Rösch, Daniel, 2018. "Systematic Effects among Loss Given Defaults and their Implications on Downturn Estimation," European Journal of Operational Research, Elsevier, vol. 271(3), pages 1113-1144.
    12. Jennifer Betz & Ralf Kellner & Daniel Rösch, 2021. "Time matters: How default resolution times impact final loss rates," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(3), pages 619-644, June.
    13. Starosta, Wojciech, 2021. "Loss given default decomposition using mixture distributions of in-default events," European Journal of Operational Research, Elsevier, vol. 292(3), pages 1187-1199.
    14. Kellner, Ralf & Nagl, Maximilian & Rösch, Daniel, 2022. "Opening the black box – Quantile neural networks for loss given default prediction," Journal of Banking & Finance, Elsevier, vol. 134(C).
    15. Paolo Gambetti & Francesco Roccazzella & Frédéric Vrins, 2022. "Meta-Learning Approaches for Recovery Rate Prediction," Risks, MDPI, vol. 10(6), pages 1-29, June.
    16. Do, Hung Xuan & Rösch, Daniel & Scheule, Harald, 2018. "Predicting loss severities for residential mortgage loans: A three-step selection approach," European Journal of Operational Research, Elsevier, vol. 270(1), pages 246-259.
    17. Miller, Patrick & Töws, Eugen, 2018. "Loss given default adjusted workout processes for leases," Journal of Banking & Finance, Elsevier, vol. 91(C), pages 189-201.
    18. Christophe Hurlin & Jérémy Leymarie & Antoine Patin, 2018. "Loss functions for LGD model comparison," Working Papers halshs-01516147, HAL.
    19. Salvatore D. Tomarchio & Antonio Punzo, 2019. "Modelling the loss given default distribution via a family of zero‐and‐one inflated mixture models," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 182(4), pages 1247-1266, October.
    20. Thamayanthi Chellathurai, 2017. "Probability Density Of Recovery Rate Given Default Of A Firm’S Debt And Its Constituent Tranches," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 20(04), pages 1-34, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:intfor:v:37:y:2021:i:4:p:1590-1613. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/ijforecast .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.