IDEAS home Printed from https://ideas.repec.org/a/gam/jfinte/v3y2024i1p12-215d1351432.html
   My bibliography  Save this article

Reimagining Peer-to-Peer Lending Sustainability: Unveiling Predictive Insights with Innovative Machine Learning Approaches for Loan Default Anticipation

Author

Listed:
  • Ly Nguyen

    (Emissis Ltd., 2 Ellerbeck Court, Stockley Business Park, Middlesbrough TS9 5PT, UK)

  • Mominul Ahsan

    (Department of Computer Science, University of York, Deramore Lane, York YO10 5GH, UK)

  • Julfikar Haider

    (Department of Engineering, Manchester Metropolitan University, John Dalton Building, Chester Street, Manchester M1 5GD, UK)

Abstract

Peer-to-peer lending, a novel element of Internet finance that links lenders and borrowers via online platforms, has generated large profits for investors. However, borrowers’ missed payments have negatively impacted the industry’s sustainable growth. It is imperative to create a system that can correctly predict loan defaults to lessen the damage brought on by defaulters. The goal of this study is to fill the gap in the literature by exploring the feasibility of developing prediction models for P2P loan defaults without relying heavily on personal data while also focusing on identifying key variables influencing borrowers’ repayment capacity through systematic feature selection and exploratory data analysis. Given this, this study aims to create a computational model that aids lenders in determining the approval or rejection of a loan application, relying on the financial data provided by applicants. The selected dataset, sourced from an open database, contains 8578 transaction records and includes 14 attributes related to financial information, with no personal data included. A loan dataset is first subjected to an in-depth exploratory data analysis to find behaviors connected to loan defaults. Subsequently, diverse and noteworthy machine learning classification algorithms, including Random Forest, Support Vector Machine, Decision Tree, Logistic Regression, Naïve Bayes, and XGBoost, were employed to build models capable of discerning borrowers who repay their loans from those who do not. Our findings indicate that borrowers who fail to comply with their lenders’ credit policies, pay elevated interest rates, and possess low FICO ratings are at a higher likelihood of defaulting. Furthermore, elevated risk is observed among clients who obtain loans for small businesses. All classification models, including XGBoost and Random Forest, successfully developed and performed satisfactorily and achieved an accuracy of over 80%. When the decision threshold is set to 0.4, the best performance for predicting loan defaulters is achieved using logistic regression, which accurately identifies 83% of the defaulted loans, with a recall of 83%, precision of 21% and f1 score of 33%.

Suggested Citation

  • Ly Nguyen & Mominul Ahsan & Julfikar Haider, 2024. "Reimagining Peer-to-Peer Lending Sustainability: Unveiling Predictive Insights with Innovative Machine Learning Approaches for Loan Default Anticipation," FinTech, MDPI, vol. 3(1), pages 1-32, March.
  • Handle: RePEc:gam:jfinte:v:3:y:2024:i:1:p:12-215:d:1351432
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2674-1032/3/1/12/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2674-1032/3/1/12/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Adam Nowak & Amanda Ross & Christopher Yencha, 2018. "Small Business Borrowing And Peer‐To‐Peer Lending: Evidence From Lending Club," Contemporary Economic Policy, Western Economic Association International, vol. 36(2), pages 318-336, April.
    2. Cuiqing Jiang & Zhao Wang & Ruiya Wang & Yong Ding, 2018. "Loan default prediction by combining soft information extracted from descriptive text in online peer-to-peer lending," Annals of Operations Research, Springer, vol. 266(1), pages 511-529, July.
    3. Seth Freedman & Ginger Zhe Jin, 2008. "Do Social Networks Solve Information Problems for Peer-to-Peer Lending? Evidence from Prosper.com," Working Papers 08-43, NET Institute.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gregor Dorfleitner & Eva-Maria Oswald & Rongxin Zhang, 2021. "From Credit Risk to Social Impact: On the Funding Determinants in Interest-Free Peer-to-Peer Lending," Journal of Business Ethics, Springer, vol. 170(2), pages 375-400, May.
    2. Juanjuan Chen & Yabin Zhang & Zhujia Yin, 2018. "Education Premium In The Online Peer-To-Peer Lending Marketplace: Evidence From The Big Data In China," The Singapore Economic Review (SER), World Scientific Publishing Co. Pte. Ltd., vol. 63(01), pages 45-64, March.
    3. Christopher Gerling & Stefan Lessmann, 2023. "Multimodal Document Analytics for Banking Process Automation," Papers 2307.11845, arXiv.org, revised Nov 2023.
    4. Ejaz Ghani & William R. Kerr & Christopher Stanton, 2014. "Diasporas and Outsourcing: Evidence from oDesk and India," Management Science, INFORMS, vol. 60(7), pages 1677-1697, July.
    5. Xueru Chen & Xiaoji Hu & Shenglin Ben, 2021. "How do reputation, structure design and FinTech ecosystem affect the net cash inflow of P2P lending platforms? Evidence from China," Electronic Commerce Research, Springer, vol. 21(4), pages 1055-1082, December.
    6. Ajay Agrawal & Christian Catalini & Avi Goldfarb, 2014. "Some Simple Economics of Crowdfunding," Innovation Policy and the Economy, University of Chicago Press, vol. 14(1), pages 63-97.
    7. Philippe Bernard & Najat El Mekkaoui De Freitas & Bertrand B. Maillet, 2022. "A financial fraud detection indicator for investors: an IDeA," Annals of Operations Research, Springer, vol. 313(2), pages 809-832, June.
    8. Wangcheng Yan & Wenjun Zhou, 2023. "Is blockchain a cure for peer-to-peer lending?," Annals of Operations Research, Springer, vol. 321(1), pages 693-716, February.
    9. Kovacs, Attila, 2018. "Gender Differences in Equity Crowdfunding," OSF Preprints 5pcmb, Center for Open Science.
    10. Peng Wang & Haichao Zheng & Dongyu Chen & Liangchao Ding, 2015. "Exploring the critical factors influencing online lending intentions," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 1(1), pages 1-11, December.
    11. Ata Allah Taleizadeh & Aria Zaker Safaei & Arijit Bhattacharya & Alireza Amjadian, 2022. "Online peer-to-peer lending platform and supply chain finance decisions and strategies," Annals of Operations Research, Springer, vol. 315(1), pages 397-427, August.
    12. Dongyu Chen & Xiaolin Li & Fujun Lai, 2023. "Shill bidding in lenders’ eyes? A cross-country study on the influence of large bids in online P2P lending," Electronic Commerce Research, Springer, vol. 23(2), pages 1089-1114, June.
    13. Käfer Benjamin, 2018. "Peer-to-Peer Lending – A (Financial Stability) Risk Perspective," Review of Economics, De Gruyter, vol. 69(1), pages 1-25, April.
    14. Yanhong Guo & Shuai Jiang & Wenjun Zhou & Chunyu Luo & Hui Xiong, 2021. "A predictive indicator using lender composition for loan evaluation in P2P lending," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 7(1), pages 1-24, December.
    15. Kriebel, Johannes & Stitz, Lennart, 2022. "Credit default prediction from user-generated text in peer-to-peer lending using deep learning," European Journal of Operational Research, Elsevier, vol. 302(1), pages 309-323.
    16. repec:zbw:bofrdp:urn:nbn:fi:bof-201511261452 is not listed on IDEAS
    17. Jiang, Cuiqing & Lyu, Ximei & Yuan, Yufei & Wang, Zhao & Ding, Yong, 2022. "Mining semantic features in current reports for financial distress prediction: Empirical evidence from unlisted public firms in China," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1086-1099.
    18. Soumajyoti Sarkar & Hamidreza Alvari, 2020. "Mitigating Bias in Online Microfinance Platforms: A Case Study on Kiva.org," Papers 2006.12995, arXiv.org.
    19. Yufei Xia & Lingyun He & Yinguo Li & Nana Liu & Yanlin Ding, 2020. "Predicting loan default in peer‐to‐peer lending using narrative data," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(2), pages 260-280, March.
    20. Xiaoyu Li & Jiahong Yuan & Yan Shi & Zilai Sun & Junhu Ruan, 2020. "Emerging Trends and Innovation Modes of Internet Finance—Results from Co-Word and Co-Citation Networks," Future Internet, MDPI, vol. 12(3), pages 1-14, March.
    21. Liu, Yezheng & Qian, Yang & Jiang, Yuanchun & Shang, Jennifer, 2020. "Using favorite data to analyze asymmetric competition: Machine learning models," European Journal of Operational Research, Elsevier, vol. 287(2), pages 600-615.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jfinte:v:3:y:2024:i:1:p:12-215:d:1351432. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.