IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1707.04831.html

Machine learning application in online lending risk prediction

Author

Listed:
  • Xiaojiao Yu

Abstract

Online lending has disrupted the traditional consumer banking sector with more effective loan processing. Risk prediction and monitoring are critical to the success of this business model. Traditional credit scoring models fall short in applying big data technology to risk modeling. In this manuscript, data of various formats and sizes were collected from public websites and third parties and combined with the clients' loan application data. Two ensemble machine learning models, a random forest model and an XGBoost model, were trained on historical transaction data and subsequently tested on separate data. The XGBoost model shows a higher K-S value, suggesting better classification capability in this task. The top 10 features by importance in the two models suggest that external data, such as zhimaScore, information on loans stacked across multiple platforms, and social network information, are important factors in predicting loan default probability.
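
The abstract compares the two models by K-S value but does not say how it was computed. As a rough illustration only, the sketch below uses the common credit-scoring definition: the largest gap between the cumulative score distributions of defaulted and repaid loans. The function name `ks_statistic`, the label coding (1 = default, 0 = repaid), and the scores are hypothetical and are not taken from the paper.

```python
from bisect import bisect_right


def ks_statistic(scores, labels):
    """Two-sample K-S statistic between the score distributions of
    defaulted loans (label 1) and repaid loans (label 0).

    Assumption: `scores` are predicted default probabilities and
    `labels` use 1 = default, 0 = repaid (illustrative coding).
    """
    bad = sorted(s for s, y in zip(scores, labels) if y == 1)
    good = sorted(s for s, y in zip(scores, labels) if y == 0)
    if not bad or not good:
        raise ValueError("need at least one loan of each class")

    ks = 0.0
    # Evaluate both empirical CDFs at every observed score and keep
    # the largest absolute gap between them.
    for threshold in sorted(set(scores)):
        cdf_bad = bisect_right(bad, threshold) / len(bad)
        cdf_good = bisect_right(good, threshold) / len(good)
        ks = max(ks, abs(cdf_bad - cdf_good))
    return ks


# Made-up scores from two hypothetical models on the same eight test loans.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
model_a = [0.1, 0.2, 0.3, 0.6, 0.4, 0.7, 0.8, 0.9]
model_b = [0.1, 0.5, 0.3, 0.6, 0.4, 0.2, 0.8, 0.9]

print("model_a K-S:", ks_statistic(model_a, labels))
print("model_b K-S:", ks_statistic(model_b, labels))
```

On this toy data `model_a` separates the two classes more cleanly and therefore gets the larger K-S value, which is the sense in which a higher K-S value is read as better classification capability.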

Suggested Citation

  • Xiaojiao Yu, 2017. "Machine learning application in online lending risk prediction," Papers 1707.04831, arXiv.org.
  • Handle: RePEc:arx:papers:1707.04831

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1707.04831
    File Function: Latest version
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ruyi Ge & Juan Feng & Bin Gu, 2016. "Borrower’s default and self-disclosure of social media information in P2P lending," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 2(1), pages 1-6, December.
    2. Jackson J. Mi & Tianxiao Hu & Luke Deer, 2018. "User Data Can Tell Defaulters in P2P Lending," Annals of Data Science, Springer, vol. 5(1), pages 59-67, March.
    3. Rajkamal Iyer & Asim Ijaz Khwaja & Erzo F. P. Luttmer & Kelly Shue, 2016. "Screening Peers Softly: Inferring the Quality of Small Borrowers," Management Science, INFORMS, vol. 62(6), pages 1554-1577, June.
    4. Xiao-hong Chen & Fu-jing Jin & Qun Zhang & Li Yang, 2016. "Are investors rational or perceptual in P2P lending?," Information Systems and e-Business Management, Springer, vol. 14(4), pages 921-944, November.
    5. Qizhi Tao & Yizhe Dong & Ziming Lin, 2017. "Who can get money? Evidence from the Chinese peer-to-peer lending platform," Information Systems Frontiers, Springer, vol. 19(3), pages 425-441, June.
    6. Jianrong Yao & Jiarui Chen & June Wei & Yuangao Chen & Shuiqing Yang, 2019. "The relationship between soft information in loan titles and online peer-to-peer lending: evidence from RenRenDai platform," Electronic Commerce Research, Springer, vol. 19(1), pages 111-129, March.
    7. Miller, Sarah, 2015. "Information and default in consumer credit markets: Evidence from a natural experiment," Journal of Financial Intermediation, Elsevier, vol. 24(1), pages 45-70.
    8. Iyer, Rajkamal & Khwaja, Asim Ijaz & Luttmer, Erzo F. P. & Shue, Kelly, 2009. "Screening in New Credit Markets: Can Individual Lenders Infer Borrower Creditworthiness in Peer-to-Peer Lending?," Working Paper Series rwp09-031, Harvard University, John F. Kennedy School of Government.
    9. Yeujun Yoon & Yu Li & Yan Feng, 2019. "Factors affecting platform default risk in online peer-to-peer (P2P) lending business: an empirical study using Chinese online P2P platform data," Electronic Commerce Research, Springer, vol. 19(1), pages 131-158, March.
    10. Efraim Berkovich, 2011. "Search and herding effects in peer-to-peer lending: evidence from prosper.com," Annals of Finance, Springer, vol. 7(3), pages 389-405, August.
    11. Xiangxiang Zeng & Li Liu & Stephen Leung & Jiangze Du & Xun Wang & Tao Li, 2017. "A decision support model for investment on P2P lending platform," PLOS ONE, Public Library of Science, vol. 12(9), pages 1-18, September.
    12. Juanjuan Zhang & Peng Liu, 2012. "Rational Herding in Microloan Markets," Management Science, INFORMS, vol. 58(5), pages 892-912, May.
    13. Qizhi Tao & Yizhe Dong & Ziming Lin, 0. "Who can get money? Evidence from the Chinese peer-to-peer lending platform," Information Systems Frontiers, Springer, vol. 0, pages 1-17.
    14. Seth M. Freedman & Ginger Zhe Jin, 2011. "Learning by Doing with Asymmetric Information: Evidence from Prosper.com," NBER Working Papers 16855, National Bureau of Economic Research, Inc.
    15. Nadia Nahar Purkayastha & Şule Erdem Tuzlukaya, 2020. "Determination Of The Benefits And Risks Of Peer-To-Peer (P2p) Lending: A Social Network Teory Approach," Copernican Journal of Finance & Accounting, Uniwersytet Mikolaja Kopernika, vol. 9(3), pages 131-143.
    16. Xiong Xiong & Zhang Jin & Jin Xi & Feng Xu, 2016. "Review on Financial Innovations in Big Data Era," Journal of Systems Science and Information, De Gruyter, vol. 4(6), pages 489-504, December.
    17. Nataliya Barasinska & Dorothea Schäfer, 2010. "Are Women More Credit-Constrained than Men?: Evidence from a Rising Credit Market," Working Paper / FINESS 6.3, DIW Berlin, German Institute for Economic Research.
    18. Stephan Meier & Charles Sprenger, 2010. "Present-Biased Preferences and Credit Card Borrowing," American Economic Journal: Applied Economics, American Economic Association, vol. 2(1), pages 193-210, January.
    19. Roy, Saktinil & Kemme, David M., 2012. "Causes of banking crises: Deregulation, credit booms and asset bubbles, then and now," International Review of Economics & Finance, Elsevier, vol. 24(C), pages 270-294.
    20. Efraim Benmelech & Ralf R. Meisenzahl & Rodney Ramcharan, 2017. "The Real Effects of Liquidity During the Financial Crisis: Evidence from Automobiles," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 132(1), pages 317-365.
