
A Framework of Global Credit-Scoring Modeling Using Outlier Detection and Machine Learning in a P2P Lending Platform

Author

Listed:
  • Dong-Her Shih

    (Department of Information Management, National Yunlin University of Science and Technology, Douliu 64002, Taiwan)

  • Ting-Wei Wu

    (Department of Information Management, National Yunlin University of Science and Technology, Douliu 64002, Taiwan)

  • Po-Yuan Shih

    (Department of Finance, National Yunlin University of Science and Technology, Douliu 64002, Taiwan)

  • Nai-An Lu

    (Department of Information Management, National Yunlin University of Science and Technology, Douliu 64002, Taiwan)

  • Ming-Hung Shih

    (Department of Electrical and Computer Engineering, Iowa State University, 2520 Osborn Drive, Ames, IA 50011, USA)

Abstract

A major challenge for credit-scoring models in online peer-to-peer (P2P) lending platforms is that they simply discard rejected applicants. This selective discarding prevents the platform from increasing the number of potentially qualified applicants, which ultimately affects its revenue. One way to deal with this is reject inference, a technique that infers the status of rejected samples and incorporates the results into a credit-scoring model. The most popular approach to reject inference uses a credit-scoring model built only on accepted samples to directly predict the status of rejected samples. However, in online P2P lending the distribution of accepted samples differs from that of rejected samples, so a credit-scoring model built on the original accepted samples may no longer apply. In addition, the accepted samples may include applicants who cannot repay their loans; if these applicants can be filtered out, the losses to the lending platform can also be reduced. Therefore, we propose a global credit-scoring model framework that combines multiple feature selection methods and classifiers to better evaluate the model after rejected samples are added. This study also uses outlier detection methods to explore the internal relationships among all samples, which makes it possible to remove outlier applicants from the accepted samples or to add outlier applicants from the rejected samples. Finally, this study uses four data samples and reject inference to construct four different credit-scoring models. The experimental results show that the proposed credit-scoring model, which combines Pearson-based feature selection with random forest, achieves significantly better accuracy and AUC than the models reported by other scholars. Compared with previous studies, using outlier detection to remove outliers from the accepted loan samples and to identify potentially creditworthy applicants among the rejected loan samples is a good strategy. Furthermore, this study not only improves the accuracy of the credit-scoring model but also increases the number of lenders, which in turn increases the profitability of the lending platform.
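
The data-preparation ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that "Pearson" refers to ranking features by their Pearson correlation with the repayment label, and it uses a simple z-score rule as a stand-in because the abstract does not name the outlier detector. The function names, the threshold, and the toy data are all hypothetical. The classifier step (random forest in the paper) is omitted.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return 0.0 if sx == 0 or sy == 0 else cov / (sx * sy)

def rank_features(rows, labels):
    """Feature indices sorted by |correlation with the label|, strongest first."""
    n_features = len(rows[0])
    scores = [abs(pearson([r[j] for r in rows], labels)) for j in range(n_features)]
    return sorted(range(n_features), key=lambda j: scores[j], reverse=True)

def zscore_outliers(rows, threshold=2.0):
    """Indices of rows with any feature more than `threshold` population
    standard deviations from the group mean (assumed stand-in detector)."""
    n = len(rows)
    cols = list(zip(*rows))
    means = [sum(c) / n for c in cols]
    stds = [sqrt(sum((v - m) ** 2 for v in c) / n) for c, m in zip(cols, means)]
    return [
        i for i, row in enumerate(rows)
        if any(s > 0 and abs(v - m) / s > threshold
               for v, m, s in zip(row, means, stds))
    ]

# Toy data invented for illustration: [income, debt_ratio, unrelated_noise]
accepted = [[50, 0.20, 3], [52, 0.25, 1], [48, 0.30, 2],
            [51, 0.22, 3], [49, 0.28, 1], [10, 0.90, 2]]
repaid = [1, 1, 1, 1, 1, 0]  # 1 = repaid, 0 = defaulted
rejected = [[20, 0.80, 1], [22, 0.85, 3], [18, 0.90, 2],
            [21, 0.82, 1], [19, 0.88, 3], [55, 0.20, 2]]

# Step 1: keep the two features most correlated with repayment.
top_features = rank_features(accepted, repaid)[:2]

# Step 2: drop accepted applicants that look atypical for the accepted group.
accepted_outliers = zscore_outliers(accepted)
cleaned_accepted = [r for i, r in enumerate(accepted) if i not in accepted_outliers]

# Step 3: rejected applicants that look atypical for the rejected group are
# candidates to be added back as potentially creditworthy.
rejected_outliers = zscore_outliers(rejected)
candidates = [rejected[i] for i in rejected_outliers]
```

In this toy setup the cleaned accepted samples plus the recovered candidates would form the training set for a classifier. With real data, a dedicated outlier detector and a proper validation split would replace the z-score rule and the hand-built lists.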

Suggested Citation

  • Dong-Her Shih & Ting-Wei Wu & Po-Yuan Shih & Nai-An Lu & Ming-Hung Shih, 2022. "A Framework of Global Credit-Scoring Modeling Using Outlier Detection and Machine Learning in a P2P Lending Platform," Mathematics, MDPI, vol. 10(13), pages 1-13, June.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:13:p:2282-:d:851596

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/13/2282/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/13/2282/
    Download Restriction: no

    References listed on IDEAS

    1. Thomas B. Astebro & G. Chen, 2001. "The Economic Value of Reject Inference in Credit Scoring," Post-Print hal-00654597, HAL.
    2. Larissa Batrancea & Mircea Iosif Rus & Ema Speranta Masca & Ioan Dan Morar, 2021. "Fiscal Pressure as a Trigger of Financial Performance for the Energy Industry: An Empirical Investigation across a 16-Year Period," Energies, MDPI, vol. 14(13), pages 1-17, June.
    3. Bücker, Michael & van Kampen, Maarten & Krämer, Walter, 2013. "Reject inference in consumer credit scoring with nonignorable missing data," Journal of Banking & Finance, Elsevier, vol. 37(3), pages 1040-1045.
    4. Larissa Batrancea, 2021. "The Influence of Liquidity and Solvency on Performance within the Healthcare Industry: Evidence from Publicly Listed Companies," Mathematics, MDPI, vol. 9(18), pages 1-15, September.
    5. Larissa Batrancea, 2021. "An Econometric Approach Regarding the Impact of Fiscal Pressure on Equilibrium: Evidence from Electricity, Gas and Oil Companies Listed on the New York Stock Exchange," Mathematics, MDPI, vol. 9(6), pages 1-22, March.
    6. Trivedi, Shrawan Kumar, 2020. "A study on credit scoring modeling with different feature selection and machine learning approaches," Technology in Society, Elsevier, vol. 63(C).
    7. J Banasik & J Crook, 2010. "Reject inference in survival analysis by augmentation," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(3), pages 473-485, March.
    8. Crook, Jonathan N. & Edelman, David B. & Thomas, Lyn C., 2007. "Recent developments in consumer credit risk assessment," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1447-1465, December.
    9. Crook, Jonathan & Banasik, John, 2004. "Does reject inference really improve the performance of application scoring models?," Journal of Banking & Finance, Elsevier, vol. 28(4), pages 857-874, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Monir El Annas & Badreddine Benyacoub & Mohamed Ouzineb, 2023. "Semi-supervised adapted HMMs for P2P credit scoring systems with reject inference," Computational Statistics, Springer, vol. 38(1), pages 149-169, March.
    2. Rogelio A. Mancisidor & Michael Kampffmeyer & Kjersti Aas & Robert Jenssen, 2019. "Deep Generative Models for Reject Inference in Credit Scoring," Papers 1904.11376, arXiv.org, revised Sep 2021.
    3. Yasmeen Idilbi-Bayaa & Mahmoud Qadan, 2022. "Tell Me Why I Do Not Like Mondays," Mathematics, MDPI, vol. 10(11), pages 1-22, May.
    4. Andre Amaral & Taysir E. Dyhoum & Hussein A. Abdou & Hassan M. Aljohani, 2022. "Modeling for the Relationship between Monetary Policy and GDP in the USA Using Statistical Methods," Mathematics, MDPI, vol. 10(21), pages 1-20, November.
    5. Darya Dancaková & Jakub Sopko & Jozef Glova & Alena Andrejovská, 2022. "The Impact of Intangible Assets on the Market Value of Companies: Cross-Sector Evidence," Mathematics, MDPI, vol. 10(20), pages 1-14, October.
    6. Zhiyong Li & Xinyi Hu & Ke Li & Fanyin Zhou & Feng Shen, 2020. "Inferring the outcomes of rejected loans: an application of semisupervised clustering," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(2), pages 631-654, February.
    7. Peter Brusov & Tatiana Filatova & Natali Orekhova, 2023. "Influence of Method and Frequency of Profit Tax Payments on Company Financial Indicators," Springer Books, in: The Brusov–Filatova–Orekhova Theory of Capital Structure, chapter 0, pages 241-264, Springer.
    8. Michael Bucker & Gero Szepannek & Alicja Gosiewska & Przemyslaw Biecek, 2020. "Transparency, Auditability and eXplainability of Machine Learning Models in Credit Scoring," Papers 2009.13384, arXiv.org.
    9. Jingjing Liu & Jing Wang & Tianlin Zhai & Zehui Li, 2022. "The Response of Ecologically Functional Land to Changes in Urban Economic Growth and Transportation Construction in China," IJERPH, MDPI, vol. 19(21), pages 1-17, November.
    10. Charitou, Andreas & Dionysiou, Dionysia & Lambertides, Neophytos & Trigeorgis, Lenos, 2013. "Alternative bankruptcy prediction models using option-pricing theory," Journal of Banking & Finance, Elsevier, vol. 37(7), pages 2329-2341.
    11. Ha-Thu Nguyen, 2016. "Reject inference in application scorecards: evidence from France," EconomiX Working Papers 2016-10, University of Paris Nanterre, EconomiX.
    12. Ha Thu Nguyen, 2016. "Reject inference in application scorecards: evidence from France," Working Papers hal-04141601, HAL.
    13. Hussein A. Abdou & John Pointon, 2011. "Credit Scoring, Statistical Techniques And Evaluation Criteria: A Review Of The Literature," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 18(2-3), pages 59-88, April.
    14. Peter Brusov & Tatiana Filatova & Natali Orekhova, 2023. "The Generalization of the Brusov–Filatova–Orekhova Theory for the Case of Payments of Tax on Profit with Arbitrary Frequency," Springer Books, in: The Brusov–Filatova–Orekhova Theory of Capital Structure, chapter 0, pages 217-239, Springer.
    15. Mehmet Ali Balcı & Larissa M. Batrancea & Ömer Akgüller & Anca Nichita, 2022. "Coarse Graining on Financial Correlation Networks," Mathematics, MDPI, vol. 10(12), pages 1-16, June.
    16. Peter Brusov & Tatiana Filatova & Natali Orekhova, 2023. "Benefits of Advance Payments of Tax on Profit: Consideration Within Brusov–Filatova–Orekhova (BFO) Theory," Springer Books, in: The Brusov–Filatova–Orekhova Theory of Capital Structure, chapter 0, pages 205-216, Springer.
    17. Qiang Liu & Yingtao Luo & Shu Wu & Zhen Zhang & Xiangnan Yue & Hong Jin & Liang Wang, 2022. "RMT-Net: Reject-aware Multi-Task Network for Modeling Missing-not-at-random Data in Financial Credit Scoring," Papers 2206.00568, arXiv.org.
    18. Maria Čuljak & Josip Arnerić & Ante Žigman, 2022. "Is Jump Robust Two Times Scaled Estimator Superior among Realized Volatility Competitors?," Mathematics, MDPI, vol. 10(12), pages 1-11, June.
    19. Emilia Herman & Kinga-Emese Zsido, 2023. "The Financial Sustainability of Retail Food SMEs Based on Financial Equilibrium and Financial Performance," Mathematics, MDPI, vol. 11(15), pages 1-26, August.
    20. Weidong Guo & Zach Zhizhong Zhou, 2022. "A comparative study of combining tree‐based feature selection methods and classifiers in personal loan default prediction," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(6), pages 1248-1313, September.
