IDEAS home Printed from https://ideas.repec.org/p/cdl/ucscec/qt3v33k65c.html
   My bibliography  Save this paper

A novel method for credit scoring based on feature transformation and ensemble model

Author

Listed:
  • Li, Hongxiang
  • Feng, Ao
  • Lin, Bin
  • Su, Houcheng
  • Liu, Zixi
  • Duan, Xuliang
  • Pu, Haibo
  • Wang, Yifei

Abstract

Credit scoring is a very critical task for banks and other financial institutions, and it has become an important evaluation metric to distinguish potential defaulting users. In this paper, we propose a credit score prediction method based on feature transformation and ensemble model, which is essentially a cascade approach. The feature transformation process consisting of boosting trees (BT) and auto-encoders (AE) is employed to replace manual feature engineering and to solve the data imbalance problem. For the classification process, this paper designs a heterogeneous ensemble model by weighting the factorization machine (FM) and deep neural networks (DNN), which can efficiently extract low-order intersections and high-order intersections. Comprehensive experiments were conducted on two standard datasets and the results demonstrate that the proposed approach outperforms existing credit scoring models in accuracy.

Suggested Citation

  • Li, Hongxiang & Feng, Ao & Lin, Bin & Su, Houcheng & Liu, Zixi & Duan, Xuliang & Pu, Haibo & Wang, Yifei, 2021. "A novel method for credit scoring based on feature transformation and ensemble model," Santa Cruz Department of Economics, Working Paper Series qt3v33k65c, Department of Economics, UC Santa Cruz.
  • Handle: RePEc:cdl:ucscec:qt3v33k65c
    as

    Download full text from publisher

    File URL: https://www.escholarship.org/uc/item/3v33k65c.pdf;origin=repeccitec
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Caruso, G. & Gattone, S.A. & Fortuna, F. & Di Battista, T., 2021. "Cluster Analysis for mixed data: An application to credit risk evaluation," Socio-Economic Planning Sciences, Elsevier, vol. 73(C).
    2. Fahmida E. Moula & Chi Guotai & Mohammad Zoynul Abedin, 2017. "Credit default prediction modeling: an application of support vector machine," Risk Management, Palgrave Macmillan, vol. 19(2), pages 158-187, May.
    3. Lang, Jan Hannes & Peltonen, Tuomas A. & Sarlin, Peter, 2018. "A framework for early-warning modeling with an application to banks," Working Paper Series 2182, European Central Bank.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Abedin, Mohammad Zoynul & Hajek, Petr & Sharif, Taimur & Satu, Md. Shahriare & Khan, Md. Imran, 2023. "Modelling bank customer behaviour using feature engineering and classification techniques," Research in International Business and Finance, Elsevier, vol. 65(C).
    2. Sun, Weixin & Zhang, Xuantao & Li, Minghao & Wang, Yong, 2023. "Interpretable high-stakes decision support system for credit default forecasting," Technological Forecasting and Social Change, Elsevier, vol. 196(C).
    3. Mr. Jorge A Chan-Lau, 2020. "UnFEAR: Unsupervised Feature Extraction Clustering with an Application to Crisis Regimes Classification," IMF Working Papers 2020/262, International Monetary Fund.
    4. Smith, Jonathan Acosta & Grill, Michael & Lang, Jan Hannes, 2017. "The leverage ratio, risk-taking and bank stability," Working Paper Series 2079, European Central Bank.
    5. Tarkocin, Coskun & Donduran, Murat, 2024. "Constructing early warning indicators for banks using machine learning models," The North American Journal of Economics and Finance, Elsevier, vol. 69(PB).
    6. Michał Bernardelli & Zbigniew Korzeb & Paweł Niedziółka, 2022. "Does Fossil Fuel Financing Affect Banks’ ESG Ratings?," Energies, MDPI, vol. 15(4), pages 1-19, February.
    7. Fiza Qureshi & Ali M. Kutan & Habib Hussain Khan & Saba Qureshi, 2019. "Equity fund flows, market returns, and market risk: evidence from China," Risk Management, Palgrave Macmillan, vol. 21(1), pages 48-71, March.
    8. Belanes, Amel & Saâdaoui, Foued & Abedin, Mohammad Zoynul, 2024. "Potential diversification benefits: A comparative study of Islamic and conventional stock market indexes," Research in International Business and Finance, Elsevier, vol. 67(PA).
    9. Berthonnaud, Pierre & Cesati, Enrico & Drudi, Maria Ludovica & Jager, Kirsten & Kick, Heinrich & Lanciani, Marcello & Schneider, Ludwig & Schwarz, Claudia & Siakoulis, Vasileios & Vroege, Robert, 2021. "Asset encumbrance in euro area banks: analysing trends, drivers and prediction properties for individual bank crises," Occasional Paper Series 261, European Central Bank.
    10. Bitetto, Alessandro & Cerchiello, Paola & Mertzanis, Charilaos, 2023. "Measuring financial soundness around the world: A machine learning approach," International Review of Financial Analysis, Elsevier, vol. 85(C).
    11. Mohammad Motasem ALrfai & Danilah Binti Salleh & Waeibrorheem Waemustafa, 2022. "Empirical Examination of Credit Risk Determinant of Commercial Banks in Jordan," Risks, MDPI, vol. 10(4), pages 1-11, April.
    12. Shi, Baofeng & Zhao, Xue & Wu, Bi & Dong, Yizhe, 2019. "Credit rating and microfinance lending decisions based on loss given default (LGD)," Finance Research Letters, Elsevier, vol. 30(C), pages 124-129.
    13. Jiaming Liu & Xuemei Zhang & Haitao Xiong, 2024. "Credit risk prediction based on causal machine learning: Bayesian network learning, default inference, and interpretation," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(5), pages 1625-1660, August.
    14. Abisola Akinjole & Olamilekan Shobayo & Jumoke Popoola & Obinna Okoyeigbo & Bayode Ogunleye, 2024. "Ensemble-Based Machine Learning Algorithm for Loan Default Risk Prediction," Mathematics, MDPI, vol. 12(21), pages 1-32, October.
    15. Jacopo Carmassi & Sonja Dobkowitz & Johanne Evrard & Laura Parisi & André F Silva & Michael Wedow, 2020. "Completing the Banking Union with a European deposit insurance scheme: who is afraid of cross-subsidization?," Economic Policy, CEPR, CESifo, Sciences Po;CES;MSH, vol. 35(101), pages 41-95.
    16. Jiang Hua & Tonglin Hao & Liangcai Zeng & Gui Yu, 2021. "YOLOMask, an Instance Segmentation Algorithm Based on Complementary Fusion Network," Mathematics, MDPI, vol. 9(15), pages 1-12, July.
    17. Maisa Cardoso Aniceto & Flavio Barboza & Herbert Kimura, 2020. "Machine learning predictivity applied to consumer creditworthiness," Future Business Journal, Springer, vol. 6(1), pages 1-14, December.
    18. Buckmann, Marcus & Gallego Marquez, Paula & Gimpelewicz, Mariana & Kapadia, Sujit & Rismanchi, Katie, 2023. "The more the merrier? Evidence on the value of multiple requirements in bank regulation," Journal of Banking & Finance, Elsevier, vol. 149(C).
    19. Zhiwu Zhou & Julián Alcalá & Víctor Yepes, 2021. "Optimized Application of Sustainable Development Strategy in International Engineering Project Management," Mathematics, MDPI, vol. 9(14), pages 1-29, July.
    20. Quentin Bro de Comères, 2022. "Predicting European Banks Distress Events: Do Financial Information Producers Matter?," Working Papers hal-03752678, HAL.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cdl:ucscec:qt3v33k65c. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Lisa Schiff (email available below). General contact details of provider: https://edirc.repec.org/data/ecucsus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.