IDEAS home Printed from https://ideas.repec.org/a/spt/apfiba/v14y2024i4f14_4_5.html
   My bibliography  Save this article

Bank Customer Churn Prediction Using Machine Learning Framework

Author

Listed:
  • Rasha Ashraf

Abstract

Using real customer data from a large community bank in the South of the US, this paper analyzes the customer churn prediction problem by constructing and comparing ten machine learning classification models with five sample techniques. Our results show that Random Forest, XG Boost, AdaBoost, and Bagging Meta classifiers dominate others in terms of overall accuracy, F-score, and AUC curve for the test observations. For the four classifiers, the overall accuracy ranges from 87% to 96% across five different sampling methods explored, while the AUC values range between 0.9 to 0.93. Considering overall accuracy and F-Score, AdaBoost with original and MTDF sampling technique dominates others; however, considering the AUC measure, XG Boost and Random Forest perform similarly to AdaBoost, which slightly dominate Bagging Meta across all sampling techniques; although the performance measures for these four classifiers are comparable across all sampling techniques. The paper further presents important features of customer churn behavior as predicted by the model. The diagnostic analysis also provides an insightful comparison between churned and non-churned customers. Â JEL classification numbers: C0, C5, C8, G21.

Suggested Citation

  • Rasha Ashraf, 2024. "Bank Customer Churn Prediction Using Machine Learning Framework," Journal of Applied Finance & Banking, SCIENPRESS Ltd, vol. 14(4), pages 1-5.
  • Handle: RePEc:spt:apfiba:v:14:y:2024:i:4:f:14_4_5
    as

    Download full text from publisher

    File URL: http://www.scienpress.com/Upload/JAFB%2fVol%2014_4_5.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Buckinx, Wouter & Van den Poel, Dirk, 2005. "Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting," European Journal of Operational Research, Elsevier, vol. 164(1), pages 252-268, July.
    2. Román Salmerón Gómez & Ainara Rodríguez Sánchez & Catalina García García & José García Pérez, 2020. "The VIF and MSE in Raise Regression," Mathematics, MDPI, vol. 8(4), pages 1-28, April.
    3. Erel, Isil & Liebersohn, Jack, 2020. "Does FinTech Substitute for Banks? Evidence from the Paycheck Protection Program," Working Paper Series 2020-16, Ohio State University, Charles A. Dice Center for Research in Financial Economics.
    4. Manasa Gopal & Philipp Schnabl, 2022. "The Rise of Finance Companies and FinTech Lenders in Small Business Lending," The Review of Financial Studies, Society for Financial Studies, vol. 35(11), pages 4859-4901.
    5. Lee, Sauchi Stephen, 2000. "Noisy replication in skewed binary classification," Computational Statistics & Data Analysis, Elsevier, vol. 34(2), pages 165-191, August.
    6. Dudyala Anil Kumar & V. Ravi, 2008. "Predicting credit card customer churn in banks using data mining," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 1(1), pages 4-28.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ahnert, Toni & Doerr, Sebastian & Pierri, Nicola & Timmer, Yannick, 2022. "Information Technology in Banking and Entrepreneurship," CEPR Discussion Papers 17335, C.E.P.R. Discussion Papers.
    2. Paul Beaumont & Huan Tang & Éric Vansteenberghe, 2024. "Collateral Effects: The Role of FinTech in Small Business Lending [Effets collatéraux : le rôle des Fintechs dans le financement des petites et moyennes entreprises]," Débats économiques et financiers 42, Banque de France.
    3. Gattermann-Itschert, Theresa & Thonemann, Ulrich W., 2021. "How training on multiple time slices improves performance in churn prediction," European Journal of Operational Research, Elsevier, vol. 295(2), pages 664-674.
    4. Vera Miguéis & Dirk Poel & Ana Camanho & João Falcão e Cunha, 2012. "Predicting partial customer churn using Markov for discrimination for modeling first purchase sequences," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 6(4), pages 337-353, December.
    5. Abbas Keramati & Hajar Ghaneei & Seyed Mohammad Mirmohammadi, 2016. "Developing a prediction model for customer churn from electronic banking services using data mining," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 2(1), pages 1-13, December.
    6. Alperovych, Yan & Divakaruni, Anantha & Le Grand, François, 2022. "FinTech Lending under Austerity," SocArXiv atsk9, Center for Open Science.
    7. Paul Beaumont & Huan Tang & Eric Vansteenberghe, "undated". "Effets collatéraux : le rôle des Fintechs dans le financement des petites et moyennes entreprises," Analyse et synthèse 157, Banque de France.
    8. Ballings, Michel & Van den Poel, Dirk, 2015. "CRM in social media: Predicting increases in Facebook usage frequency," European Journal of Operational Research, Elsevier, vol. 244(1), pages 248-260.
    9. Divakaruni, Anantha & Alperovych, Yan & Le Grand, François, 2022. "FinTech Lending under Austerity," OSF Preprints m4tps, Center for Open Science.
    10. Verbeke, Wouter & Dejaeger, Karel & Martens, David & Hur, Joon & Baesens, Bart, 2012. "New insights into churn prediction in the telecommunication sector: A profit driven data mining approach," European Journal of Operational Research, Elsevier, vol. 218(1), pages 211-229.
    11. Risselada, Hans & Verhoef, Peter C. & Bijmolt, Tammo H.A., 2010. "Staying Power of Churn Prediction Models," Journal of Interactive Marketing, Elsevier, vol. 24(3), pages 198-208.
    12. Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.
    13. Iñaki Aldasoro & Sebastian Doerr & Haonan Zhou, 2023. "Non-bank lending during crises," BIS Working Papers 1074, Bank for International Settlements.
    14. D. Van den Poel, 2003. "Predicting Mail-Order Repeat Buying. Which Variables Matter?," Review of Business and Economic Literature, KU Leuven, Faculty of Economics and Business (FEB), Review of Business and Economic Literature, vol. 0(3), pages 371-404.
    15. Mihai ȚICHINDELEAN & Claudia OGREAN & Mihaela HERCIU, 2024. "Do Loyal Customers Buy Differently? Examining Customers’ Loyalty In A Self-Service Setting," Studies in Business and Economics, Lucian Blaga University of Sibiu, Faculty of Economic Sciences, vol. 19(1), pages 350-367, April.
    16. Berger, Allen N. & Boot, Arnoud W.A., 2024. "Financial intermediation services and competition analyses: Review and paths forward for improvement," Journal of Financial Intermediation, Elsevier, vol. 57(C).
    17. Greg Buchak & Gregor Matvos & Tomasz Piskorski & Amit Seru, 2024. "Aggregate Lending and Modern Financial Intermediation: Why Bank Balance Sheet Models Are Miscalibrated," NBER Macroeconomics Annual, University of Chicago Press, vol. 38(1), pages 239-287.
    18. Zhao, Meiyu & Zhang, Bo, 2023. "Credit corruption, financial constraint and corporate innovation: Evidence on China," Finance Research Letters, Elsevier, vol. 53(C).
    19. Hasan, Iftekhar & Li, Xiang & Takalo, Tuomas, 2023. "Technological innovation and the bank lending channel of monetary policy transmission," IWH Discussion Papers 14/2021, Halle Institute for Economic Research (IWH), revised 2023.
    20. Desi Volker, 2022. "The Paycheck Protection Program Liquidity Facility," Economic Policy Review, Federal Reserve Bank of New York, vol. 28(1), July.

    More about this item

    Keywords

    Machine learning; Big data; Sampling techniques; Customer churn; Customer retention; Financial services; Community bank.;
    All these keywords.

    JEL classification:

    • C0 - Mathematical and Quantitative Methods - - General
    • C5 - Mathematical and Quantitative Methods - - Econometric Modeling
    • C8 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs
    • G21 - Financial Economics - - Financial Institutions and Services - - - Banks; Other Depository Institutions; Micro Finance Institutions; Mortgages

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spt:apfiba:v:14:y:2024:i:4:f:14_4_5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Eleftherios Spyromitros-Xioufis (email available below). General contact details of provider: http://www.scienpress.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.