Printed from https://ideas.repec.org/a/eee/tefoso/v198y2024ics0040162523006303.html

Extreme gradient boosting trees with efficient Bayesian optimization for profit-driven customer churn prediction

Author

Listed:
  • Liu, Zhenkun
  • Jiang, Ping
  • De Bock, Koen W.
  • Wang, Jianzhou
  • Zhang, Lifang
  • Niu, Xinsong

Abstract

Customer retention campaigns increasingly rely on predictive analytics to identify potential churners in a customer base. Traditionally, customer churn prediction was dependent on binary classifiers, which are often optimized for accuracy-based performance measures. However, there is a growing consensus that this approach may not always fulfill the critical business objective of profit maximization, as it overlooks the costs of misclassification and the benefits of accurate classification. This study adopts extreme gradient boosting trees to predict profit-driven customer churn. The class weights and other hyperparameters of these trees are optimized using Bayesian methods based on the profit maximization criterion. Empirical analyses are conducted using real datasets obtained from service providers in multiple markets. The empirical results demonstrate that the proposed model yields significantly higher profits than the benchmark models. Bayesian optimization and adjustment of class weights contributed to enhanced model profitability. Furthermore, when optimizing multiple hyperparameters, the computational cost of model optimization is significantly reduced compared with an exhaustive grid search. Additionally, we demonstrate the robustness of the proposed model through a sensitivity analysis employing Bayesian optimization. Using the proposed model, marketing managers can design targeted marketing plans to retain customer groups with a higher likelihood of churning.
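To make the contrast between accuracy-based and profit-based selection concrete, here is a minimal sketch in plain Python. It is not the authors' method: the abstract describes tuning class weights and other hyperparameters of gradient boosted trees with Bayesian optimization, whereas this example only scans a decision threshold exhaustively over a handful of made-up churn scores. The `CampaignEconomics` figures, the function names, and the toy `scores` and `labels` are all assumptions introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CampaignEconomics:
    """Hypothetical per-customer figures; none are taken from the paper."""
    retained_value: float = 200.0  # value kept when a churner accepts the offer
    offer_cost: float = 10.0       # cost of the incentive when it is accepted
    contact_cost: float = 1.0      # cost of contacting any targeted customer
    accept_rate: float = 0.25      # share of targeted churners who accept and stay


def campaign_profit(scores, labels, threshold, econ=CampaignEconomics()):
    """Total profit of targeting every customer whose churn score >= threshold."""
    profit = 0.0
    for score, is_churner in zip(scores, labels):
        if score < threshold:
            continue  # not targeted: no cost and no benefit
        profit -= econ.contact_cost
        if is_churner:
            # Expected gain from a correctly targeted churner.
            profit += econ.accept_rate * (econ.retained_value - econ.offer_cost)
        else:
            # Assumption: a non-churner always takes the offer, which is pure cost.
            profit -= econ.offer_cost
    return profit


def accuracy(scores, labels, threshold):
    """Share of customers whose thresholded prediction matches the label."""
    hits = sum((s >= threshold) == bool(y) for s, y in zip(scores, labels))
    return hits / len(labels)


def best_threshold(scores, labels, objective):
    """Exhaustive scan over candidate thresholds (a stand-in for a real optimizer)."""
    candidates = sorted(set(scores)) + [float("inf")]  # inf means "target nobody"
    return max(candidates, key=lambda t: objective(scores, labels, t))


# Toy data: model scores and true churn labels (1 = churner), invented for this sketch.
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
labels = [1, 0, 1, 0, 0, 1, 0, 0, 0, 0]

t_acc = best_threshold(scores, labels, accuracy)
t_profit = best_threshold(scores, labels, campaign_profit)
print("accuracy-optimal threshold:", t_acc,
      "profit:", campaign_profit(scores, labels, t_acc))
print("profit-optimal threshold:", t_profit,
      "profit:", campaign_profit(scores, labels, t_profit))
```

On this toy data the two criteria pick different thresholds, because missing a churner costs far more than contacting a loyal customer under the assumed economics. In a setting like the one the abstract describes, the same kind of profit function would serve as the objective passed to the optimizer in place of the scan used here.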

Suggested Citation

  • Liu, Zhenkun & Jiang, Ping & De Bock, Koen W. & Wang, Jianzhou & Zhang, Lifang & Niu, Xinsong, 2024. "Extreme gradient boosting trees with efficient Bayesian optimization for profit-driven customer churn prediction," Technological Forecasting and Social Change, Elsevier, vol. 198(C).
  • Handle: RePEc:eee:tefoso:v:198:y:2024:i:c:s0040162523006303
    DOI: 10.1016/j.techfore.2023.122945

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0040162523006303
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.techfore.2023.122945?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    References listed on IDEAS

    1. Jabeur, Sami Ben & Gharib, Cheima & Mefteh-Wali, Salma & Arfi, Wissal Ben, 2021. "CatBoost model and artificial intelligence techniques for corporate failure prediction," Technological Forecasting and Social Change, Elsevier, vol. 166(C).
    2. Abbas Keramati & Hajar Ghaneei & Seyed Mohammad Mirmohammadi, 2016. "Developing a prediction model for customer churn from electronic banking services using data mining," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 2(1), pages 1-13, December.
    3. Verbraken, Thomas & Bravo, Cristián & Weber, Richard & Baesens, Bart, 2014. "Development and application of consumer credit scoring models using profit-based classification measures," European Journal of Operational Research, Elsevier, vol. 238(2), pages 505-513.
    4. Liu, Xiaolei & Lin, Zi, 2021. "Impact of Covid-19 pandemic on electricity demand in the UK based on multivariate time series forecasting with Bidirectional Long Short Term Memory," Energy, Elsevier, vol. 227(C).
    5. Amin, Adnan & Shah, Babar & Khattak, Asad Masood & Lopes Moreira, Fernando Joaquim & Ali, Gohar & Rocha, Alvaro & Anwar, Sajid, 2019. "Cross-company customer churn prediction in telecommunication: A comparison of data transformation methods," International Journal of Information Management, Elsevier, vol. 46(C), pages 304-319.
    6. Li, Guo-Dong & Masuda, Shiro & Nagai, Masatake, 2014. "Predicting the subscribers of fixed-line and cellular phone in Japan by a novel prediction model," Technological Forecasting and Social Change, Elsevier, vol. 81(C), pages 321-330.
    7. Alfiero, Simona & Battisti, Enrico & Hadjielias, Elias, 2022. "Black box technology, usage-based insurance, and prediction of purchase behavior: Evidence from the auto insurance sector," Technological Forecasting and Social Change, Elsevier, vol. 183(C).
    8. Chen, Zhen-Yu & Fan, Zhi-Ping & Sun, Minghe, 2012. "A hierarchical multiple kernel support vector machine for customer churn prediction using longitudinal behavioral data," European Journal of Operational Research, Elsevier, vol. 223(2), pages 461-472.
    9. Kristof Coussement & Stefan Lessmann & Geert Verstraeten, 2017. "A comparative analysis of data preparation algorithms for customer churn prediction: A case study in the telecommunication industry," Post-Print hal-01745261, HAL.
    10. Ouenniche, Jamal & Pérez-Gladish, Blanca & Bouslah, Kais, 2018. "An out-of-sample framework for TOPSIS-based classifiers with application in bankruptcy prediction," Technological Forecasting and Social Change, Elsevier, vol. 131(C), pages 111-116.
    11. K.W. de Bock & D. van den Poel, 2011. "An empirical evaluation of rotation-based ensemble classifiers for customer churn prediction," Post-Print hal-00800160, HAL.
    12. Trizoglou, Pavlos & Liu, Xiaolei & Lin, Zi, 2021. "Fault detection by an ensemble framework of Extreme Gradient Boosting (XGBoost) in the operation of offshore wind turbines," Renewable Energy, Elsevier, vol. 179(C), pages 945-962.
    13. Gattermann-Itschert, Theresa & Thonemann, Ulrich W., 2021. "How training on multiple time slices improves performance in churn prediction," European Journal of Operational Research, Elsevier, vol. 295(2), pages 664-674.
    14. Shirazi, Farid & Mohammadi, Mahbobeh, 2019. "A big data analytics model for customer churn prediction in the retiree segment," International Journal of Information Management, Elsevier, vol. 48(C), pages 238-253.
    15. Li, Jing-Ping & Mirza, Nawazish & Rahat, Birjees & Xiong, Deping, 2020. "Machine learning and credit ratings prediction in the age of fourth industrial revolution," Technological Forecasting and Social Change, Elsevier, vol. 161(C).
    16. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W. & Lessmann, Stefan, 2020. "Incorporating textual information in customer churn prediction models based on a convolutional neural network," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1563-1578.
    17. Van den Poel, Dirk & Lariviere, Bart, 2004. "Customer attrition analysis for financial services using proportional hazard models," European Journal of Operational Research, Elsevier, vol. 157(1), pages 196-217, August.
    18. Ebru Pekel Ozmen & Tuncay Ozcan, 2022. "A novel deep learning model based on convolutional neural networks for employee churn prediction," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(3), pages 539-550, April.
    19. Jiang, Cuiqing & Zhou, Yiru & Chen, Bo, 2023. "Mining semantic features in patent text for financial distress prediction," Technological Forecasting and Social Change, Elsevier, vol. 190(C).
    20. Arno de Caigny & Kristof Coussement & Koen W. de Bock, 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," Post-Print hal-01741661, HAL.
    21. Maldonado, Sebastián & López, Julio & Vairetti, Carla, 2020. "Profit-based churn prediction based on Minimax Probability Machines," European Journal of Operational Research, Elsevier, vol. 284(1), pages 273-284.
    22. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W., 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," European Journal of Operational Research, Elsevier, vol. 269(2), pages 760-772.
    23. Gandomi, A. & Zolfaghari, S., 2013. "Profitability of loyalty reward programs: An analytical investigation," Omega, Elsevier, vol. 41(4), pages 797-807.
    24. Höppner, Sebastiaan & Stripling, Eugen & Baesens, Bart & Broucke, Seppe vanden & Verdonck, Tim, 2020. "Profit driven decision trees for churn prediction," European Journal of Operational Research, Elsevier, vol. 284(3), pages 920-933.
    25. Verbeke, Wouter & Dejaeger, Karel & Martens, David & Hur, Joon & Baesens, Bart, 2012. "New insights into churn prediction in the telecommunication sector: A profit driven data mining approach," European Journal of Operational Research, Elsevier, vol. 218(1), pages 211-229.

    Citations

    Cited by:

    1. Liu, Zhenkun & Zhang, Ying & Abedin, Mohammad Zoynul & Wang, Jianzhou & Yang, Hufang & Gao, Yuyang & Chen, Yinghao, 2024. "Profit-driven fusion framework based on bagging and boosting classifiers for potential purchaser prediction," Journal of Retailing and Consumer Services, Elsevier, vol. 79(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liu, Zhenkun & Zhang, Ying & Abedin, Mohammad Zoynul & Wang, Jianzhou & Yang, Hufang & Gao, Yuyang & Chen, Yinghao, 2024. "Profit-driven fusion framework based on bagging and boosting classifiers for potential purchaser prediction," Journal of Retailing and Consumer Services, Elsevier, vol. 79(C).
    2. Lewlisa Saha & Hrudaya Kumar Tripathy & Tarek Gaber & Hatem El-Gohary & El-Sayed M. El-kenawy, 2023. "Deep Churn Prediction Method for Telecommunication Industry," Sustainability, MDPI, vol. 15(5), pages 1-21, March.
    3. Matthias Bogaert & Lex Delaere, 2023. "Ensemble Methods in Customer Churn Prediction: A Comparative Analysis of the State-of-the-Art," Mathematics, MDPI, vol. 11(5), pages 1-28, February.
    4. De Bock, Koen W. & Coussement, Kristof & Caigny, Arno De & Słowiński, Roman & Baesens, Bart & Boute, Robert N. & Choi, Tsan-Ming & Delen, Dursun & Kraus, Mathias & Lessmann, Stefan & Maldonado, Sebast, 2024. "Explainable AI for Operational Research: A defining framework, methods, applications, and a research agenda," European Journal of Operational Research, Elsevier, vol. 317(2), pages 249-272.
    5. Koen W. de Bock & Kristof Coussement & Arno De Caigny & Roman Slowiński & Bart Baesens & Robert N Boute & Tsan-Ming Choi & Dursun Delen & Mathias Kraus & Stefan Lessmann & Sebastián Maldonado & David , 2023. "Explainable AI for Operational Research: A Defining Framework, Methods, Applications, and a Research Agenda," Post-Print hal-04219546, HAL.
    6. Maldonado, Sebastián & Domínguez, Gonzalo & Olaya, Diego & Verbeke, Wouter, 2021. "Profit-driven churn prediction for the mutual fund industry: A multisegment approach," Omega, Elsevier, vol. 100(C).
    7. Arno de Caigny & Kristof Coussement & Koen W. de Bock & Stefan Lessmann, 2019. "Incorporating textual information in customer churn prediction models based on a convolutional neural network," Post-Print hal-02275958, HAL.
    8. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W. & Lessmann, Stefan, 2020. "Incorporating textual information in customer churn prediction models based on a convolutional neural network," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1563-1578.
    9. Gattermann-Itschert, Theresa & Thonemann, Ulrich W., 2021. "How training on multiple time slices improves performance in churn prediction," European Journal of Operational Research, Elsevier, vol. 295(2), pages 664-674.
    10. Schaeffer, Satu Elisa & Rodriguez Sanchez, Sara Veronica, 2020. "Forecasting client retention — A machine-learning approach," Journal of Retailing and Consumer Services, Elsevier, vol. 52(C).
    11. Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.
    12. Arno de Caigny & Kristof Coussement & Koen de Bock, 2020. "Leveraging fine-grained transaction data for customer life event predictions," Post-Print hal-02507998, HAL.
    13. Szeląg, Marcin & Słowiński, Roman, 2024. "Explaining and predicting customer churn by monotonic rules induced from ordinal data," European Journal of Operational Research, Elsevier, vol. 317(2), pages 414-424.
    14. Ebru Pekel Ozmen & Tuncay Ozcan, 2022. "A novel deep learning model based on convolutional neural networks for employee churn prediction," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(3), pages 539-550, April.
    15. Chandrasekhar Valluri & Sudhakar Raju & Vivek H. Patil, 2022. "Customer determinants of used auto loan churn: comparing predictive performance using machine learning techniques," Journal of Marketing Analytics, Palgrave Macmillan, vol. 10(3), pages 279-296, September.
    16. Chen, Claire Y.T. & Sun, Edward W. & Miao, Wanyu & Lin, Yi-Bing, 2024. "Reconciling business analytics with graphically initialized subspace clustering for optimal nonlinear pricing," European Journal of Operational Research, Elsevier, vol. 312(3), pages 1086-1107.
    17. Koen W. de Bock & Arno de Caigny, 2021. "Spline-rule ensemble classifiers with structured sparsity regularization for interpretable customer churn modeling," Post-Print hal-03391564, HAL.
    18. Louis Geiler & Séverine Affeldt & Mohamed Nadif, 2022. "A survey on machine learning methods for churn prediction," Post-Print hal-03824873, HAL.
    19. Höppner, Sebastiaan & Stripling, Eugen & Baesens, Bart & Broucke, Seppe vanden & Verdonck, Tim, 2020. "Profit driven decision trees for churn prediction," European Journal of Operational Research, Elsevier, vol. 284(3), pages 920-933.
    20. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W., 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," European Journal of Operational Research, Elsevier, vol. 269(2), pages 760-772.
