Development of Churn Prediction System for a Marketing Company Using Machine Learning Technique
DOI:
https://doi.org/10.4108/eetismla.9577Keywords:
Churn Prediction, Machine Learning, Ensemble Learning, Adaboost, Gradient Boosting, Extreme Gradient BoostAbstract
INTRODUCTION: In today's competitive marketing landscape, customer churn prediction is vital for marketing organizations to identify patterns, factors, and indicators contributing to customer attrition. This paper focuses on developing a customer churn prediction system using machine learning algorithms.
OBJECTIVES: This paper employed an e-commerce dataset, obtained from the Kaggle repository, and was preprocessed. Important features were selected from the preproccessed dataset before models’ development. The parameters of AdaBoost, Gradient Boosting (GB), and Extreme Gradient Boosting (XGB) were optimized to improve their performance.
METHODS: Techniques such as label encoder, mean imputation, and synthetic minority over-sampling technique (SMOTE) were applied during data preprocessing stage. Ensemble learning algorithms, namely AdaBoost, GB, and XGB were used to develop the model while random search was employed for parameter optimization. Accuracy, precision, recall, and F1-score metrics were used to evaluate the models’ performance.
RESULTS: The results of the models with 15 important selected features before parameter tuning yielded the following scores: AdaBoost attained 87% accuracy, 77% precision, 81% recall, and an 79% F1-score. Gradient boosting outperformed AdaBoost with 89% accuracy, 80% precision, 82% recall, and an 81% F1-score. XGB outperformed the two algorithms (AdaBoost and GB), achieving 97% accuracy, 96% precision, 94% recall, and 95% F1-score. Notably, the Random Search significantly improves Gradient boosting's performance, increasing accuracy from 89% to 97%, precision from 80% to 97%, recall from 82% to 93%, and F1-score from 81% to 95%., making it comparable to XGB results. SHAP analysis reveals that the "Complain" feature was a consistent and key positive driver of predicted churn across all models, implying that customers who register complaints are significantly more likely to churn than those who do not. Additionally, the ‘tenure’ feature which has a strong negative impact on the prediction across the three models implies that longer tenure logically means less likely to churn, which makes the model's behaviour intuitive and trustworthy.
CONCLUSION: The results demonstrate the effectiveness of the system in identifying at-risk customers, enabling businesses to proactively retain customers and reduce churn rates. The findings of this paper showcase the critical importance of effective complaint management and rapid response strategies in customer retention efforts, and suggest that fostering long-term relationships and increasing customer loyalty can be highly effective in reducing churn.
Downloads
References
[1] Huang, X., Wu, Y., Zhang, L., & Lyu, J. A survey on churn analysis: How and why customers leave. Knowledge-Based Systems. 2020; 188.
[2] Akhmetshin, E., Fayzullaev, N., Klochko, E., Shakhov, D., & Lobanova, V. Intelligent Data Analytics using Hybrid Gradient Optimization Algorithm with Machine Learning Model for Customer Churn Prediction. Fusion: Practice & Applications. 2024; 14(2).
[3] Nasiri, A., & Gharebagh, F. H. A hybrid machine learning approach for customer churn prediction in telecommunication sector. Telecommunication Systems. 2019; 71(3): 399-411.
[4] Karaarslan, H., Baştuğ, M., Şen, C. G., & Işik, E. E. A Comparative Study on Customer Churn Analysis Using Machine Learning and Data Enrichment Techniques. Journal of Soft Computing and Decision Analytics; 2024; 2(1): 225-235.
[5] Sikri, A., Jameel, R., Idrees, S. M., & Kaur, H. Enhancing customer retention in telecom industry with machine learning driven churn prediction. Scientific Reports. 2024; 14(1): 13097.
[6] Kusnawi, K., Ipmawati, J., Asadulloh, B. P., Aminuddin, A., Abdulloh, F. F., & Rahardi, M. Leveraging various feature selection methods for churn prediction using various machine learning algorithms. JOIV: International Journal on Informatics Visualization. 2024; 8(2): 897-905.
[7] Prabadevi, B., Shalini, R., & Kavitha, B. R. Customer churning analysis using machine learning algorithms. International Journal of Intelligent Networks. 2023; 4: 145-154.
[8] Lukita, C., Bakti, L. D., Rusilowati, U., Sutarman, A., & Rahardja, U. Predictive and analytics using data mining and machine learning for customer churn prediction. Journal of Applied Data Sciences. 2023; 4(4): 454-465.
[9] Yamini, B., Ramana, K. V., Nalini, M., Devi, D. C., Maheswari, B., & Siva, S. R. Customer churn prediction model in enterprises using machine learning. International Journal of Advanced Technology and Engineering Exploration. 2024; 11(110): 94.
[10] Matuszelański, K., & Kopczewska, K. Customer churn in retail e-commerce business: Spatial and machine learning approach. Journal of Theoretical and Applied Electronic Commerce Research. 2022; 17(1): 165-198.
[11] Wagh, S. K., Andhale, A. A., Wagh, K. S., Pansare, J. R., Ambadekar, S. P., & Gawande, S. H. Customer churn prediction in telecom sector using machine learning techniques. Results in Control and Optimization. 2024; 14.
[12] Khare, P., & Arora, S. Predicting customer churn in saas products using machine learning. International Research Journal of Engineering and Technology (IRJET). 2024; 11(5): 754-765.
[13] Seymen, O. F., Ölmez, E., Doğan, O., Er, O., & Hiziroğlu, K. Customer churn prediction using ordinary artificial neural network and convolutional neural network algorithms: A comparative performance assessment. Gazi University Journal of Science. 2023; 36(2): 720-733.
[14] Pondel, M., Wuczyński, M., Gryncewicz, W., Łysik, Ł., Hernes, M., Rot, A., & Kozina, A. Deep learning for customer churn prediction in e-commerce decision support. In Business Information Systems. 2021; 3-12.
[15] Sam, G., Asuquo, P., & Stephen, B. Customer churn prediction using machine learning models. J Eng Res Rep. 2024; 26(2): 181-193.
[16] Li, A., Yang, T., Zhan, X., Shi, Y., & Li, H. Utilizing Data Science and AI for Customer Churn Prediction in Marketing. Journal of Theory and Practice of Engineering Science.2024; 4(05): 72-79.
[17] Gurung, N., Hasan, M. R., Gazi, M. S., & Chowdhury, F. R. AI-Based Customer Churn Prediction Model for Business Markets in the USA: Exploring the Use of AI and Machine Learning Technologies in Preventing Customer Churn. Journal of Computer Science and Technology Studies. 2024; 6(2): 19-2.
[18] He, C., & Ding, C. H. A novel classification algorithm for customer churn prediction based on hybrid Ensemble-Fusion model. Scientific Reports. 2024; 14(1): 20179.
[19] Maduna, M., Telukdarie, A., Munien, I., Onkonkwo, U., & Vermeulen, A. Smart Customer Churn Management System Using Machine Learning. Procedia Computer Science. 2024; 237: 552-558.
[20] Vo, N. N., Liu, S., Li, X., & Xu, G. Leveraging unstructured call log data for customer churn prediction. Knowledge-Based Systems. 2021; 212: 106586.
[21] Hoppner, S. Stripling, E. Baesens, B. Broucke, S. & Verdonck, T. Profit driven decison trees for churn prediction. European Journal of Operational Research. 2020; 284(3): 920 – 933.
[22] Sook-Ling, L., Mustafa, N., & Abdul-Razak, S.F. Customer churn prediction for telecommunication industry: A Malaysian Case Study. F1000Research. 2021; 10(1): 1-13, doi: 10.12688/f1000research.73597.1.
[23] Tayarani, M. Applications of artificial intelligence in battling against covid-19: A literature review, Chaos, Solitons & Fractals. 2021; 142(1): 110338-110345, doi: https://doi.org/10.1016/j.chaos.2020.110338.
[24] Khan, S., Iqbal, K., Faizullah, S., Fahad, M., Ali, J., & Ahmed, W. Clustering based privacy preserving of big data using fuzzification and anonymization operation. arXiv preprint arXiv. 2020; 2001.01491.
[25] Chaabouni, N., Mosbah, M., Zemmari, A., Sauvignac, C., & Faruki, P. Network Intrusion Detection for IoT Security Based on Learning Techniques. IEEE Communications Surveys Tutorials. 2019; 21(3): 2671–2701
[26] Al-Najjar, D., Al-Rousan, N., & Al-Najjar, H. Machine learning to develop credit card customer churn prediction. Journal of Theoretical and applied electronic commerce research. 2022; 17(4): 1529-1542.
[27] Alotaibi, M. Z., & Haq, M. A. Customer Churn Prediction for Telecommunication Companies using Machine Learning and Ensemble Methods. Engineering, Technology & Applied Science Research. 2024; 14(3): 14572-14578.
[28] Ahmad, N., Awan, M. J., Nobanee, H., Zain, A. M., Naseem, A., & Mahmoud, A. Customer personality analysis for churn prediction using hybrid ensemble models and class balancing techniques. IEEE Access. 2023; 12: 1865-1879.
[29] Dhand, G., Rao, M., Chaudhary, P., & Sheoran, K. (2025). A secure routing and malicious node detection in mobile Ad hoc network using trust value evaluation with improved XGBoost mechanism. Journal of Network and Computer Applications, 235, 104093.
[30] Sadar, U., Agarwal, P., Parveen, S., Dhand, G., & Sheoran, K. Heart disease prediction using machine learning classifiers with various balancing techniques. Proceedings on Engineering Sciences, 2024; 6(4), 1871–1878. ISSN 2620-2832.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Ismaila Yusuf, Sulaiman Olaniyi Abdulsalam, Temitope Oyewumi Jimoh, Ismail Akuji

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.