CUSTOMER CHURN PREDICTION USING THE RANDOM FOREST ALGORITHM
Abstract
Customer churn prediction plays a vital role in modern business because it informs the strategic and operational decisions that shape customer loyalty to a service. Retaining customers is generally more profitable than acquiring new ones: long-term customers tend to generate higher profits at lower cost, whereas losing customers raises costs and creates the need to attract replacements. Nevertheless, customer churn still occurs frequently and is difficult to anticipate. Left unchecked, it can endanger a company, including those in the banking industry, through lost income, reputational damage, and reduced market share. This research uses Random Forest, a data mining technique, because of its predictive ability and its capacity to handle many variables. The aim is to predict customer churn with the Random Forest method using datasets from Europe, specifically France, Spain, and Germany, so that the banking industry can identify customers at high risk of abandoning its services. Companies can then take appropriate steps to retain these customers, strengthen customer loyalty, and optimize their business performance. The result of this research is a system for predicting future customer churn that achieved an accuracy of 87%, evaluated using a confusion matrix.
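As a minimal sketch of how an accuracy figure is derived from a confusion matrix, the example below counts true and false positives and negatives for a binary churn label and computes accuracy as (TP + TN) / (TP + TN + FP + FN). The function names and the label lists are hypothetical and chosen only for illustration; they are not the study's data, code, or results.

```python
from collections import Counter


def confusion_matrix(y_true, y_pred, positive=1):
    """Count TP, FP, FN, TN for a binary label (positive = customer churned)."""
    counts = Counter()
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive:
            counts["tp" if actual == positive else "fp"] += 1
        else:
            counts["fn" if actual == positive else "tn"] += 1
    return counts["tp"], counts["fp"], counts["fn"], counts["tn"]


def accuracy(tp, fp, fn, tn):
    """Share of all predictions that were correct."""
    total = tp + fp + fn + tn
    if total == 0:
        raise ValueError("confusion matrix is empty")
    return (tp + tn) / total


# Hypothetical labels for ten customers (1 = churned, 0 = stayed).
y_true = [1, 0, 0, 1, 0, 0, 1, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 1, 1, 0, 0, 0]

tp, fp, fn, tn = confusion_matrix(y_true, y_pred)
print(f"TP={tp} FP={fp} FN={fn} TN={tn}")
print(f"accuracy={accuracy(tp, fp, fn, tn):.2f}")
```

In this toy example, 8 of the 10 predictions match the actual labels, so the accuracy is 0.80. The predictions themselves would come from a trained classifier such as a Random Forest.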
DOI: https://doi.org/10.33387/jiko.v7i3.8711