School of Industrial Engineering and Management, International University, Vietnam National University, Ho Chi Minh City, Vietnam.
PLoS One. 2022 May 24;17(5):e0267935. doi: 10.1371/journal.pone.0267935. eCollection 2022.
In this age of fierce competitions, customer retention is one of the most important tasks for many companies. Many previous works proposed models to predict customer churn based on various machine learning techniques. In this study, we proposed an advanced churn prediction model using kernel Support Vector Machines (SVM) algorithm for a telecom company. Baseline SVM models were initially built to find out the most suitable kernel types and will be used to make comparison with other approaches. Dimension reduction strategies such as Sequential Forward Selection (SFS) and Sequential Backward Selection (SBS) were applied to the dataset to find out the most important features. Furthermore, resampling techniques to deal with imbalanced data such as Synthetic Minority Oversampling Technique Tomek Link (SMOTE Tomek) and Synthetic Minority Oversampling Technique ENN (SMOTE ENN) were used on the dataset. Using the above-mentioned techniques, we have obtained better results compared to those obtained from previous works, we achieved an F1-score and accuracy of 99% and 98.9% respectively.
在竞争激烈的时代,客户保留是许多公司最重要的任务之一。许多先前的工作提出了基于各种机器学习技术的客户流失预测模型。在这项研究中,我们提出了一种使用核支持向量机(SVM)算法的先进的客户流失预测模型,用于一家电信公司。最初构建了基线 SVM 模型以找出最合适的核类型,并将其用于与其他方法进行比较。应用了降维策略,如顺序前向选择(SFS)和顺序后向选择(SBS),以从数据集中找到最重要的特征。此外,还在数据集上使用了处理不平衡数据的重采样技术,如合成少数过采样技术 Tomek 链接(SMOTE Tomek)和合成少数过采样技术ENN(SMOTE ENN)。使用上述技术,我们获得了比先前工作更好的结果,我们分别获得了 99%和 98.9%的 F1 分数和准确率。