Ali Khandokar Iftakhar, Muzahidul Islam A K M, Islam Salekul, Shatabda Swakkhar
Department of Computer Science and Engineering, United International University, Plot-2, United City, Badda, Dhaka-1212, Bangladesh.
Heliyon. 2023 Apr 3;9(4):e15163. doi: 10.1016/j.heliyon.2023.e15163. eCollection 2023 Apr.
Early purchase prediction plays a vital role for an e-commerce website. It enables e-shoppers to enlist consumers for product suggestions, offer discount and for many other interventions. Several work has already been done using session log for analyzing customer behavior whether he performs a purchase on the product or not. In most cases, it is difficult to find out and make a list of customers and offer them discount when their session ends. In this paper, we propose a customer's purchase intention prediction model where e-shoppers can detect customer's purpose earlier. First, we apply feature selection technique to select best features. Then the extracted features are fed to train supervised learning models. Several classifiers like support vector machine (SVM), random forest (RF), multilayer perceptron (MLP), decision tree (DT), and XGBoost classifiers have been applied along with oversampling method for balancing the dataset. The experiments were performed on a standard benchmark dataset. Experimental results show that XGBoost classifier with feature selection techniques and oversampling method has the significantly higher area under ROC curve (auROC) score and are under precision-recall curve (auPR) score which are 0.937 and 0.754 respectively. On the other hand accuracy achieved by XGBoost and Decision tree are significantly improved and they are 90.65% and 90.54% respectively. Overall performance of the gradient boosting method is significantly improved compared to other classifiers and state-of-the-art methods. In addition to this, a method for explainable analysis on the problem was outlined.
早期购买预测对电子商务网站起着至关重要的作用。它使电子购物者能够招募消费者以获取产品建议、提供折扣以及进行许多其他干预措施。已经有一些工作利用会话日志来分析客户行为,判断其是否会购买产品。在大多数情况下,当客户会话结束时,很难找出并列出客户名单并向他们提供折扣。在本文中,我们提出了一种客户购买意图预测模型,电子购物者可以更早地检测到客户的意图。首先,我们应用特征选择技术来选择最佳特征。然后将提取的特征输入到训练有监督学习模型中。我们应用了几种分类器,如支持向量机(SVM)、随机森林(RF)、多层感知器(MLP)、决策树(DT)和XGBoost分类器,并结合过采样方法来平衡数据集。实验是在一个标准基准数据集上进行的。实验结果表明,采用特征选择技术和过采样方法的XGBoost分类器在ROC曲线下面积(auROC)得分和精确率-召回率曲线下面积(auPR)得分分别显著更高,分别为0.937和0.754。另一方面,XGBoost和决策树所达到的准确率也显著提高,分别为90.65%和90.54%。与其他分类器和现有技术方法相比,梯度提升方法的整体性能有显著提高。除此之外,还概述了一种针对该问题的可解释分析方法。