A gradient boosting classifier for purchase intention prediction of online shoppers.

Author information

Ali Khandokar Iftakhar, Muzahidul Islam A K M, Islam Salekul, Shatabda Swakkhar

Affiliations

Department of Computer Science and Engineering, United International University, Plot-2, United City, Badda, Dhaka-1212, Bangladesh.

Publication information

Heliyon. 2023 Apr 3;9(4):e15163. doi: 10.1016/j.heliyon.2023.e15163. eCollection 2023 Apr.

Abstract

Early purchase prediction plays a vital role for an e-commerce website. It enables e-shoppers to enlist consumers for product suggestions, discount offers, and many other interventions. Several studies have already used session logs to analyze customer behavior and determine whether a customer will purchase a product. In most cases, it is difficult to identify and list customers and offer them discounts once their sessions have ended. In this paper, we propose a customer purchase intention prediction model with which e-shoppers can detect a customer's intent earlier. First, we apply a feature selection technique to select the best features. The selected features are then used to train supervised learning models. Several classifiers, including support vector machine (SVM), random forest (RF), multilayer perceptron (MLP), decision tree (DT), and XGBoost, were applied along with an oversampling method to balance the dataset. The experiments were performed on a standard benchmark dataset. Experimental results show that the XGBoost classifier with feature selection and oversampling achieves significantly higher area under the ROC curve (auROC) and area under the precision-recall curve (auPR) scores, 0.937 and 0.754 respectively. The accuracies achieved by XGBoost and decision tree are also significantly improved, at 90.65% and 90.54% respectively. The overall performance of the gradient boosting method is significantly better than that of the other classifiers and state-of-the-art methods. In addition, a method for explainable analysis of the problem is outlined.
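The abstract does not say which oversampling method was used or how the two curve-based scores were computed. As an illustration only, the sketch below assumes plain random oversampling of the minority class and the usual rank-based definitions of auROC and of average precision (a common estimate of auPR). All function names and the toy data are hypothetical and are not taken from the paper; the sketch uses only the Python standard library.

```python
import random
from collections import Counter


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen rows of smaller classes until every
    class has as many rows as the largest class (assumed method)."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        pool = [r for r, y in zip(rows, labels) if y == cls]
        for _ in range(target - n):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


def au_roc(labels, scores):
    """Fraction of (positive, negative) pairs in which the positive
    has the higher score; tied pairs count as half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def au_pr(labels, scores):
    """Average precision: mean of the precision measured at the rank of
    each positive example. Assumes there are no tied scores."""
    ranked = sorted(zip(scores, labels), key=lambda t: -t[0])
    hits, total = 0, 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank
    return total / hits


# Toy session data: 2 purchases (label 1) among 6 sessions.
toy_rows = [[i] for i in range(6)]
toy_labels = [1, 0, 1, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.7, 0.3, 0.2, 0.1]  # hypothetical model outputs

balanced_rows, balanced_labels = random_oversample(toy_rows, toy_labels)
print(Counter(balanced_labels))        # both classes now have 4 rows
print(au_roc(toy_labels, toy_scores))  # 0.875
print(au_pr(toy_labels, toy_scores))   # about 0.833
```

Oversampling is normally applied to the training split only, so that duplicated rows do not leak into the data used to compute auROC and auPR.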
