• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于点击流电子商务数据的购物者意图预测,仅使用最少的浏览信息。

Shopper intent prediction from clickstream e-commerce data with minimal browsing information.

机构信息

ICFO - Institut de Ciencies Fotoniques, The Barcelona Institute of Science and Technology, Av. Carl Friedrich Gauss 3, 08860, Castelldefels, Barcelona, Spain.

Department of Cognitive Science and Artificial Intelligence, Tilburg University, Warandelaan 2, 5037 AB, Tilburg, The Netherlands.

出版信息

Sci Rep. 2020 Oct 12;10(1):16983. doi: 10.1038/s41598-020-73622-y.

DOI:10.1038/s41598-020-73622-y
PMID:33046722
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7550603/
Abstract

We address the problem of user intent prediction from clickstream data of an e-commerce website via two conceptually different approaches: a hand-crafted feature-based classification and a deep learning-based classification. In both approaches, we deliberately coarse-grain a new clickstream proprietary dataset to produce symbolic trajectories with minimal information. Then, we tackle the problem of trajectory classification of arbitrary length and ultimately, early prediction of limited-length trajectories, both for balanced and unbalanced datasets. Our analysis shows that k-gram statistics with visibility graph motifs produce fast and accurate classifications, highlighting that purchase prediction is reliable even for extremely short observation windows. In the deep learning case, we benchmarked previous state-of-the-art (SOTA) models on the new dataset, and improved classification accuracy over SOTA performances with our proposed LSTM architecture. We conclude with an in-depth error analysis and a careful evaluation of the pros and cons of the two approaches when applied to realistic industry use cases.

摘要

我们通过两种概念上不同的方法来解决从电子商务网站的点击流数据中预测用户意图的问题

基于手工制作特征的分类和基于深度学习的分类。在这两种方法中,我们都故意将新的点击流专有数据集粗粒度化,以生成具有最小信息量的符号轨迹。然后,我们解决了任意长度的轨迹分类问题,并最终解决了有限长度轨迹的早期预测问题,这两个问题既适用于平衡数据集,也适用于不平衡数据集。我们的分析表明,带有可见图模的 k-gram 统计信息可以产生快速而准确的分类,这表明即使在极短的观察窗口内,购买预测也是可靠的。在深度学习的情况下,我们在新数据集上对以前的最先进(SOTA)模型进行了基准测试,并通过我们提出的 LSTM 架构提高了 SOTA 性能的分类准确性。最后,我们进行了深入的误差分析,并仔细评估了这两种方法在应用于实际行业用例时的优缺点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/2c99fc386dc9/41598_2020_73622_Fig21_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/e4412d6450d0/41598_2020_73622_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/c767ab5b65ba/41598_2020_73622_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/0bf50b374bde/41598_2020_73622_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/9c2140159263/41598_2020_73622_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/40e9bd71a290/41598_2020_73622_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/ec3e517ab5ec/41598_2020_73622_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/b7fcc719c5f9/41598_2020_73622_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/406011895935/41598_2020_73622_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/36571693d27b/41598_2020_73622_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/37542527a493/41598_2020_73622_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/fd304b584c13/41598_2020_73622_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/64943e2ff9ac/41598_2020_73622_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/0f221233151d/41598_2020_73622_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/2de27e3488b9/41598_2020_73622_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/d6c8730c2d8e/41598_2020_73622_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/4da7e2d34baa/41598_2020_73622_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/56f8ddd7fd14/41598_2020_73622_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/a867ceab4f83/41598_2020_73622_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/3600167b7d27/41598_2020_73622_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/7a6373b8e581/41598_2020_73622_Fig20_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/2c99fc386dc9/41598_2020_73622_Fig21_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/e4412d6450d0/41598_2020_73622_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/c767ab5b65ba/41598_2020_73622_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/0bf50b374bde/41598_2020_73622_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/9c2140159263/41598_2020_73622_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/40e9bd71a290/41598_2020_73622_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/ec3e517ab5ec/41598_2020_73622_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/b7fcc719c5f9/41598_2020_73622_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/406011895935/41598_2020_73622_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/36571693d27b/41598_2020_73622_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/37542527a493/41598_2020_73622_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/fd304b584c13/41598_2020_73622_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/64943e2ff9ac/41598_2020_73622_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/0f221233151d/41598_2020_73622_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/2de27e3488b9/41598_2020_73622_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/d6c8730c2d8e/41598_2020_73622_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/4da7e2d34baa/41598_2020_73622_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/56f8ddd7fd14/41598_2020_73622_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/a867ceab4f83/41598_2020_73622_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/3600167b7d27/41598_2020_73622_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/7a6373b8e581/41598_2020_73622_Fig20_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f65c/7550603/2c99fc386dc9/41598_2020_73622_Fig21_HTML.jpg

相似文献

1
Shopper intent prediction from clickstream e-commerce data with minimal browsing information.基于点击流电子商务数据的购物者意图预测,仅使用最少的浏览信息。
Sci Rep. 2020 Oct 12;10(1):16983. doi: 10.1038/s41598-020-73622-y.
2
A machine learning-based procedure for leveraging clickstream data to investigate early predictability of failure on interactive tasks.一种基于机器学习的方法,利用点击流数据研究交互式任务中早期失败的可预测性。
Behav Res Methods. 2023 Apr;55(3):1392-1412. doi: 10.3758/s13428-022-01844-1. Epub 2022 Jun 1.
3
Automated Landslide-Risk Prediction Using Web GIS and Machine Learning Models.基于 WebGIS 和机器学习模型的自动化滑坡风险预测
Sensors (Basel). 2021 Jul 5;21(13):4620. doi: 10.3390/s21134620.
4
A benchmark dataset and case study for Chinese medical question intent classification.用于中文医学问题意图分类的基准数据集和案例研究。
BMC Med Inform Decis Mak. 2020 Jul 9;20(Suppl 3):125. doi: 10.1186/s12911-020-1122-3.
5
Learning hidden patterns from patient multivariate time series data using convolutional neural networks: A case study of healthcare cost prediction.使用卷积神经网络从患者多变量时间序列数据中学习隐藏模式:以医疗保健成本预测为例。
J Biomed Inform. 2020 Nov;111:103565. doi: 10.1016/j.jbi.2020.103565. Epub 2020 Sep 25.
6
A novel end-to-end classifier using domain transferred deep convolutional neural networks for biomedical images.一种使用域转移深度卷积神经网络的新型端到端生物医学图像分类器。
Comput Methods Programs Biomed. 2017 Mar;140:283-293. doi: 10.1016/j.cmpb.2016.12.019. Epub 2017 Jan 6.
7
Feature Representation and Data Augmentation for Human Activity Classification Based on Wearable IMU Sensor Data Using a Deep LSTM Neural Network.基于穿戴式 IMU 传感器数据的深度学习 LSTM 神经网络的人体活动分类的特征表示和数据增强。
Sensors (Basel). 2018 Aug 31;18(9):2892. doi: 10.3390/s18092892.
8
IMU-to-Segment Assignment and Orientation Alignment for the Lower Body Using Deep Learning.基于深度学习的下肢 IMU 与节段配准和方向对准。
Sensors (Basel). 2018 Jan 19;18(1):302. doi: 10.3390/s18010302.
9
A comparative analysis on question classification task based on deep learning approaches.基于深度学习方法的问题分类任务比较分析
PeerJ Comput Sci. 2021 Aug 3;7:e570. doi: 10.7717/peerj-cs.570. eCollection 2021.
10
Deep feature classification of angiomyolipoma without visible fat and renal cell carcinoma in abdominal contrast-enhanced CT images with texture image patches and hand-crafted feature concatenation.利用纹理图像补丁和手工特征串联对腹部增强 CT 图像中无可见脂肪的血管平滑肌脂肪瘤和肾细胞癌进行深度特征分类。
Med Phys. 2018 Apr;45(4):1550-1561. doi: 10.1002/mp.12828. Epub 2018 Mar 25.

引用本文的文献

1
Discovering action insights from large-scale assessment log data using machine learning.利用机器学习从大规模评估日志数据中发现行为洞察。
Sci Rep. 2025 Aug 19;15(1):30412. doi: 10.1038/s41598-025-14802-6.
2
SWOOP: top-k similarity joins over set streams.SWOOP:基于集合流的前k相似性连接
VLDB J. 2025;34(1):13. doi: 10.1007/s00778-024-00880-x. Epub 2024 Dec 23.
3
Zero party data between hype and hope.零和数据:炒作与期望之间
Front Big Data. 2022 Aug 30;5:943372. doi: 10.3389/fdata.2022.943372. eCollection 2022.
4
Dichotomic Pattern Mining Integrated With Constraint Reasoning for Digital Behavior Analysis.结合约束推理的二分模式挖掘用于数字行为分析
Front Artif Intell. 2022 Jul 12;5:868085. doi: 10.3389/frai.2022.868085. eCollection 2022.