Chen Guisheng, Li Zhanshan
College of Computer Science and Technology, Jilin University, Changchun 130012, China.
Key Laboratory of Symbolic Computation and Knowledge Engineering, Ministry of Education, Jilin University, Changchun 130012, China.
Entropy (Basel). 2021 Oct 29;23(11):1430. doi: 10.3390/e23111430.
Market basket prediction, which is the basis of product recommendation systems, is the concept of predicting what customers will buy in the next shopping basket based on analysis of their historical shopping records. Although product recommendation systems develop rapidly and have good performance in practice, state-of-the-art algorithms still have plenty of room for improvement. In this paper, we propose a new algorithm combining pattern prediction and preference prediction. In pattern prediction, sequential rules, periodic patterns and association rules are mined and probability models are established based on their statistical characteristics, e.g., the distribution of periods of a periodic pattern, to make a more precise prediction. Products that have a higher probability will have priority to be recommended. If the quantity of recommended products is insufficient, then we make a preference prediction to select more products. Preference prediction is based on the frequency and tendency of products that appear in customers' individual shopping records, where tendency is a new concept to reflect the evolution of customers' shopping preferences. Experiments show that our algorithm outperforms those of the baseline methods and state-of-the-art methods on three of four real-world transaction sequence datasets.
购物篮预测是产品推荐系统的基础,它是基于对客户历史购物记录的分析来预测客户在下一个购物篮中会购买什么的概念。尽管产品推荐系统发展迅速且在实践中表现良好,但最先进的算法仍有很大的改进空间。在本文中,我们提出了一种结合模式预测和偏好预测的新算法。在模式预测中,挖掘序列规则、周期性模式和关联规则,并基于它们的统计特征(例如周期性模式的周期分布)建立概率模型,以进行更精确的预测。具有较高概率的产品将优先被推荐。如果推荐产品的数量不足,那么我们进行偏好预测以选择更多产品。偏好预测基于产品在客户个人购物记录中出现的频率和趋势,其中趋势是一个反映客户购物偏好演变的新概念。实验表明,我们的算法在四个真实世界交易序列数据集中的三个上优于基线方法和最先进的方法。