School of Computer Science, University College Dublin, Dublin, Ireland.
School of Computing, Dublin City University, Dublin, Ireland.
PLoS One. 2024 Aug 26;19(8):e0307180. doi: 10.1371/journal.pone.0307180. eCollection 2024.
In recent years, computational approaches for extracting customer needs from user generated content have been proposed. However, there is a lack of studies that focus on extracting unmet needs for future popular products. Therefore, this study presents a supervised keyphrase classification model which predicts needs that will become popular in real products in the marketplace. To do this, we utilize Trending Customer Needs (TCN)-a monthly dataset of trending keyphrase customer needs occurring in new products during 2011-2021 across multiple categories of Consumer Packaged Goods e.g. toothpaste, eyeliner, beer, etc. We are the first study to use this specific dataset and employ it by training a time series algorithm to learn the relationship between features we generate for each candidate keyphrase on Reddit to the ones in the dataset 1-3 years in the future. We show that our approach outperforms a baseline in the literature and through Multi-Task Learning can accurately predict needs for a category it wasn't trained on e.g. train on toothpaste, cereal, and beer products yet still predict for shampoo products. The findings from this research could provide many advantages to businesses such as gaining early access into markets.
近年来,已经提出了从用户生成的内容中提取客户需求的计算方法。然而,缺乏专注于提取未来热门产品未满足需求的研究。因此,本研究提出了一种监督关键词分类模型,用于预测市场上实际产品中将会流行的需求。为此,我们利用趋势客户需求(TCN)-一个月度数据集,其中包含 2011 年至 2021 年期间在多个消费品类别(例如牙膏、眼线笔、啤酒等)新产品中出现的趋势关键词客户需求。我们是第一个使用这个特定数据集的研究,并通过训练时间序列算法来利用它,该算法学习我们为 Reddit 上每个候选关键词生成的特征与未来 1-3 年数据集中的特征之间的关系。我们表明,我们的方法优于文献中的基线,并且通过多任务学习可以准确地预测其未训练的类别(例如,对牙膏、麦片和啤酒产品进行训练)的需求,而不仅仅是洗发水产品。这项研究的结果可以为企业提供许多优势,例如及早进入市场。