Suppr超能文献

预测有用评论中的显著特征识别及机器学习技术

Identification of significant features and machine learning technique in predicting helpful reviews.

作者信息

Sadeek Quaderi Shah Jafor, Varathan Kasturi Dewi

机构信息

Department of Information Systems, Faculty of Computer Science & Information Technology, Universiti Malaya, Kuala Lumpur, Malaysia.

出版信息

PeerJ Comput Sci. 2024 Jan 23;10:e1745. doi: 10.7717/peerj-cs.1745. eCollection 2024.

Abstract

Consumers nowadays rely heavily on online reviews in making their purchase decisions. However, they are often overwhelmed by the mass amount of product reviews that are being generated on online platforms. Therefore, it is deemed essential to determine the helpful reviews, as it will significantly reduce the number of reviews that each consumer has to ponder. A review is identified as a helpful review if it has significant information that helps the reader in making a purchase decision. Many reviews posted online are lacking a sufficient amount of information used in the decision-making process. Past research has neglected much useful information that can be utilized in predicting helpful reviews. This research identifies significant information which is represented as features categorized as linguistic, metadata, readability, subjectivity, and polarity that have contributed to predicting helpful online reviews. Five machine learning models were compared on two Amazon open datasets, each consisting of 9,882,619 and 65,222 user reviews. The significant features used in the Random Forest technique managed to outperform other techniques used by previous researchers with an accuracy of 89.36%.

摘要

如今,消费者在做出购买决策时严重依赖在线评论。然而,他们常常被在线平台上大量的产品评论所淹没。因此,确定有用的评论被认为至关重要,因为这将显著减少每个消费者需要考虑的评论数量。如果一篇评论包含有助于读者做出购买决策的重要信息,那么它就被认定为有用的评论。许多在线发布的评论缺乏在决策过程中使用的足够信息。过去的研究忽略了许多可用于预测有用评论的有用信息。本研究确定了重要信息,这些信息表现为语言、元数据、可读性、主观性和极性等类别特征,这些特征有助于预测有用的在线评论。在两个亚马逊开放数据集上比较了五种机器学习模型,每个数据集分别包含9,882,619条和65,222条用户评论。随机森林技术中使用的重要特征以89.36%的准确率成功超越了先前研究人员使用的其他技术。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49b9/11636698/db5dfee111f5/peerj-cs-10-1745-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验