Feature extraction from customer reviews using enhanced rules.

Authors

Santhiran Rajeswary, Varathan Kasturi Dewi, Chiam Yin Kia

Affiliations

Department of Information Systems, Faculty of Computer Science & Information Technology, Universiti Malaya, Kuala Lumpur, Malaysia.

Department of Software Engineering, Faculty of Computer Science & Information Technology, Universiti Malaya, Kuala Lumpur, Malaysia.

Publication information

PeerJ Comput Sci. 2024 Jan 31;10:e1821. doi: 10.7717/peerj-cs.1821. eCollection 2024.

DOI:10.7717/peerj-cs.1821

PMID:38435547

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10909217/

Abstract

Opinion mining is gaining significant research interest, as it directly and indirectly provides a better avenue for understanding customers, their sentiments toward a service or product, and their purchasing decisions. However, extracting every opinion feature from unstructured customer review documents is challenging, especially since these reviews are often written in native languages and contain grammatical and spelling errors. Moreover, existing pattern rules frequently exclude features and opinion words that are not strictly nouns or adjectives. Thus, selecting suitable features when analyzing customer reviews is the key to uncovering their actual expectations. This study aims to enhance the performance of explicit feature extraction from product review documents. To achieve this, an approach that employs sequential pattern rules is proposed to identify and extract features with associated opinions. The improved pattern rules total 41, including 16 new rules introduced in this study and 25 existing pattern rules from previous research. An average calculated from the testing results of five datasets showed that the incorporation of this study's 16 new rules significantly improved feature extraction precision by 6%, recall by 6% and F-measure value by 5% compared to the contemporary approach. The new set of rules has proven to be effective in extracting features that were previously overlooked, thus achieving its objective of addressing gaps in existing rules. Therefore, this study has successfully enhanced feature extraction results, yielding an average precision of 0.91, an average recall value of 0.88, and an average F-measure of 0.89.
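To make the idea of sequential pattern rules concrete, the sketch below matches part-of-speech (POS) tag sequences against a tagged sentence and returns feature and opinion pairs, then scores the extracted features with precision, recall, and F-measure. It is only an illustration under stated assumptions: the three rules, the `RULES` table, the function names `extract_pairs` and `precision_recall_f`, and the Penn Treebank-style tags are all hypothetical and are not the 41 rules or the implementation used in the study. The input is assumed to be already POS-tagged by some external tagger.

```python
from typing import Dict, List, Set, Tuple

Token = Tuple[str, str]  # (word, POS tag); Penn Treebank-style tags are assumed

# Hypothetical rules for illustration only, NOT the rules from the study.
# "pattern" is a sequence of POS-tag prefixes; "feature" and "opinion" are
# the positions of the feature and opinion word inside the matched window.
RULES: List[Dict] = [
    {"pattern": ("JJ", "NN"), "feature": 1, "opinion": 0},              # "poor battery"
    {"pattern": ("NN", "VB", "JJ"), "feature": 0, "opinion": 2},        # "screen is sharp"
    {"pattern": ("NN", "VB", "RB", "JJ"), "feature": 0, "opinion": 3},  # "lens is very clear"
]


def extract_pairs(tokens: List[Token]) -> List[Tuple[str, str]]:
    """Slide each rule over the tagged sentence and collect (feature, opinion) pairs."""
    pairs = []
    for rule in RULES:
        pattern = rule["pattern"]
        size = len(pattern)
        for start in range(len(tokens) - size + 1):
            window = tokens[start:start + size]
            # A window matches when every tag starts with the rule's tag prefix,
            # so "VB" also covers VBZ, VBD, and so on.
            if all(tag.startswith(prefix) for (_, tag), prefix in zip(window, pattern)):
                feature = window[rule["feature"]][0].lower()
                opinion = window[rule["opinion"]][0].lower()
                pairs.append((feature, opinion))
    return pairs


def precision_recall_f(extracted: Set[str], gold: Set[str]) -> Tuple[float, float, float]:
    """Precision, recall, and F-measure (harmonic mean) of extracted features against a gold set."""
    true_positives = len(extracted & gold)
    precision = true_positives / len(extracted) if extracted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


if __name__ == "__main__":
    sentence = [("The", "DT"), ("screen", "NN"), ("is", "VBZ"), ("sharp", "JJ"),
                ("but", "CC"), ("poor", "JJ"), ("battery", "NN")]
    pairs = extract_pairs(sentence)
    print(pairs)
    features = {feature for feature, _ in pairs}
    print(precision_recall_f(features, {"screen", "battery", "lens"}))
```

A rule table like this is easy to extend: covering a construction that earlier rules miss means adding one more entry rather than changing the matching loop.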
