A supervised scheme for aspect extraction in sentiment analysis using the hybrid feature set of word dependency relations and lemmas.

Author information

Bhamare Bhavana R, Prabhu Jeyanthi

Affiliations

Department of Computer Science and Engineering, Sathyabama Institute of Science and Technology, Chennai, Tamilnadu, India.

Department of Information Technology, Sathyabama Institute of Science and Technology, Chennai, Tamilnadu, India.

Publication information

PeerJ Comput Sci. 2021 Feb 5;7:e347. doi: 10.7717/peerj-cs.347. eCollection 2021.

DOI:10.7717/peerj-cs.347

PMID:33816997

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7959606/

Abstract

With the rapid growth of the Web, people post reviews of products, movies and places they visit on social media. These reviews help customers, and they help product owners evaluate their products. Structured data is easier to analyze than unstructured data, and reviews are available only in an unstructured format. Aspect-Based Sentiment Analysis mines the aspects of a product from the reviews and then determines the sentiment for each aspect. In this work, two methods for aspect extraction are proposed. The datasets used are the SemEval restaurant review dataset and Yelp and Kaggle datasets. The first method is a multivariate filter-based approach to feature selection. It helps select significant features and reduces redundancy among the selected features, and it improves F1-score compared with a method that uses only relevant features selected by Term Frequency weight. The second method uses selective dependency relations, obtained with the Stanford NLP parser, to extract features. Features extracted with selective dependency rules give better results than features extracted with all dependency rules. The hybrid approach extracts both lemma features and selective dependency relation based features. With the hybrid feature set, 94.78% accuracy and 85.24% F1-score are achieved in the aspect category prediction task.
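The abstract does not say how the multivariate filter scores features, so the following is only a sketch of the general idea of trading relevance against redundancy. It assumes mutual information as both the relevance and the redundancy measure (a stand-in, not the paper's stated criterion) and uses a greedy, mRMR-style selection. The function names, the toy lemma columns and the labels are all hypothetical.

```python
from collections import Counter
from math import log


def mutual_information(xs, ys):
    """Mutual information (in nats) between two equal-length discrete sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    mi = 0.0
    for (x, y), count in joint.items():
        # p(x, y) * log(p(x, y) / (p(x) * p(y))), written with raw counts
        mi += (count / n) * log(count * n / (px[x] * py[y]))
    return mi


def select_features(columns, labels, k):
    """Greedily pick up to k features, scoring each candidate as
    relevance to the labels minus mean redundancy with features already chosen.
    This is an assumed scoring rule, not the one used in the paper."""
    relevance = {name: mutual_information(col, labels)
                 for name, col in columns.items()}
    selected = []
    candidates = set(columns)

    def score(name):
        if not selected:
            return relevance[name]
        redundancy = sum(mutual_information(columns[name], columns[s])
                         for s in selected) / len(selected)
        return relevance[name] - redundancy

    while candidates and len(selected) < k:
        # sorted() makes tie-breaking deterministic
        best = max(sorted(candidates), key=score)
        selected.append(best)
        candidates.remove(best)
    return selected


# Toy data (hypothetical): one binary presence column per lemma, six reviews.
labels = ["food", "food", "food", "service", "service", "service"]
columns = {
    "pizza":  [1, 1, 0, 0, 0, 0],
    "tasty":  [1, 1, 0, 0, 0, 0],   # exact duplicate of "pizza"
    "waiter": [0, 0, 0, 1, 1, 0],
    "the":    [1, 1, 1, 1, 1, 1],   # carries no information about the label
}

print(select_features(columns, labels, 2))
```

On this toy data the two selected features are `pizza` and `waiter`: `tasty` is as relevant as `pizza` taken alone, but it is passed over because it duplicates a feature that is already selected. A univariate filter that ranks by relevance only could not make that distinction.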

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d012/7959606/92c90f7f6934/peerj-cs-07-347-g001.jpg
