Cao Feifei, Huang Xiaomin
School of Economics, Guangdong Peizheng College, Guangzhou, China.
PeerJ Comput Sci. 2023 Oct 9;9:e1578. doi: 10.7717/peerj-cs.1578. eCollection 2023.
Aspect-level sentiment classification task (ASCT) is a natural language processing task that aims to correctly identify specific aspects and determine their sentiment polarity from a given target sentence. Deep learning models have been proven to be effective in aspect-based sentiment classification tasks, and the mainstream Aspect-level sentiment classification (ASC) models currently constructed generally assume that the training and test datasets are Gaussian distribution (e.g., the same language). Once the data distribution changes, the ASC model must be retrained on the new distribution data to achieve good performance. However, acquiring a large amount of labeled data again typically requires a lot of manpower and money, which seems unlikely, especially for the ASC task, as it requires aspect-level annotation. This article analyzes the performance of sequence-based models, graph-based convolutional neural networks, and pre-training language models on the aspect-level sentiment classification task using two sets of comment datasets in Chinese and English, from four perspectives: classification performance, performance with different aspect numbers, specific case performance, and computational cost. In this article, we design a state-of-the-art ASC-based classification method and conduct a systematic study on eight public standard English and Chinese datasets with various commonly used assessment measures that provide directions for cross-language migration. Finally, we discuss the limitations of the study as well as future research directions.
方面级情感分类任务(ASCT)是一项自然语言处理任务,旨在从给定的目标句子中正确识别特定方面并确定其情感极性。深度学习模型已被证明在基于方面的情感分类任务中有效,目前构建的主流方面级情感分类(ASC)模型通常假设训练和测试数据集是高斯分布(例如,同一种语言)。一旦数据分布发生变化,ASC模型必须在新分布的数据上重新训练以获得良好性能。然而,再次获取大量标注数据通常需要大量人力和资金,这似乎不太可能,特别是对于ASC任务,因为它需要方面级标注。本文从分类性能、不同方面数量的性能、具体案例性能和计算成本四个角度,使用两组中文和英文评论数据集,分析了基于序列的模型、基于图的卷积神经网络和预训练语言模型在方面级情感分类任务上的性能。在本文中,我们设计了一种基于ASC的先进分类方法,并使用各种常用评估指标对八个公开标准英文和中文数据集进行了系统研究,为跨语言迁移提供了方向。最后,我们讨论了研究的局限性以及未来的研究方向。