Da'u Aminu, Salim Naomie
School of Computing, Faculty of Engineering, Universiti Teknologi Malaysia, Skudai, Johor, Malaysia.
Department of OTM, Hassan Usman Katsina Polytechnic, Katsina, Nigeria.
PeerJ Comput Sci. 2019 May 6;5:e191. doi: 10.7717/peerj-cs.191. eCollection 2019.
Aspect extraction is a subtask of sentiment analysis that deals with identifying opinion targets in an opinionated text. Existing approaches to aspect extraction typically rely on using handcrafted features, linear and integrated network architectures. Although these methods can achieve good performances, they are time-consuming and often very complicated. In real-life systems, a simple model with competitive results is generally more effective and preferable over complicated models. In this paper, we present a multichannel convolutional neural network for aspect extraction. The model consists of a deep convolutional neural network with two input channels: a word embedding channel which aims to encode semantic information of the words and a part of speech (POS) tag embedding channel to facilitate the sequential tagging process. To get the vector representation of words, we initialized the word embedding channel and the POS channel using pretrained word2vec and one-hot-vector of POS tags, respectively. Both the word embedding and the POS embedding vectors were fed into the convolutional layer and concatenated to a one-dimensional vector, which is finally pooled and processed using a Softmax function for sequence labeling. We finally conducted a series of experiments using four different datasets. The results indicated better performance compared to the baseline models.
方面提取是情感分析的一个子任务,用于在带有观点的文本中识别观点目标。现有的方面提取方法通常依赖于使用手工制作的特征、线性和集成网络架构。尽管这些方法可以取得良好的性能,但它们耗时且通常非常复杂。在实际系统中,一个具有竞争力结果的简单模型通常比复杂模型更有效且更可取。在本文中,我们提出了一种用于方面提取的多通道卷积神经网络。该模型由一个具有两个输入通道的深度卷积神经网络组成:一个词嵌入通道,旨在编码单词的语义信息;一个词性(POS)标签嵌入通道,以促进序列标记过程。为了获得单词的向量表示,我们分别使用预训练的word2vec和词性标签的独热向量初始化词嵌入通道和词性通道。词嵌入向量和词性嵌入向量都被输入到卷积层,并连接成一个一维向量,最后通过池化操作并使用Softmax函数进行序列标记处理。我们最后使用四个不同的数据集进行了一系列实验。结果表明,与基线模型相比,性能更优。