Suppr超能文献

一种针对与旅行相关的中文在线评论内容的情感分析方法。

A sentiment analysis approach for travel-related Chinese online review content.

作者信息

Li Hanyun, Li Wenzao, Zhao Jiacheng, Yu Peizhen, Huang Yao

机构信息

Chengdu University of Information Technology, Chengdu, China.

出版信息

PeerJ Comput Sci. 2023 Aug 23;9:e1538. doi: 10.7717/peerj-cs.1538. eCollection 2023.

Abstract

Using technology for sentiment analysis in the travel industry can extract valuable insights from customer reviews. It can assist businesses in gaining a deeper understanding of their consumers' emotional tendencies and enhance their services' caliber. However, travel-related online reviews are rife with colloquialisms, sparse feature dimensions, metaphors, and sarcasm. As a result, traditional semantic representations of word vectors are inaccurate, and single neural network models do not take into account multiple associative features. To address the above issues, we introduce a dual-channel algorithm that integrates convolutional neural networks (CNN) and bi-directional long and short-term memory (BiLSTM) with an attention mechanism (DC-CBLA). First, the model utilizes the pre-trained BERT, a transformer-based model, to extract a dynamic vector representation for each word that corresponds to the current contextual representation. This process enhances the accuracy of the vector semantic representation. Then, BiLSTM is used to capture the global contextual sequence features of the travel text, while CNN is used to capture the richer local semantic information. A hybrid feature network combining CNN and BiLSTM can improve the model's representation ability. Additionally, the BiLSTM output is feature-weighted using the attention mechanism to enhance the learning of its fundamental features and lessen the influence of noise features on the outcomes. Finally, the Softmax function is used to classify the dual-channel fused features. We conducted an experimental evaluation of two data sets: tourist attractions and tourist hotels. The accuracy of the DC-CBLA model is 95.23% and 89.46%, and that of the F1-score is 97.05% and 93.86%, respectively. The experimental results demonstrate that our proposed DC-CBLA model outperforms other baseline models.

摘要

在旅游业中使用技术进行情感分析可以从客户评论中提取有价值的见解。它可以帮助企业更深入地了解消费者的情感倾向,并提高其服务质量。然而,与旅游相关的在线评论充斥着俗语、稀疏的特征维度、隐喻和讽刺。因此,传统的词向量语义表示不准确,单一神经网络模型也没有考虑到多个关联特征。为了解决上述问题,我们引入了一种双通道算法,该算法将卷积神经网络(CNN)和双向长短时记忆网络(BiLSTM)与注意力机制相结合(DC-CBLA)。首先,该模型利用预训练的基于Transformer的BERT模型为每个单词提取与当前上下文表示相对应的动态向量表示。这一过程提高了向量语义表示的准确性。然后,使用BiLSTM捕捉旅游文本的全局上下文序列特征,而使用CNN捕捉更丰富的局部语义信息。结合CNN和BiLSTM的混合特征网络可以提高模型的表示能力。此外,利用注意力机制对BiLSTM输出进行特征加权,以增强其基本特征的学习,并减少噪声特征对结果的影响。最后,使用Softmax函数对双通道融合特征进行分类。我们对两个数据集进行了实验评估:旅游景点和旅游酒店。DC-CBLA模型的准确率分别为95.23%和89.46%,F1分数分别为97.05%和93.86%。实验结果表明,我们提出的DC-CBLA模型优于其他基线模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/94b6/10495948/c5b8f1bca259/peerj-cs-09-1538-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验