通过整合上下文感知注意力和融合网络进行讽刺检测的增强语义表示学习

Enhanced Semantic Representation Learning for Sarcasm Detection by Integrating Context-Aware Attention and Fusion Network.

作者信息

Hao Shufeng, Yao Jikun, Shi Chongyang, Zhou Yu, Xu Shuang, Li Dengao, Cheng Yinghan

机构信息

College of Data Science, Taiyuan University of Technology, Taiyuan 030024, China.

Key Laboratory of Big Data Fusion Analysis and Application of Shanxi Province, Taiyuan 030024, China.

出版信息

Entropy (Basel). 2023 May 30;25(6):878. doi: 10.3390/e25060878.

DOI:10.3390/e25060878

PMID:37372222

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10297453/

Abstract

Sarcasm is a sophisticated figurative language that is prevalent on social media platforms. Automatic sarcasm detection is significant for understanding the real sentiment tendencies of users. Traditional approaches mostly focus on content features by using lexicon, n-gram, and pragmatic feature-based models. However, these methods ignore the diverse contextual clues that could provide more evidence of the sarcastic nature of sentences. In this work, we propose a Contextual Sarcasm Detection Model (CSDM) by modeling enhanced semantic representations with user profiling and forum topic information, where context-aware attention and a user-forum fusion network are used to obtain diverse representations from distinct aspects. In particular, we employ a Bi-LSTM encoder with context-aware attention to obtain a refined comment representation by capturing sentence composition information and the corresponding context situations. Then, we employ a user-forum fusion network to obtain the comprehensive context representation by capturing the corresponding sarcastic tendencies of the user and the background knowledge about the comments. Our proposed method achieves values of 0.69, 0.70, and 0.83 in terms of accuracy on the Main balanced, Pol balanced and Pol imbalanced datasets, respectively. The experimental results on a large Reddit corpus, SARC, demonstrate that our proposed method achieves a significant performance improvement over state-of-art textual sarcasm detection methods.

摘要

讽刺是一种复杂的比喻性语言，在社交媒体平台上很普遍。自动讽刺检测对于理解用户的真实情感倾向具有重要意义。传统方法大多通过使用基于词典、n-gram和语用特征的模型来关注内容特征。然而，这些方法忽略了各种上下文线索，而这些线索可以为句子的讽刺性质提供更多证据。在这项工作中，我们通过使用用户画像和论坛主题信息对增强的语义表示进行建模，提出了一种上下文讽刺检测模型（CSDM），其中上下文感知注意力和用户-论坛融合网络用于从不同方面获得多样化的表示。具体来说，我们采用带有上下文感知注意力的双向长短期记忆（Bi-LSTM）编码器，通过捕捉句子组成信息和相应的上下文情况来获得精细的评论表示。然后，我们采用用户-论坛融合网络，通过捕捉用户相应的讽刺倾向和关于评论的背景知识来获得全面的上下文表示。我们提出的方法在主平衡数据集、政治平衡数据集和政治不平衡数据集上的准确率分别达到了0.69、0.70和0.83。在一个大型Reddit语料库SARC上的实验结果表明，我们提出的方法相对于现有的文本讽刺检测方法在性能上有显著提高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5063/10297453/d4042078f4d5/entropy-25-00878-g001.jpg

相似文献

Enhanced Semantic Representation Learning for Sarcasm Detection by Integrating Context-Aware Attention and Fusion Network.通过整合上下文感知注意力和融合网络进行讽刺检测的增强语义表示学习

Entropy (Basel). 2023 May 30;25(6):878. doi: 10.3390/e25060878.

A contextual-based approach for sarcasm detection.基于语境的反讽检测方法。

Sci Rep. 2024 Jul 4;14(1):15415. doi: 10.1038/s41598-024-65217-8.

Multi-Rule Based Ensemble Feature Selection Model for Sarcasm Type Detection in Twitter.基于多规则集成特征选择模型的 Twitter 反讽类型检测。

Comput Intell Neurosci. 2020 Jan 9;2020:2860479. doi: 10.1155/2020/2860479. eCollection 2020.

Detecting sarcasm in multi-domain datasets using convolutional neural networks and long short term memory network model.使用卷积神经网络和长短期记忆网络模型检测多领域数据集中的讽刺意味。

PeerJ Comput Sci. 2021 Aug 25;7:e645. doi: 10.7717/peerj-cs.645. eCollection 2021.

Interpretable Multi-Head Self-Attention Architecture for Sarcasm Detection in Social Media.用于社交媒体讽刺检测的可解释多头自注意力架构

Entropy (Basel). 2021 Mar 26;23(4):394. doi: 10.3390/e23040394.

Sentiment Analysis and Sarcasm Detection using Deep Multi-Task Learning.使用深度多任务学习的情感分析与讽刺检测

Wirel Pers Commun. 2023;129(3):2213-2237. doi: 10.1007/s11277-023-10235-4. Epub 2023 Mar 4.

Multi-feature fusion framework for sarcasm identification on twitter data: A machine learning based approach.基于机器学习的多特征融合框架在推特数据中的反讽识别。

PLoS One. 2021 Jun 10;16(6):e0252918. doi: 10.1371/journal.pone.0252918. eCollection 2021.

ArSa-Tweets: A novel Arabic sarcasm detection system based on deep learning model.ArSa-Tweets：一种基于深度学习模型的新型阿拉伯语讽刺检测系统。

Heliyon. 2024 Aug 28;10(17):e36892. doi: 10.1016/j.heliyon.2024.e36892. eCollection 2024 Sep 15.

Neural substrates of sarcasm: a functional magnetic-resonance imaging study.讽刺的神经基础：一项功能磁共振成像研究

Brain Res. 2006 Dec 8;1124(1):100-10. doi: 10.1016/j.brainres.2006.09.088. Epub 2006 Nov 7.

SemSeq4FD: Integrating global semantic relationship and local sequential order to enhance text representation for fake news detection.SemSeq4FD：整合全局语义关系和局部顺序以增强用于假新闻检测的文本表示

Expert Syst Appl. 2021 Mar 15;166:114090. doi: 10.1016/j.eswa.2020.114090. Epub 2020 Oct 3.

本文引用的文献

Interpretable Multi-Head Self-Attention Architecture for Sarcasm Detection in Social Media.用于社交媒体讽刺检测的可解释多头自注意力架构

Entropy (Basel). 2021 Mar 26;23(4):394. doi: 10.3390/e23040394.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过整合上下文感知注意力和融合网络进行讽刺检测的增强语义表示学习

Enhanced Semantic Representation Learning for Sarcasm Detection by Integrating Context-Aware Attention and Fusion Network.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献