Xinjiang Key Laboratory of Signal Detection and Processing, College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China.
College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China.
Sensors (Basel). 2023 Mar 1;23(5):2679. doi: 10.3390/s23052679.
Multimodal sentiment analysis has gained popularity as a research field for its ability to predict users' emotional tendencies more comprehensively. The data fusion module is a critical component of multimodal sentiment analysis, as it enables the integration of information from multiple modalities. However, combining modalities effectively while removing redundant information remains challenging. In our research, we address these challenges by proposing a multimodal sentiment analysis model based on supervised contrastive learning, which leads to more effective data representation and richer multimodal features. Specifically, we introduce the MLFC module, which utilizes a convolutional neural network (CNN) and a Transformer to address the redundancy in each modality's features and reduce irrelevant information. Moreover, our model employs supervised contrastive learning to enhance its ability to learn common sentiment features from data. We evaluate our model on three widely used datasets, namely MVSA-single, MVSA-multiple, and HFM, demonstrating that it outperforms state-of-the-art models. Finally, we conduct ablation experiments to validate the efficacy of our proposed method.
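The abstract's central training signal is supervised contrastive learning, where embeddings sharing a sentiment label are pulled together and embeddings with different labels are pushed apart. The paper does not give its exact loss formulation here, so the following is a minimal NumPy sketch of the standard supervised contrastive (SupCon) loss of Khosla et al. (2020), which is the usual choice in this setting; the function name, temperature value, and batch shapes are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def supcon_loss(features, labels, temperature=0.1):
    """Supervised contrastive loss over one batch (sketch, not the paper's code).

    features: (N, D) array of L2-normalized embeddings
    labels:   (N,) integer sentiment labels
    Returns the mean over anchors of the negative mean log-probability
    assigned to that anchor's same-label positives.
    """
    n = features.shape[0]
    sim = features @ features.T / temperature           # pairwise cosine similarities
    not_self = ~np.eye(n, dtype=bool)                   # exclude each anchor from its own denominator
    sim = sim - sim.max(axis=1, keepdims=True)          # subtract row max for numerical stability
    exp_sim = np.exp(sim) * not_self
    log_prob = sim - np.log(exp_sim.sum(axis=1, keepdims=True))
    pos_mask = (labels[:, None] == labels[None, :]) & not_self
    pos_counts = pos_mask.sum(axis=1)
    valid = pos_counts > 0                              # anchors with at least one positive
    mean_log_prob_pos = (log_prob * pos_mask).sum(axis=1)[valid] / pos_counts[valid]
    return -mean_log_prob_pos.mean()
```

In a multimodal pipeline such as the one the abstract describes, `features` would be the fused per-sample representations produced after the MLFC module, so the loss shapes the joint embedding space rather than any single modality.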