Suppr超能文献

基于区域双重注意力的视频情绪识别。

Region Dual Attention-Based Video Emotion Recognition.

机构信息

School of Computing, Henan University of Engineering, Zhengzhou, China.

出版信息

Comput Intell Neurosci. 2022 Jun 15;2022:6096325. doi: 10.1155/2022/6096325. eCollection 2022.

Abstract

To solve the emotional differences between different regions of the video frame and make use of the interrelationship between different regions, a region dual attention-based video emotion recognition method (RDAM) is proposed. RDAM takes as input video frame sequences and learns a discriminatory video emotion representation that can make full use of the emotional differences of different regions and the interrelationship between regions. Specifically, we construct two parallel attention modules: one is the regional location attention module, which generates a weight value for each feature region to identify the relative importance of different regions. Based on the weight, the emotion feature that can perceive the emotional sensitive region is generated. The other is the regional relationship attention module, which generates a region relation matrix that represents the interrelationship of different regions of a video frame. Based on the region relation matrix, the emotion feature that can perceive interrelationship between different regions is generated. The outputs of these two attention modules are fused to produce the emotional features of video frames. Then, the features of video frame sequences are fused by attention-based fusion network, and the final emotion feature of the video is produced. The experimental results on the video emotion recognition data sets show that the proposed method outperforms the other related works.

摘要

为了解决视频帧不同区域之间的情感差异,并利用不同区域之间的相互关系,提出了一种基于区域双重注意力的视频情感识别方法(RDAM)。RDAM 以视频帧序列作为输入,学习一种具有判别力的视频情感表示,可以充分利用不同区域的情感差异和区域之间的相互关系。具体来说,我们构建了两个并行的注意力模块:一个是区域位置注意力模块,它为每个特征区域生成一个权重值,以识别不同区域的相对重要性。基于权重,生成能够感知情感敏感区域的情感特征。另一个是区域关系注意力模块,它生成一个表示视频帧不同区域之间相互关系的区域关系矩阵。基于区域关系矩阵,生成能够感知不同区域之间相互关系的情感特征。这两个注意力模块的输出融合在一起,生成视频帧的情感特征。然后,通过基于注意力的融合网络融合视频帧序列的特征,生成视频的最终情感特征。在视频情感识别数据集上的实验结果表明,所提出的方法优于其他相关工作。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b9b/9217593/00719e24f435/CIN2022-6096325.001.jpg

相似文献

1
Region Dual Attention-Based Video Emotion Recognition.
Comput Intell Neurosci. 2022 Jun 15;2022:6096325. doi: 10.1155/2022/6096325. eCollection 2022.
2
Hierarchical Attention-Based Multimodal Fusion Network for Video Emotion Recognition.
Comput Intell Neurosci. 2021 Sep 25;2021:5585041. doi: 10.1155/2021/5585041. eCollection 2021.
3
Sparse Spatial-Temporal Emotion Graph Convolutional Network for Video Emotion Recognition.
Comput Intell Neurosci. 2022 Sep 28;2022:3518879. doi: 10.1155/2022/3518879. eCollection 2022.
4
Emotion Recognition from Large-Scale Video Clips with Cross-Attention and Hybrid Feature Weighting Neural Networks.
Int J Environ Res Public Health. 2023 Jan 12;20(2):1400. doi: 10.3390/ijerph20021400.
5
6
A Dual-Path Cross-Modal Network for Video-Music Retrieval.
Sensors (Basel). 2023 Jan 10;23(2):805. doi: 10.3390/s23020805.
7
Emotion Recognition Using Hierarchical Spatiotemporal Electroencephalogram Information from Local to Global Brain Regions.
Bioengineering (Basel). 2023 Sep 4;10(9):1040. doi: 10.3390/bioengineering10091040.
8
Multimodal interaction enhanced representation learning for video emotion recognition.
Front Neurosci. 2022 Dec 19;16:1086380. doi: 10.3389/fnins.2022.1086380. eCollection 2022.
9
AttendAffectNet-Emotion Prediction of Movie Viewers Using Multimodal Fusion with Self-Attention.
Sensors (Basel). 2021 Dec 14;21(24):8356. doi: 10.3390/s21248356.

本文引用的文献

1
Hierarchical Attention-Based Multimodal Fusion Network for Video Emotion Recognition.
Comput Intell Neurosci. 2021 Sep 25;2021:5585041. doi: 10.1155/2021/5585041. eCollection 2021.
2
Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition.
IEEE Trans Image Process. 2020 Jan 29. doi: 10.1109/TIP.2019.2956143.
3
4
Facial Emotion Recognition: A Survey and Real-World User Experiences in Mixed Reality.
Sensors (Basel). 2018 Feb 1;18(2):416. doi: 10.3390/s18020416.
5
Constants across cultures in the face and emotion.
J Pers Soc Psychol. 1971 Feb;17(2):124-9. doi: 10.1037/h0030377.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验