MIFAD-Net：用于面部表情识别的具有角距离损失的多层交互式特征融合网络

MIFAD-Net: Multi-Layer Interactive Feature Fusion Network With Angular Distance Loss for Face Emotion Recognition.

作者信息

Cai Weiwei, Gao Ming, Liu Runmin, Mao Jie

机构信息

College of Sports Engineering and Information Technology, Wuhan Sports University, Wuhan, China.

School of Logistics and Transportation, Central South University of Forestry and Technology, Changsha, China.

出版信息

Front Psychol. 2021 Oct 22;12:762795. doi: 10.3389/fpsyg.2021.762795. eCollection 2021.

DOI:10.3389/fpsyg.2021.762795

PMID:34744943

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8569934/

Abstract

Understanding human emotions and psychology is a critical step toward realizing artificial intelligence, and correct recognition of facial expressions is essential for judging emotions. However, the differences caused by changes in facial expression are very subtle, and different expression features are less distinguishable, making it difficult for computers to recognize human facial emotions accurately. Therefore, this paper proposes a novel multi-layer interactive feature fusion network model with angular distance loss. To begin, a multi-layer and multi-scale module is designed to extract global and local features of facial emotions in order to capture part of the feature relationships between different scales, thereby improving the model's ability to discriminate subtle features of facial emotions. Second, a hierarchical interactive feature fusion module is designed to address the issue of loss of useful feature information caused by layer-by-layer convolution and pooling of convolutional neural networks. In addition, the attention mechanism is also used between convolutional layers at different levels. Improve the neural network's discriminative ability by increasing the saliency of information about different features on the layers and suppressing irrelevant information. Finally, we use the angular distance loss function to improve the proposed model's inter-class feature separation and intra-class feature clustering capabilities, addressing the issues of large intra-class differences and high inter-class similarity in facial emotion recognition. We conducted comparison and ablation experiments on the FER2013 dataset. The results illustrate that the performance of the proposed MIFAD-Net is 1.02-4.53% better than the compared methods, and it has strong competitiveness.

摘要

理解人类情感和心理是实现人工智能的关键一步，而正确识别面部表情对于判断情感至关重要。然而，面部表情变化所引起的差异非常细微，不同的表情特征较难区分，这使得计算机难以准确识别人类面部情感。因此，本文提出了一种具有角距离损失的新型多层交互式特征融合网络模型。首先，设计了一个多层多尺度模块来提取面部情感的全局和局部特征，以便捕捉不同尺度之间的部分特征关系，从而提高模型区分面部情感细微特征的能力。其次，设计了一个分层交互式特征融合模块，以解决卷积神经网络逐层卷积和池化导致有用特征信息丢失的问题。此外，还在不同层次的卷积层之间使用了注意力机制。通过增加各层上不同特征信息的显著性并抑制无关信息，提高神经网络的判别能力。最后，我们使用角距离损失函数来提高所提出模型的类间特征分离和类内特征聚类能力，解决面部情感识别中类内差异大、类间相似度高的问题。我们在FER2013数据集上进行了比较和消融实验。结果表明，所提出的MIFAD-Net的性能比比较方法好1.02 - 4.53%，具有很强的竞争力。

相似文献

MIFAD-Net: Multi-Layer Interactive Feature Fusion Network With Angular Distance Loss for Face Emotion Recognition.MIFAD-Net：用于面部表情识别的具有角距离损失的多层交互式特征融合网络

Front Psychol. 2021 Oct 22;12:762795. doi: 10.3389/fpsyg.2021.762795. eCollection 2021.

Emotion Recognition of Online Education Learners by Convolutional Neural Networks.基于卷积神经网络的在线教育学习者情感识别

Comput Intell Neurosci. 2022 Jun 9;2022:4316812. doi: 10.1155/2022/4316812. eCollection 2022.

Facial Expression Emotion Recognition Model Integrating Philosophy and Machine Learning Theory.融合哲学与机器学习理论的面部表情情感识别模型

Front Psychol. 2021 Sep 27;12:759485. doi: 10.3389/fpsyg.2021.759485. eCollection 2021.

Enhanced Hybrid Vision Transformer with Multi-Scale Feature Integration and Patch Dropping for Facial Expression Recognition.基于多尺度特征融合和补丁丢弃的增强型混合视觉 Transformer 在面部表情识别中的应用。

Sensors (Basel). 2024 Jun 26;24(13):4153. doi: 10.3390/s24134153.

Face Recognition Algorithm Based on Multiscale Feature Fusion Network.基于多尺度特征融合网络的人脸识别算法

Comput Intell Neurosci. 2022 Mar 18;2022:5810723. doi: 10.1155/2022/5810723. eCollection 2022.

Emotion Recognition from Large-Scale Video Clips with Cross-Attention and Hybrid Feature Weighting Neural Networks.基于交叉注意力和混合特征加权神经网络的大规模视频片段中的情感识别。

Int J Environ Res Public Health. 2023 Jan 12;20(2):1400. doi: 10.3390/ijerph20021400.

FERGCN: facial expression recognition based on graph convolution network.FERGCN：基于图卷积网络的面部表情识别

Mach Vis Appl. 2022;33(3):40. doi: 10.1007/s00138-022-01288-9. Epub 2022 Mar 22.

Micro-expression recognition based on multi-scale 3D residual convolutional neural network.基于多尺度 3D 残差卷积神经网络的微表情识别。

Math Biosci Eng. 2024 Mar 1;21(4):5007-5031. doi: 10.3934/mbe.2024221.

Multi-Stream Convolution-Recurrent Neural Networks Based on Attention Mechanism Fusion for Speech Emotion Recognition.基于注意力机制融合的多流卷积循环神经网络用于语音情感识别

Entropy (Basel). 2022 Jul 26;24(8):1025. doi: 10.3390/e24081025.

Two-Level Spatio-Temporal Feature Fused Two-Stream Network for Micro-Expression Recognition.基于两级时空特征融合双流网络的微表情识别方法。

Sensors (Basel). 2024 Feb 29;24(5):1574. doi: 10.3390/s24051574.

引用本文的文献

A unified framework harnessing multi-scale feature ensemble and attention mechanism for gastric polyp and protrusion identification in gastroscope imaging.一种利用多尺度特征融合和注意力机制的统一框架，用于胃镜成像中的胃息肉和隆起识别。

Sci Rep. 2025 Feb 17;15(1):5734. doi: 10.1038/s41598-025-90034-y.

Matching Model of Dance Movements and Music Rhythm Features Using Human Posture Estimation.基于人体姿态估计的舞蹈动作与音乐节奏特征匹配模型。

Comput Intell Neurosci. 2022 Jul 13;2022:7331210. doi: 10.1155/2022/7331210. eCollection 2022.

Algorithm Composition and Emotion Recognition Based on Machine Learning.基于机器学习的算法组合与情感识别。

Comput Intell Neurosci. 2022 Jun 6;2022:1092383. doi: 10.1155/2022/1092383. eCollection 2022.

Emotional Experience and Psychological Intervention of Depression Patients Based on SOM.基于 SOM 的抑郁症患者的情绪体验与心理干预

Comput Intell Neurosci. 2022 Mar 24;2022:5064615. doi: 10.1155/2022/5064615. eCollection 2022.

Emotional Analysis Model for Social Hot Topics of Professional Migrant Workers.专业流动务工人员社会热点情绪分析模型。

Comput Intell Neurosci. 2022 Jan 31;2022:3812055. doi: 10.1155/2022/3812055. eCollection 2022.

Emotion Recognition of Foreign Language Teachers in College English Classroom Teaching.大学英语课堂教学中外语教师的情感识别

Front Psychol. 2021 Nov 11;12:788552. doi: 10.3389/fpsyg.2021.788552. eCollection 2021.

本文引用的文献

AGTH-Net: Attention-Based Graph Convolution-Guided Third-Order Hourglass Network for Sports Video Classification.AGTH-Net：基于注意力的图卷积引导三阶沙漏网络的运动视频分类。

J Healthc Eng. 2021 Jul 6;2021:8517161. doi: 10.1155/2021/8517161. eCollection 2021.

Recognizing spontaneous facial expressions of emotion in a small-scale society of Papua New Guinea.在巴布亚新几内亚的一个小规模社会中识别自发的面部情绪表达。

Emotion. 2017 Mar;17(2):337-347. doi: 10.1037/emo0000236. Epub 2016 Oct 13.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

MIFAD-Net：用于面部表情识别的具有角距离损失的多层交互式特征融合网络

MIFAD-Net: Multi-Layer Interactive Feature Fusion Network With Angular Distance Loss for Face Emotion Recognition.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献