• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于自然环境下面部表情识别的多损失、特征融合及改进的前两名投票集成方法

Multi-loss, feature fusion and improved top-two-voting ensemble for facial expression recognition in the wild.

作者信息

Zhou Guangyao, Xie Yuanlun, Fu Yiqin, Wang Zhaokun

机构信息

School of Computing and Artificial Intelligence, Southwest Jiaotong University, China.

School of Information and Software Engineering, University of Electronic Science and Technology of China, China.

出版信息

Neural Netw. 2025 Mar;183:106937. doi: 10.1016/j.neunet.2024.106937. Epub 2024 Nov 26.

DOI:10.1016/j.neunet.2024.106937
PMID:39615451
Abstract

Facial expression recognition (FER) in the wild is a challenging pattern recognition task affected by the images' low quality and has attracted broad interest in computer vision. Existing FER methods failed to obtain sufficient accuracy to support the practical applications, especially in scenarios with low fault tolerance, which limits the adaptability of FER. Targeting exploring the possibility of further improving the accuracy of FER in the wild, this paper proposes a novel single model named R18+FAML and an ensemble model named R18+FAML-FGA-T2V, which applies intra-feature fusion within a single network, feature fusion among multiple networks, and the ensemble decision strategy. Based on the backbone of ResNet18 (R18), R18+FAML combines internal feature fusion and three attention blocks, as well as uses multiple loss functions (FAML) to improve the diversity of the feature extraction. To effectively integrate feature extractors from multiple networks, we propose feature fusion among networks based on the genetic algorithm (FGA). Comprehensively considering and utilizing more classification information, we propose an ensemble strategy, i.e., the improved top-two-voting (T2V) of multiple networks with the same structure. Combining the above strategies, R18+FAML-FGA-T2V can focus on the main expression-aware areas by integrating interest areas of multiple networks. From experiments on three challenging FER datasets in the wild including RAF-DB, AffectNet-8 and AffectNet-7, our single model R18+FAML and ensemble model R18+FAML-FGA-T2V achieve the accuracies of 90.32,62.17,65.83% and 91.59,63.27,66.63% respectively, both achieving the state-of-the-art results.

摘要

野外面部表情识别(FER)是一项具有挑战性的模式识别任务,受图像质量低的影响,在计算机视觉领域引起了广泛关注。现有的FER方法未能获得足够的准确率来支持实际应用,尤其是在容错率低的场景中,这限制了FER的适应性。为了探索进一步提高野外FER准确率的可能性,本文提出了一种名为R18+FAML的新型单模型和一种名为R18+FAML-FGA-T2V的集成模型,该集成模型在单个网络内应用特征内融合、多个网络间的特征融合以及集成决策策略。基于ResNet18(R18)的骨干网络,R18+FAML结合了内部特征融合和三个注意力块,并使用多个损失函数(FAML)来提高特征提取的多样性。为了有效整合来自多个网络的特征提取器,我们提出了基于遗传算法(FGA)的网络间特征融合。综合考虑并利用更多的分类信息,我们提出了一种集成策略,即对具有相同结构的多个网络进行改进的前两名投票(T2V)。结合上述策略,R18+FAML-FGA-T2V可以通过整合多个网络的感兴趣区域来聚焦主要的表情感知区域。在包括RAF-DB、AffectNet-8和AffectNet-7在内的三个具有挑战性的野外FER数据集上的实验表明,我们的单模型R18+FAML和集成模型R18+FAML-FGA-T2V分别达到了90.32%、62.17%、65.83%和91.59%、63.27%、66.63%的准确率,均取得了当前最优的结果。

相似文献

1
Multi-loss, feature fusion and improved top-two-voting ensemble for facial expression recognition in the wild.用于自然环境下面部表情识别的多损失、特征融合及改进的前两名投票集成方法
Neural Netw. 2025 Mar;183:106937. doi: 10.1016/j.neunet.2024.106937. Epub 2024 Nov 26.
2
TriCAFFNet: A Tri-Cross-Attention Transformer with a Multi-Feature Fusion Network for Facial Expression Recognition.TriCAFFNet:一种具有多特征融合网络的三交叉注意力转换器,用于面部表情识别。
Sensors (Basel). 2024 Aug 21;24(16):5391. doi: 10.3390/s24165391.
3
Enhancing Facial Expression Recognition through Light Field Cameras.通过光场相机增强面部表情识别。
Sensors (Basel). 2024 Sep 3;24(17):5724. doi: 10.3390/s24175724.
4
An Innovative Neighbor Attention Mechanism Based on Coordinates for the Recognition of Facial Expressions.基于坐标的创新型邻居注意力机制在面部表情识别中的应用。
Sensors (Basel). 2024 Nov 20;24(22):7404. doi: 10.3390/s24227404.
5
Enhanced Hybrid Vision Transformer with Multi-Scale Feature Integration and Patch Dropping for Facial Expression Recognition.基于多尺度特征融合和补丁丢弃的增强型混合视觉 Transformer 在面部表情识别中的应用。
Sensors (Basel). 2024 Jun 26;24(13):4153. doi: 10.3390/s24134153.
6
Enhancing feature selection for multi-pose facial expression recognition using a hybrid of quantum inspired firefly algorithm and artificial bee colony algorithm.使用量子启发式萤火虫算法和人工蜂群算法的混合方法增强多姿态面部表情识别的特征选择
Sci Rep. 2025 Feb 7;15(1):4665. doi: 10.1038/s41598-025-85206-9.
7
A Student Facial Expression Recognition Model Based on Multi-Scale and Deep Fine-Grained Feature Attention Enhancement.基于多尺度和深度细粒度特征注意力增强的学生面部表情识别模型。
Sensors (Basel). 2024 Oct 20;24(20):6748. doi: 10.3390/s24206748.
8
A fine-grained human facial key feature extraction and fusion method for emotion recognition.一种用于情感识别的细粒度人类面部关键特征提取与融合方法。
Sci Rep. 2025 Feb 20;15(1):6153. doi: 10.1038/s41598-025-90440-2.
9
Driver facial emotion tracking using an enhanced residual network with weighted fusion of channel and spatial attention.基于通道和空间注意力加权融合的增强残差网络的驾驶员面部情绪跟踪
Sci Rep. 2025 Apr 12;15(1):12675. doi: 10.1038/s41598-025-97451-z.
10
Facial Expression Recognition-You Only Look Once-Neighborhood Coordinate Attention Mamba: Facial Expression Detection and Classification Based on Neighbor and Coordinates Attention Mechanism.基于邻域坐标注意力机制的 You Only Look Once-Neighborhood Coordinate Attention Mamba 人脸表情识别:人脸表情检测与分类
Sensors (Basel). 2024 Oct 28;24(21):6912. doi: 10.3390/s24216912.