• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TriCAFFNet:一种具有多特征融合网络的三交叉注意力转换器,用于面部表情识别。

TriCAFFNet: A Tri-Cross-Attention Transformer with a Multi-Feature Fusion Network for Facial Expression Recognition.

机构信息

Faculty of Artificial Intelligence in Education, Central China Normal University, Wuhan 430079, China.

出版信息

Sensors (Basel). 2024 Aug 21;24(16):5391. doi: 10.3390/s24165391.

DOI:10.3390/s24165391
PMID:39205085
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11360608/
Abstract

In recent years, significant progress has been made in facial expression recognition methods. However, tasks related to facial expression recognition in real environments still require further research. This paper proposes a tri-cross-attention transformer with a multi-feature fusion network (TriCAFFNet) to improve facial expression recognition performance under challenging conditions. By combining LBP (Local Binary Pattern) features, HOG (Histogram of Oriented Gradients) features, landmark features, and CNN (convolutional neural network) features from facial images, the model is provided with a rich input to improve its ability to discern subtle differences between images. Additionally, tri-cross-attention blocks are designed to facilitate information exchange between different features, enabling mutual guidance among different features to capture salient attention. Extensive experiments on several widely used datasets show that our TriCAFFNet achieves the SOTA performance on RAF-DB with 92.17%, AffectNet (7 cls) with 67.40%, and AffectNet (8 cls) with 63.49%, respectively.

摘要

近年来,面部表情识别方法取得了重大进展。然而,真实环境中与面部表情识别相关的任务仍需要进一步研究。本文提出了一种具有多特征融合网络的三交叉注意力转换器(TriCAFFNet),以提高挑战性条件下的面部表情识别性能。通过结合面部图像的 LBP(局部二值模式)特征、HOG(方向梯度直方图)特征、地标特征和 CNN(卷积神经网络)特征,为模型提供了丰富的输入,提高了其辨别图像细微差异的能力。此外,设计了三交叉注意块,以促进不同特征之间的信息交换,使不同特征之间能够相互引导,捕捉显著的注意力。在几个广泛使用的数据集上进行的大量实验表明,我们的 TriCAFFNet 在 RAF-DB 上实现了 SOTA 性能,分别为 92.17%、AffectNet(7 cls)为 67.40%和 AffectNet(8 cls)为 63.49%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aff/11360608/a51fb02e2127/sensors-24-05391-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aff/11360608/eca5c92f6af4/sensors-24-05391-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aff/11360608/d92a4cab5614/sensors-24-05391-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aff/11360608/5c70c82b5250/sensors-24-05391-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aff/11360608/9b633494bf26/sensors-24-05391-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aff/11360608/a51fb02e2127/sensors-24-05391-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aff/11360608/eca5c92f6af4/sensors-24-05391-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aff/11360608/d92a4cab5614/sensors-24-05391-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aff/11360608/5c70c82b5250/sensors-24-05391-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aff/11360608/9b633494bf26/sensors-24-05391-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6aff/11360608/a51fb02e2127/sensors-24-05391-g005.jpg

相似文献

1
TriCAFFNet: A Tri-Cross-Attention Transformer with a Multi-Feature Fusion Network for Facial Expression Recognition.TriCAFFNet:一种具有多特征融合网络的三交叉注意力转换器,用于面部表情识别。
Sensors (Basel). 2024 Aug 21;24(16):5391. doi: 10.3390/s24165391.
2
Enhanced Hybrid Vision Transformer with Multi-Scale Feature Integration and Patch Dropping for Facial Expression Recognition.基于多尺度特征融合和补丁丢弃的增强型混合视觉 Transformer 在面部表情识别中的应用。
Sensors (Basel). 2024 Jun 26;24(13):4153. doi: 10.3390/s24134153.
3
Micro-expression recognition based on multi-scale 3D residual convolutional neural network.基于多尺度 3D 残差卷积神经网络的微表情识别。
Math Biosci Eng. 2024 Mar 1;21(4):5007-5031. doi: 10.3934/mbe.2024221.
4
PASTFNet: a paralleled attention spatio-temporal fusion network for micro-expression recognition.PASTFNet:一种用于微表情识别的并行注意力时空融合网络。
Med Biol Eng Comput. 2024 Jun;62(6):1911-1924. doi: 10.1007/s11517-024-03041-y. Epub 2024 Feb 28.
5
Enhancing Facial Expression Recognition through Light Field Cameras.通过光场相机增强面部表情识别。
Sensors (Basel). 2024 Sep 3;24(17):5724. doi: 10.3390/s24175724.
6
CDGT: Constructing diverse graph transformers for emotion recognition from facial videos.构建用于面部视频情感识别的多样化图变换模型。
Neural Netw. 2024 Nov;179:106573. doi: 10.1016/j.neunet.2024.106573. Epub 2024 Jul 25.
7
Face anti-spoofing with cross-stage relation enhancement and spoof material perception.跨阶段关系增强与伪造材料感知的人脸防欺骗。
Neural Netw. 2024 Jul;175:106275. doi: 10.1016/j.neunet.2024.106275. Epub 2024 Mar 27.
8
FDB-Net: Fusion double branch network combining CNN and transformer for medical image segmentation.FDB-Net:融合 CNN 和 Transformer 的双分支网络用于医学图像分割。
J Xray Sci Technol. 2024;32(4):931-951. doi: 10.3233/XST-230413.
9
Robust facial landmark detection by cross-order cross-semantic deep network.基于跨阶跨语义深度网络的鲁棒人脸关键点检测
Neural Netw. 2021 Apr;136:233-243. doi: 10.1016/j.neunet.2020.11.001. Epub 2020 Nov 16.
10
Learning Deep Global Multi-Scale and Local Attention Features for Facial Expression Recognition in the Wild.学习深度全局多尺度和局部注意力特征,用于野外的面部表情识别。
IEEE Trans Image Process. 2021;30:6544-6556. doi: 10.1109/TIP.2021.3093397. Epub 2021 Jul 21.

本文引用的文献

1
FER-PCVT: Facial Expression Recognition with Patch-Convolutional Vision Transformer for Stroke Patients.FER-PCVT:用于中风患者的基于补丁卷积视觉变换器的面部表情识别
Brain Sci. 2022 Nov 28;12(12):1626. doi: 10.3390/brainsci12121626.