• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于联合面部表情检测与分类的双分支多维度注意力机制

Dual-Branch Multi-Dimensional Attention Mechanism for Joint Facial Expression Detection and Classification.

作者信息

Peng Cheng, Li Bohao, Zou Kun, Zhang Bowen, Dai Genan, Tsoi Ah Chung

机构信息

School of Computing, Zhongshan Institute, University of Electronic Science and Technology of China, Zhongshan 528402, China.

School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 610000, China.

出版信息

Sensors (Basel). 2025 Jun 18;25(12):3815. doi: 10.3390/s25123815.

DOI:10.3390/s25123815
PMID:40573702
Abstract

This paper addresses the central issue arising from the (SDAC) of facial expressions, namely, to balance the competing demands of good global features for detection, and fine features for good facial expression classifications by replacing the feature extraction part of the "neck" network in the feature pyramid network in the You Only Look Once X (YOLOX) framework with a novel architecture involving three attention mechanisms-batch, channel, and neighborhood-which respectively explores the three input dimensions-batch, channel, and spatial. Correlations across a batch of images in the individual path of the dual incoming paths are first extracted by a self attention mechanism in the batch dimension; these two paths are fused together to consolidate their information and then split again into two separate paths; the information along the channel dimension is extracted using a generalized form of channel attention, an adaptive graph channel attention, which provides each element of the incoming signal with a weight that is adapted to the incoming signal. The combination of these two paths, together with two skip connections from the input to the batch attention to the output of the adaptive channel attention, then passes into a residual network, with neighborhood attention to extract fine features in the spatial dimension. This novel dual path architecture has been shown experimentally to achieve a better balance between the competing demands in an SDAC problem than other competing approaches. Ablation studies enable the determination of the relative importance of these three attention mechanisms. Competitive results are obtained on two non-aligned face expression recognition datasets, RAF-DB and SFEW, when compared with other state-of-the-art methods.

摘要

本文探讨了面部表情(SDAC)中出现的核心问题,即通过用一种涉及三种注意力机制(批次、通道和邻域)的新颖架构替换You Only Look Once X(YOLOX)框架中特征金字塔网络的“颈部”网络的特征提取部分,来平衡检测所需的良好全局特征和面部表情良好分类所需的精细特征之间相互竞争的需求,这三种注意力机制分别探索三个输入维度(批次、通道和空间)。首先通过批次维度中的自注意力机制提取双输入路径中单个路径上一批图像之间的相关性;这两条路径融合在一起以整合它们的信息,然后再次拆分为两条单独的路径;沿着通道维度的信息使用一种广义形式的通道注意力(自适应图通道注意力)来提取,它为输入信号的每个元素提供一个适应输入信号的权重。这两条路径的组合,连同从输入到批次注意力再到自适应通道注意力输出的两个跳跃连接,然后进入一个残差网络,利用邻域注意力在空间维度中提取精细特征。实验表明,这种新颖的双路径架构在SDAC问题中比其他竞争方法能更好地平衡相互竞争的需求。消融研究能够确定这三种注意力机制的相对重要性。与其他当前最先进的方法相比,在两个未对齐的面部表情识别数据集RAF-DB和SFEW上获得了具有竞争力的结果。

相似文献

1
Dual-Branch Multi-Dimensional Attention Mechanism for Joint Facial Expression Detection and Classification.用于联合面部表情检测与分类的双分支多维度注意力机制
Sensors (Basel). 2025 Jun 18;25(12):3815. doi: 10.3390/s25123815.
2
Facial Landmark-Driven Keypoint Feature Extraction for Robust Facial Expression Recognition.用于鲁棒面部表情识别的面部地标驱动关键点特征提取
Sensors (Basel). 2025 Jun 16;25(12):3762. doi: 10.3390/s25123762.
3
TLTNet: A novel transscale cascade layered transformer network for enhanced retinal blood vessel segmentation.TLTNet:一种新颖的跨尺度级联分层Transformer 网络,用于增强视网膜血管分割。
Comput Biol Med. 2024 Aug;178:108773. doi: 10.1016/j.compbiomed.2024.108773. Epub 2024 Jun 25.
4
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
5
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
6
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验:对定性文献的系统综述
JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
8
The quantity, quality and findings of network meta-analyses evaluating the effectiveness of GLP-1 RAs for weight loss: a scoping review.评估胰高血糖素样肽-1受体激动剂(GLP-1 RAs)减肥效果的网状Meta分析的数量、质量及结果:一项范围综述
Health Technol Assess. 2025 Jun 25:1-73. doi: 10.3310/SKHT8119.
9
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
10
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

本文引用的文献

1
Learning to Explore Sample Relationships.学习探索样本关系。
IEEE Trans Pattern Anal Mach Intell. 2025 Jul;47(7):5445-5459. doi: 10.1109/TPAMI.2025.3549300.
2
An Innovative Neighbor Attention Mechanism Based on Coordinates for the Recognition of Facial Expressions.基于坐标的创新型邻居注意力机制在面部表情识别中的应用。
Sensors (Basel). 2024 Nov 20;24(22):7404. doi: 10.3390/s24227404.
3
Ambiguous facial expression detection for Autism Screening using enhanced YOLOv7-tiny model.使用增强型 YOLOv7-tiny 模型进行自闭症筛查的模糊面部表情检测。
Sci Rep. 2024 Nov 18;14(1):28501. doi: 10.1038/s41598-024-77549-6.
4
Facial Expression Recognition-You Only Look Once-Neighborhood Coordinate Attention Mamba: Facial Expression Detection and Classification Based on Neighbor and Coordinates Attention Mechanism.基于邻域坐标注意力机制的 You Only Look Once-Neighborhood Coordinate Attention Mamba 人脸表情识别:人脸表情检测与分类
Sensors (Basel). 2024 Oct 28;24(21):6912. doi: 10.3390/s24216912.
5
An Assessment of In-the-Wild Datasets for Multimodal Emotion Recognition.多模态情感识别的野外数据集评估。
Sensors (Basel). 2023 May 30;23(11):5184. doi: 10.3390/s23115184.
6
Local Directional Ternary Pattern for Facial Expression Recognition.用于面部表情识别的局部方向三元模式。
IEEE Trans Image Process. 2017 Dec;26(12):6006-6018. doi: 10.1109/TIP.2017.2726010. Epub 2017 Jul 11.
7
Emotion Generation and Emotion Regulation: One or Two Depends on Your Point of View.情绪产生与情绪调节:是一还是二取决于你的观点。
Emot Rev. 2011 Jan;3(1):8-16. doi: 10.1177/1754073910380974.
8
The Future of Psychology: Connecting Mind to Brain.心理学的未来:连接心灵与大脑。
Perspect Psychol Sci. 2009 Jul;4(4):326-39. doi: 10.1111/j.1745-6924.2009.01134.x.
9
Dynamic texture recognition using local binary patterns with an application to facial expressions.基于局部二值模式的动态纹理识别及其在面部表情中的应用
IEEE Trans Pattern Anal Mach Intell. 2007 Jun;29(6):915-28. doi: 10.1109/TPAMI.2007.1110.