CAGNet: A Network Combining Multiscale Feature Aggregation and Attention Mechanisms for Intelligent Facial Expression Recognition in Human-Robot Interaction.

Author information

Zhang Dengpan, Ma Wenwen, Shen Zhihao, Ma Qingping

Affiliation

School of Mechanical and Power Engineering, Henan Polytechnic University, Jiaozuo 454000, China.

Publication information

Sensors (Basel). 2025 Jun 11;25(12):3653. doi: 10.3390/s25123653.

DOI:10.3390/s25123653

PMID:40573540

Abstract

The development of Facial Expression Recognition (FER) technology has significantly enhanced the naturalness and intuitiveness of human-robot interaction. In the field of service robots, particularly in applications such as production assistance, caregiving, and daily service communication, efficient FER capabilities are crucial. However, existing Convolutional Neural Network (CNN) models remain limited in feature representation and recognition accuracy for facial expressions. To address these limitations, we propose CAGNet, a novel network that combines multiscale feature aggregation and attention mechanisms. CAGNet employs a hierarchical convolutional architecture that extracts features at multiple scales through stacked convolutional layers. The network integrates the Convolutional Block Attention Module (CBAM) and Global Average Pooling (GAP) to improve the capture of both local and global features. Additionally, Batch Normalization (BN) layers and Dropout are incorporated to improve model stability and generalization. CAGNet was evaluated on two standard datasets, FER2013 and CK+, and the experimental results show that the network achieves accuracies of 71.52% and 97.97%, respectively. These results not only validate the effectiveness and superiority of our approach but also provide a new technical solution for FER. Furthermore, CAGNet offers robust support for the intelligent upgrade of service robots.
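
The abstract gives no layer-level details, so the following is only a minimal sketch of two of the building blocks it names, not the authors' implementation. It shows the channel-attention half of CBAM (a shared MLP applied to average-pooled and max-pooled channel descriptors, summed and passed through a sigmoid) followed by global average pooling. The spatial-attention half of CBAM is omitted. Plain Python lists stand in for tensors, and the function names, the toy feature map, and the weights `w1` and `w2` are all hypothetical.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def global_avg_pool(fmap):
    """fmap: list of C channels, each an H x W list of lists -> C channel means."""
    return [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in fmap]


def global_max_pool(fmap):
    """Same layout as above -> C channel maxima."""
    return [max(max(row) for row in ch) for ch in fmap]


def dense(vec, weights):
    """Bias-free linear layer; weights holds one row per output unit."""
    return [sum(w * v for w, v in zip(row, vec)) for row in weights]


def shared_mlp(vec, w1, w2):
    """Two-layer MLP with ReLU, shared by both pooled descriptors."""
    hidden = [max(0.0, h) for h in dense(vec, w1)]
    return dense(hidden, w2)


def channel_attention(fmap, w1, w2):
    """CBAM channel attention: sigmoid(MLP(avg_pool) + MLP(max_pool))."""
    avg_out = shared_mlp(global_avg_pool(fmap), w1, w2)
    max_out = shared_mlp(global_max_pool(fmap), w1, w2)
    return [sigmoid(a + m) for a, m in zip(avg_out, max_out)]


def apply_channel_attention(fmap, w1, w2):
    """Rescale every value in a channel by that channel's attention weight."""
    scale = channel_attention(fmap, w1, w2)
    return [[[s * v for v in row] for row in ch] for s, ch in zip(scale, fmap)]


# Hypothetical 2-channel, 2x2 feature map and hand-picked weights (illustrative only).
fmap = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[0.0, 0.0], [0.0, 8.0]],
]
w1 = [[0.5, -0.5]]     # 2 channels -> 1 hidden unit (reduction ratio 2)
w2 = [[1.0], [-1.0]]   # 1 hidden unit -> 2 channels

weights = channel_attention(fmap, w1, w2)
refined = apply_channel_attention(fmap, w1, w2)
pooled = global_avg_pool(refined)  # one descriptor per channel for a classifier head
print(weights, pooled)
```

In a real model these operations would run on framework tensors with learned weights; the point here is only the data flow from feature map to attention weights to a per-channel descriptor.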

Similar articles

CAGNet: A Network Combining Multiscale Feature Aggregation and Attention Mechanisms for Intelligent Facial Expression Recognition in Human-Robot Interaction.

Sensors (Basel). 2025 Jun 11;25(12):3653. doi: 10.3390/s25123653.

Facial Landmark-Driven Keypoint Feature Extraction for Robust Facial Expression Recognition.

Sensors (Basel). 2025 Jun 16;25(12):3762. doi: 10.3390/s25123762.

Enhanced AlexNet with Gabor and Local Binary Pattern Features for Improved Facial Emotion Recognition.

Sensors (Basel). 2025 Jun 19;25(12):3832. doi: 10.3390/s25123832.

New Trends in Emotion Recognition Using Image Analysis by Neural Networks, A Systematic Review.

Sensors (Basel). 2023 Aug 10;23(16):7092. doi: 10.3390/s23167092.

CBAM VGG16: An efficient driver distraction classification using CBAM embedded VGG16 architecture.

Comput Biol Med. 2024 Sep;180:108945. doi: 10.1016/j.compbiomed.2024.108945. Epub 2024 Aug 1.

Skin-CAD: Explainable deep learning classification of skin cancer from dermoscopic images by feature selection of dual high-level CNNs features and transfer learning.

Comput Biol Med. 2024 Aug;178:108798. doi: 10.1016/j.compbiomed.2024.108798. Epub 2024 Jun 25.

A novel deep learning framework for retinal disease detection leveraging contextual and local features cues from retinal images.

Med Biol Eng Comput. 2025 Feb 7. doi: 10.1007/s11517-025-03314-0.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.

Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.

Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.

Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.

References cited by this article

Facial expression recognition based on deep learning.

Comput Methods Programs Biomed. 2022 Mar;215:106621. doi: 10.1016/j.cmpb.2022.106621. Epub 2022 Jan 6.

Challenges in representation learning: a report on three machine learning contests.

Neural Netw. 2015 Apr;64:59-63. doi: 10.1016/j.neunet.2014.09.005. Epub 2014 Dec 29.
