Key Laboratory of Modern Teaching Technology, Ministry of Education, Shaanxi Normal University, Xi'an 710062, China.
Department of Information Construction and Management, Shaanxi Normal University, Xi'an 710061, China.
Sensors (Basel). 2023 Aug 13;23(16):7148. doi: 10.3390/s23167148.
In recent years, convolutional neural networks (CNNs) have played a dominant role in facial expression recognition. While CNN-based methods have achieved remarkable success, they are notorious for having an excessive number of parameters, and they rely on a large amount of manually annotated data. To address this challenge, we expand the number of training samples by learning expressions from a face recognition dataset to reduce the impact of a small number of samples on the network training. In the proposed deep joint learning framework, the deep features of the face recognition dataset are clustered, and simultaneously, the parameters of an efficient CNN are learned, thereby marking the data for network training automatically and efficiently. Specifically, first, we develop a new efficient CNN based on the proposed affinity convolution module with much lower computational overhead for deep feature learning and expression classification. Then, we develop an expression-guided deep facial clustering approach to cluster the deep features and generate abundant expression labels from the face recognition dataset. Finally, the AC-based CNN is fine-tuned using an updated training set and a combined loss function. Our framework is evaluated on several challenging facial expression recognition datasets as well as a self-collected dataset. In the context of facial expression recognition applied to the field of education, our proposed method achieved an impressive accuracy of 95.87% on the self-collected dataset, surpassing other existing methods.
近年来,卷积神经网络(CNN)在面部表情识别中占据主导地位。基于 CNN 的方法虽然取得了显著的成功,但它们的参数数量过多,并且依赖于大量的人工标注数据。为了解决这个挑战,我们通过从人脸识别数据集学习表情来扩展训练样本数量,从而减少少量样本对网络训练的影响。在提出的深度联合学习框架中,我们对人脸识别数据集的深度特征进行聚类,并同时学习高效 CNN 的参数,从而自动、高效地为网络训练标记数据。具体来说,首先,我们基于提出的基于亲和卷积模块的新型高效 CNN 进行深度特征学习和表情分类,该模块具有更低的计算开销。然后,我们开发了一种基于表情引导的深度人脸聚类方法,从人脸识别数据集中聚类深度特征并生成丰富的表情标签。最后,使用更新的训练集和组合损失函数对基于 AC 的 CNN 进行微调。我们的框架在多个具有挑战性的面部表情识别数据集以及一个自收集的数据集上进行了评估。在将面部表情识别应用于教育领域的背景下,我们提出的方法在自收集的数据集上取得了令人印象深刻的 95.87%的准确率,超过了其他现有方法。