Rashad Metwally, Alebiary Doaa, Aldawsari Mohammed, Elsawy Ahmed, H AbuEl-Atta Ahmed
Faculty of Computers and Artificial Intelligence, Benha University, Benha, Egypt.
Department of Computer Engineering and Information, College of Engineering, Prince Sattam Bin Abdulaziz University, Al-Kharj, Saudi Arabia.
PeerJ Comput Sci. 2024 Oct 25;10:e2272. doi: 10.7717/peerj-cs.2272. eCollection 2024.
Facial expression recognition (FER) has caught the research community's attention recently because it can affect many real-life applications. Multiple studies have focused on automatic FER, most of which use a machine learning methodology, FER has continued to be a difficult and exciting issue in computer vision. Deep learning has recently drawn increased attention as a solution to several practical issues, including facial expression recognition. This article introduces an efficient method for FER (FERDCNN) verified on five different pre-trained deep CNN (DCNN) models (AlexNet, GoogleNet, ResNet-18, ResNet-50, and ResNet-101). In the proposed method, firstly the input image has been pre-processed using face detection, resizing, gamma correction, and histogram equalization techniques. Secondly, the images go through DCNN to extract deep features. Finally, support vector machine (SVM) and transfer learning are used to classify generated features. Recent methods have been employed to evaluate and contrast the performance of the proposed approach on two publicly standard databases namely, CK+ and JAFFE on the seven classes of fundamental emotions, including anger, disgust, fear, happiness, sadness, and surprise beside neutrality for CK+ and contempt for JAFFE. The suggested method tested Four different traditional supervised classifiers with deep features, Experimental found that AlexNet excels as a feature extractor, while SVM demonstrates superiority as a classifier because of this combination achieving the highest accuracy rates of 99.0% and 95.16% for the CK+ database and the JAFFE datasets, respectively.
面部表情识别(FER)最近引起了研究界的关注,因为它会影响许多现实生活中的应用。多项研究聚焦于自动FER,其中大多数采用机器学习方法,在计算机视觉中,FER一直是一个既困难又令人兴奋的问题。深度学习作为解决包括面部表情识别在内的几个实际问题的方法,最近受到了越来越多的关注。本文介绍了一种在五个不同的预训练深度卷积神经网络(DCNN)模型(AlexNet、GoogleNet、ResNet-18、ResNet-50和ResNet-101)上经过验证的高效FER方法(FERDCNN)。在所提出的方法中,首先使用人脸检测、调整大小、伽马校正和直方图均衡化技术对输入图像进行预处理。其次,图像通过DCNN提取深度特征。最后,使用支持向量机(SVM)和迁移学习对生成的特征进行分类。采用最近的方法在两个公开的标准数据库(即CK+和JAFFE)上评估和对比所提出方法的性能,这两个数据库涉及七种基本情绪类别,包括愤怒、厌恶、恐惧、快乐、悲伤、惊讶,此外CK+还有中性表情,JAFFE还有轻蔑表情。所建议的方法使用深度特征测试了四种不同的传统监督分类器,实验发现AlexNet作为特征提取器表现出色,而SVM作为分类器表现出优越性,因为这种组合在CK+数据库和JAFFE数据集上分别实现了99.0%和95.16%的最高准确率。