School of Electronic Information, Jiangsu University of Science and Technology, Zhenjiang 212003, China.
School of Electronic Information, Jiangsu University of Science and Technology, Zhenjiang 212003, China.
Comput Methods Programs Biomed. 2022 Mar;215:106621. doi: 10.1016/j.cmpb.2022.106621. Epub 2022 Jan 6.
Facial expression recognition technology will play an increasingly important role in our daily life. Autonomous driving, virtual reality and all kinds of robots integrated into our life depend on the development of facial expression recognition technology. Many tasks in the field of computer vision are based on deep learning technology and convolutional neural network. The paper proposes an occluded expression recognition model based on the generated countermeasure network. The model is divided into two modules, namely, occluded face image restoration and face recognition.
Firstly, this paper summarizes the research status of deep facial expression recognition methods in recent ten years and the development of related facial expression database. Then, the current facial expression recognition methods based on deep learning are divided into two categories: Static facial expression recognition and dynamic facial expression recognition. The two methodswill be introduced and summarized respectively. Aiming at the advanced deep expression recognition algorithms in the field, the performance of these algorithms on common expression databases is compared, and the strengths and weaknesses of these algorithms are analyzed in detail.
As the task of facial expression recognition is gradually transferred from the controlled laboratory environment to the challenging real-world environment, with the rapid development of deep learning technology, deep neural network can learn discriminative features, and is gradually applied to automatic facial expression recognition task. The current deep facial expression recognition system is committed to solve the following two problems: (1) Overfitting due to lack of sufficient training data; (2) In the real world environment, other variables that have nothing to do with expression bring interference problems.
From the perspective of algorithm, combining other expression models, such as facial action unit model and pleasure arousal dimension model, as well as other multimodal models, such as audio mode, 3D face depth information and human physiological information, can make expression recognition more practical.
面部表情识别技术在我们的日常生活中将发挥越来越重要的作用。自动驾驶、虚拟现实和各种融入我们生活的机器人都依赖于面部表情识别技术的发展。计算机视觉领域的许多任务都是基于深度学习技术和卷积神经网络的。本文提出了一种基于生成对策网络的遮挡表情识别模型。该模型分为两个模块,即遮挡人脸图像恢复和人脸识别。
首先,本文总结了近十年来深度学习方法在面部表情识别领域的研究现状和相关面部表情数据库的发展。然后,将基于深度学习的当前面部表情识别方法分为两类:静态面部表情识别和动态面部表情识别。分别对这两种方法进行介绍和总结。针对领域内先进的深度学习表情识别算法,对这些算法在常见表情数据库上的性能进行了比较,并详细分析了这些算法的优缺点。
随着面部表情识别任务逐渐从受控的实验室环境转移到具有挑战性的真实世界环境,随着深度学习技术的快速发展,深度神经网络可以学习到有区分性的特征,并逐渐应用于自动面部表情识别任务。当前的深度面部表情识别系统致力于解决以下两个问题:(1)由于缺乏足够的训练数据而导致的过拟合问题;(2)在真实世界环境中,与表情无关的其他变量带来的干扰问题。
从算法角度来看,结合其他表情模型,如面部动作单元模型和愉悦 arousal 维度模型,以及其他多模态模型,如音频模式、3D 人脸深度信息和人体生理信息,可以使表情识别更具实用性。