
Face-mask-aware Facial Expression Recognition based on Face Parsing and Vision Transformer.

Authors

Yang Bo, Wu Jianming, Ikeda Kazushi, Hattori Gen, Sugano Masaru, Iwasawa Yusuke, Matsuo Yutaka

Affiliations

KDDI Research, Inc., 2-1-15 Ohara, Fujimino-shi, Saitama, 356-8502, Japan.

The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8654, Japan.

Publication

Pattern Recognit Lett. 2022 Dec;164:173-182. doi: 10.1016/j.patrec.2022.11.004. Epub 2022 Nov 9.

Abstract

As wearing face masks has become an embedded practice due to the COVID-19 pandemic, facial expression recognition (FER) that takes face masks into account is now a problem that needs to be solved. In this paper, we propose a face parsing and vision Transformer-based method to improve the accuracy of face-mask-aware FER. First, in order to improve the precision of distinguishing the unobstructed facial region as well as those parts of the face covered by a mask, we re-train a face-mask-aware face parsing model, based on the existing face parsing dataset automatically relabeled with a face mask and pixel label. Second, we propose a vision Transformer with a cross attention mechanism-based FER classifier, capable of taking both occluded and non-occluded facial regions into account and reweighting these two parts automatically to achieve the best facial expression recognition performance. The proposed method outperforms existing state-of-the-art face-mask-aware FER methods, as well as other occlusion-aware FER methods, on two datasets that contain three kinds of emotions (the M-LFW-FER and M-KDDI-FER datasets) and two datasets that contain seven kinds of emotions (the M-FER-2013 and M-CK+ datasets).
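The cross-attention reweighting described in the abstract can be illustrated with a minimal sketch, in which patch features from the non-occluded (eye) region attend to features from the mask-covered region and vice versa. All names, shapes, and the use of plain NumPy here are illustrative assumptions; the paper's actual classifier is a full vision Transformer:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(queries, keys, values):
    """Scaled dot-product cross attention: queries from one facial
    region attend to the features of the other region."""
    d = queries.shape[-1]
    scores = queries @ keys.T / np.sqrt(d)      # (n_q, n_k) similarities
    weights = softmax(scores, axis=-1)          # rows sum to 1
    return weights @ values, weights

rng = np.random.default_rng(0)
d = 8
eye_tokens = rng.normal(size=(4, d))    # non-occluded (eye) region patches
mask_tokens = rng.normal(size=(3, d))   # mask-covered region patches

# Each region's tokens are contextualized by the other region,
# so occluded and non-occluded parts are weighted jointly.
eye_ctx, w_eye = cross_attention(eye_tokens, mask_tokens, mask_tokens)
mask_ctx, w_mask = cross_attention(mask_tokens, eye_tokens, eye_tokens)

print(eye_ctx.shape, mask_ctx.shape)  # (4, 8) (3, 8)
```

Because the softmax rows form a convex combination over the other region's tokens, the attention weights act as a learned, automatic reweighting between the two facial regions.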


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c98e/9645067/27d3cc082622/gr1_lrg.jpg
