基于频域神经网络的面部表情识别

Facial Expression Recognition Using Frequency Neural Network.

出版信息

IEEE Trans Image Process. 2021;30:444-457. doi: 10.1109/TIP.2020.3037467. Epub 2020 Nov 24.

DOI:10.1109/TIP.2020.3037467

PMID:33201812

Abstract

Facial expression recognition has become a newly-emerging topic in recent decades, which has important value in the field of human-computer interaction. In this paper, we present a deep learning based approach, named frequency neural network (FreNet), for facial expression recognition. Different from convolutional neural network in spatial domain, FreNet inherits the advantages of processing image in frequency domain, such as efficient computation and spatial redundancy elimination. First, we propose the learnable multiplication kernel and construct multiple multiplication layers to learn features in frequency domain. Second, a summarization layer is proposed following multiplication layers to further yield high-level features. Third, based on the property of discrete cosine transform (DCT), we utilize multiplication layers and summarization layer to construct the Basic-FreNet, which can yield high-level features on the widely used DCT feature. Finally, to further achieve better performance on Basic-FreNet, we propose the Block-FreNet in which the weight-shared multiplication kernel is designed for feature learning and the block sub-sampling is designed for dimension reduction. The experimental results show that the Block-FreNet not only achieves superior performance, but also greatly reduces the computational cost. To our best knowledge, the proposed approach is the first attempt to fill in the blank of frequency based deep learning model for facial expression recognition.

摘要

面部表情识别是近几十年来新兴的研究课题，在人机交互领域具有重要的应用价值。本文提出了一种基于深度学习的方法，称为频域神经网络（FreNet），用于面部表情识别。与在空间域的卷积神经网络不同，FreNet继承了频域处理图像的优势，例如高效计算和空间冗余消除。首先，我们提出了可学习的乘法核，并构建了多个乘法层来学习频域特征。其次，在乘法层之后提出了一个汇总层，以进一步生成高级特征。第三，基于离散余弦变换（DCT）的性质，我们利用乘法层和汇总层构建了基本 FreNet，它可以在广泛使用的 DCT 特征上生成高级特征。最后，为了在基本 FreNet 上进一步获得更好的性能，我们提出了块 FreNet，其中共享权重的乘法核用于特征学习，块子采样用于降维。实验结果表明，块 FreNet 不仅具有优异的性能，而且大大降低了计算成本。据我们所知，该方法是首次尝试填补基于频域的深度学习模型在面部表情识别中的空白。

相似文献

Facial Expression Recognition Using Frequency Neural Network.基于频域神经网络的面部表情识别

IEEE Trans Image Process. 2021;30:444-457. doi: 10.1109/TIP.2020.3037467. Epub 2020 Nov 24.

EAC-Net: Deep Nets with Enhancing and Cropping for Facial Action Unit Detection.EAC-Net：用于面部动作单元检测的增强和裁剪的深度网络。

IEEE Trans Pattern Anal Mach Intell. 2018 Nov;40(11):2583-2596. doi: 10.1109/TPAMI.2018.2791608. Epub 2018 Jan 10.

Dense Residual Network: Enhancing global dense feature flow for character recognition.密集残差网络：增强字符识别的全局密集特征流。

Neural Netw. 2021 Jul;139:77-85. doi: 10.1016/j.neunet.2021.02.005. Epub 2021 Feb 25.

Micro-expression recognition based on multi-scale 3D residual convolutional neural network.基于多尺度 3D 残差卷积神经网络的微表情识别。

Math Biosci Eng. 2024 Mar 1;21(4):5007-5031. doi: 10.3934/mbe.2024221.

Improved optimizer with deep learning model for emotion detection and classification.基于深度学习模型的情感检测与分类优化器。

Math Biosci Eng. 2024 Jul 17;21(7):6631-6657. doi: 10.3934/mbe.2024290.

Wasserstein CNN: Learning Invariant Features for NIR-VIS Face Recognition. Wasserstein CNN：用于近红外-可见光人脸识别的不变特征学习。

IEEE Trans Pattern Anal Mach Intell. 2019 Jul;41(7):1761-1773. doi: 10.1109/TPAMI.2018.2842770. Epub 2018 Jun 1.

Joint Local and Global Information Learning With Single Apex Frame Detection for Micro-Expression Recognition.基于单顶点帧检测的局部和全局信息联合学习的微表情识别。

IEEE Trans Image Process. 2021;30:249-263. doi: 10.1109/TIP.2020.3035042. Epub 2020 Nov 18.

Novel deep neural network based pattern field classification architectures.基于新型深度神经网络的模式场分类架构。

Neural Netw. 2020 Jul;127:82-95. doi: 10.1016/j.neunet.2020.03.011. Epub 2020 Mar 14.

Multilevel and Multiscale Feature Aggregation in Deep Networks for Facial Constitution Classification.深度学习网络中的多层次多尺度特征聚合用于面部特征分类。

Comput Math Methods Med. 2019 Dec 20;2019:1258782. doi: 10.1155/2019/1258782. eCollection 2019.

TriCAFFNet: A Tri-Cross-Attention Transformer with a Multi-Feature Fusion Network for Facial Expression Recognition.TriCAFFNet：一种具有多特征融合网络的三交叉注意力转换器，用于面部表情识别。

Sensors (Basel). 2024 Aug 21;24(16):5391. doi: 10.3390/s24165391.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于频域神经网络的面部表情识别

Facial Expression Recognition Using Frequency Neural Network.

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献