Alhlffee Mahmood H B, Huang Yea-Shuan, Chen Yi-An
College of Computer Science and Electrical Engineering, Chung-Hua University, Hsinchu, Taiwan.
Department of Computer Science and Information Engineering, Chung-Hua University, Hsinchu, Taiwan.
PeerJ Comput Sci. 2022 Feb 16;8:e897. doi: 10.7717/peerj-cs.897. eCollection 2022.
One of the key challenges in facial recognition is multi-view face synthesis from a single face image. Existing generative adversarial network (GAN) deep learning methods have proven effective at facial recognition when combined with pre-processing, post-processing and feature representation techniques that bring the frontal view into the same position in order to achieve highly accurate face identification. However, these methods remain relatively weak at generating high-quality frontal-face image samples under extreme face pose scenarios. The novel two-pathway generative adversarial network (TP-GAN) framework has made commendable progress in face synthesis, perceiving global structure and local details in an unsupervised manner. More importantly, TP-GAN approaches photorealistic frontal-view synthesis by relying on texture details from its landmark detection and synthesis functions, which limits its ability to achieve the desired performance when generating high-quality frontal face images under extreme pose. In this paper, we propose a landmark feature-based method (LFM) for robust pose-invariant facial recognition that aims to improve the resolution quality of the generated frontal faces across a variety of facial poses. We augment the existing TP-GAN global generative pathway with a well-constructed 2D face landmark localization module that cooperates with the local pathway in a landmark-sharing manner, incorporating empirical face pose into the learning process. We also improve the encoder-decoder structure of the global pathway to better represent facial image features, establishing robust feature extractors that select meaningful features, ease the operational workflow toward a balanced learning strategy, and thereby significantly improve the resolution of the photorealistic face images. We verify the effectiveness of the proposed method on both the Multi-PIE and FEI datasets. Quantitative and qualitative experimental results show that our method not only generates high-quality perceptual images under extreme poses but also significantly improves upon the TP-GAN results.
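To make the landmark-sharing, two-pathway idea in the abstract concrete, the following is a minimal PyTorch sketch, not the authors' released implementation: it shows a global encoder-decoder pathway conditioned on 2D landmark heatmaps that are shared with a local patch pathway. The layer sizes, the 68-point landmark convention, the 128x128 input resolution and the concatenation-based fusion are assumptions made only for illustration.

```python
# Hypothetical sketch of a landmark-conditioned two-pathway generator.
# Architecture details (channels, heatmap count, fusion by concatenation)
# are illustrative assumptions, not the paper's exact configuration.
import torch
import torch.nn as nn


class GlobalPathway(nn.Module):
    """Encoder-decoder over the whole face, conditioned on landmark heatmaps."""

    def __init__(self, in_ch=3, lm_ch=68, feat=64):
        super().__init__()
        # Encoder consumes the profile image concatenated with landmark heatmaps.
        self.encoder = nn.Sequential(
            nn.Conv2d(in_ch + lm_ch, feat, 4, stride=2, padding=1), nn.ReLU(True),
            nn.Conv2d(feat, feat * 2, 4, stride=2, padding=1), nn.ReLU(True),
            nn.Conv2d(feat * 2, feat * 4, 4, stride=2, padding=1), nn.ReLU(True),
        )
        # Decoder upsamples the bottleneck back to a frontal-view image.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(feat * 4, feat * 2, 4, stride=2, padding=1), nn.ReLU(True),
            nn.ConvTranspose2d(feat * 2, feat, 4, stride=2, padding=1), nn.ReLU(True),
            nn.ConvTranspose2d(feat, in_ch, 4, stride=2, padding=1), nn.Tanh(),
        )

    def forward(self, img, landmark_heatmaps):
        x = torch.cat([img, landmark_heatmaps], dim=1)
        return self.decoder(self.encoder(x))


class LocalPathway(nn.Module):
    """Refines a landmark-centred patch (e.g., eye, nose or mouth crop)."""

    def __init__(self, in_ch=3, feat=32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, feat, 3, padding=1), nn.ReLU(True),
            nn.Conv2d(feat, in_ch, 3, padding=1), nn.Tanh(),
        )

    def forward(self, patch):
        return self.net(patch)


if __name__ == "__main__":
    # 128x128 profile image plus 68 landmark heatmaps shared with the local pathway.
    img = torch.randn(1, 3, 128, 128)
    heatmaps = torch.randn(1, 68, 128, 128)
    frontal = GlobalPathway()(img, heatmaps)          # (1, 3, 128, 128)
    eye_patch = LocalPathway()(torch.randn(1, 3, 40, 40))  # (1, 3, 40, 40)
    print(frontal.shape, eye_patch.shape)
```

In a full pipeline the two pathway outputs would be fused and trained adversarially with a discriminator plus identity-preserving losses; the sketch above only captures the landmark-sharing conditioning described in the abstract.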