IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):357-370. doi: 10.1109/TPAMI.2018.2876842. Epub 2018 Oct 18.
In this work, we propose a novel model-based deep convolutional autoencoder that addresses the highly challenging problem of reconstructing a 3D human face from a single in-the-wild color image. To this end, we combine a convolutional encoder network with an expert-designed generative model that serves as the decoder. The core innovation is the differentiable parametric decoder that encapsulates image formation analytically based on a generative model. Our decoder takes as input a code vector with exactly defined semantic meaning that encodes detailed face pose, shape, expression, skin reflectance, and scene illumination. Owing to this new way of combining CNN-based and model-based face reconstruction, the CNN-based encoder learns to extract semantically meaningful parameters from a single monocular input image. For the first time, a CNN encoder and an expert-designed generative model can be trained end-to-end in an unsupervised manner, which renders training on very large (unlabeled) real-world datasets feasible. The obtained reconstructions compare favorably to current state-of-the-art approaches in terms of quality and richness of representation. This work is an extended version of [1], in which we additionally present a stochastic vertex sampling technique for faster training of our networks, and we further propose and evaluate analysis-by-synthesis and shape-from-shading refinement approaches to achieve a high-fidelity reconstruction.
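The pipeline described above (a semantic code vector sliced into pose, shape, expression, reflectance, and illumination parameters; a model-based decoder; and stochastic vertex sampling of the training loss) can be sketched in NumPy. This is a minimal illustration under loud assumptions: the dimensions are toy values, the linear bases are random stand-ins for a real 3D morphable face model, and rendering/shading are omitted; it is not the paper's implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy dimensions (assumptions; the paper uses a full 3D morphable face model).
N_VERTS = 500   # vertices in the face mesh
N_POSE = 6      # rotation + translation
N_SHAPE = 80    # shape (identity) coefficients
N_EXPR = 64     # expression coefficients
N_REFL = 80     # skin reflectance coefficients
N_ILLUM = 27    # spherical-harmonics illumination (3 bands x 3 color channels)

# Hypothetical linear bases standing in for the expert-designed generative model.
mean_shape = rng.normal(size=(N_VERTS, 3))
shape_basis = rng.normal(size=(N_VERTS * 3, N_SHAPE)) * 0.01
expr_basis = rng.normal(size=(N_VERTS * 3, N_EXPR)) * 0.01

def split_code(code):
    """Slice the semantic code vector into its named parameter groups."""
    sizes = [N_POSE, N_SHAPE, N_EXPR, N_REFL, N_ILLUM]
    groups, i = [], 0
    for s in sizes:
        groups.append(code[i:i + s])
        i += s
    return groups  # pose, shape, expression, reflectance, illumination

def decode_geometry(alpha, delta):
    """Model-based decoder (geometry part): linear morphable model."""
    offsets = shape_basis @ alpha + expr_basis @ delta
    return mean_shape + offsets.reshape(N_VERTS, 3)

def sampled_vertex_loss(pred, target, n_samples=64):
    """Stochastic vertex sampling: evaluate the reconstruction loss on a
    random vertex subset each iteration to speed up training."""
    idx = rng.choice(N_VERTS, size=n_samples, replace=False)
    return float(np.mean((pred[idx] - target[idx]) ** 2))

# In the full system the code vector is predicted by the CNN encoder;
# here we draw a random one for illustration.
code = rng.normal(size=N_POSE + N_SHAPE + N_EXPR + N_REFL + N_ILLUM)
pose, alpha, delta, beta, gamma = split_code(code)
mesh = decode_geometry(alpha, delta)
loss = sampled_vertex_loss(mesh, mean_shape)
```

Because every operation in the decoder is differentiable with respect to the code vector, gradients of such a loss can flow back through the decoder into the encoder, which is what enables the end-to-end unsupervised training described in the abstract.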