On Learning 3D Face Morphable Model from In-the-Wild Images.

Author information

Tran Luan, Liu Xiaoming

Publication information

IEEE Trans Pattern Anal Mach Intell. 2021 Jan;43(1):157-171. doi: 10.1109/TPAMI.2019.2927975. Epub 2020 Dec 4.

DOI:10.1109/TPAMI.2019.2927975

Abstract

As a classic statistical model of 3D facial shape and albedo, the 3D Morphable Model (3DMM) is widely used in facial analysis, e.g., model fitting and image synthesis. A conventional 3DMM is learned from a set of 3D face scans with associated well-controlled 2D face images, and is represented by two sets of PCA basis functions. Due to the type and amount of training data, as well as the linear bases, the representation power of 3DMM can be limited. To address these problems, this paper proposes an innovative framework to learn a nonlinear 3DMM from a large set of in-the-wild face images, without collecting 3D face scans. Specifically, given a face image as input, a network encoder estimates the projection, lighting, shape and albedo parameters. Two decoders serve as the nonlinear 3DMM to map from the shape and albedo parameters to the 3D shape and albedo, respectively. With the projection parameter, lighting, 3D shape, and albedo, a novel analytically-differentiable rendering layer is designed to reconstruct the original input face. The entire network is end-to-end trainable with only weak supervision. We demonstrate the superior representation power of our nonlinear 3DMM over its linear counterpart, and its contribution to face alignment, 3D reconstruction, and face editing. Source code and additional results can be found at our project page: http://cvlab.cse.msu.edu/project-nonlinear-3dmm.html.
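
The data flow described in the abstract (one encoder, two decoders, one rendering step) can be illustrated with a toy NumPy sketch. This is not the authors' implementation: the layer sizes, the random untrained weights, the affine projection, the ambient-plus-directional lighting, and the use of normalized vertex positions as stand-in normals are all assumptions made only so the example runs. The names `encode`, `render`, and `reconstruct` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for illustration.
N_VERTS = 500                 # vertices in the face mesh
D_SHAPE, D_ALBEDO = 16, 16    # latent shape / albedo parameter sizes
D_PROJ, D_LIGHT = 8, 4        # 2x3 affine map + 2D shift; ambient + 3D light direction
IMG_DIM = 32 * 32 * 3         # flattened toy input image


def mlp(sizes):
    """Create randomly initialised (untrained) weights for a small MLP."""
    return [(rng.normal(0.0, 0.1, (m, n)), np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]


def forward(params, x):
    """Apply the MLP; tanh on hidden layers makes the mapping nonlinear."""
    for i, (W, b) in enumerate(params):
        x = x @ W + b
        if i < len(params) - 1:
            x = np.tanh(x)
    return x


encoder = mlp([IMG_DIM, 64, D_PROJ + D_LIGHT + D_SHAPE + D_ALBEDO])
shape_decoder = mlp([D_SHAPE, 64, N_VERTS * 3])    # shape params -> 3D vertices
albedo_decoder = mlp([D_ALBEDO, 64, N_VERTS * 3])  # albedo params -> per-vertex RGB


def encode(image):
    """Estimate projection, lighting, shape and albedo parameters from an image."""
    code = forward(encoder, image.reshape(-1))
    cuts = [D_PROJ, D_PROJ + D_LIGHT, D_PROJ + D_LIGHT + D_SHAPE]
    return np.split(code, cuts)


def render(proj, light, shape, albedo):
    """Toy stand-in for the rendering layer: project vertices and shade them."""
    M, t = proj[:6].reshape(2, 3), proj[6:]
    xy = shape @ M.T + t                                   # (N, 2) image coordinates
    # Assumption: normalized positions act as normals (a real mesh would use faces).
    normals = shape / (np.linalg.norm(shape, axis=1, keepdims=True) + 1e-8)
    ambient, direction = light[0], light[1:]
    shading = ambient + np.maximum(normals @ direction, 0.0)  # (N,)
    colors = albedo * shading[:, None]                     # (N, 3) shaded colours
    return xy, colors


def reconstruct(image):
    proj, light, f_shape, f_albedo = encode(image)
    shape = forward(shape_decoder, f_shape).reshape(N_VERTS, 3)
    albedo = forward(albedo_decoder, f_albedo).reshape(N_VERTS, 3)
    return render(proj, light, shape, albedo)


image = rng.random((32, 32, 3))
xy, colors = reconstruct(image)
print(xy.shape, colors.shape)
```

In an actual training setup, the rendered output would be compared against the input image and the error backpropagated through the renderer to both decoders and the encoder, which is what "end-to-end trainable" refers to in the abstract. The sketch omits that step and any loss function.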


Similar articles

On Learning 3D Face Morphable Model from In-the-Wild Images.

IEEE Trans Pattern Anal Mach Intell. 2021 Jan;43(1):157-171. doi: 10.1109/TPAMI.2019.2927975. Epub 2020 Dec 4.

Beyond 3DMM: Learning to Capture High-Fidelity 3D Face Shape.

IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):1442-1457. doi: 10.1109/TPAMI.2022.3164131. Epub 2023 Jan 6.

Weighted regularized statistical shape space projection for breast 3D model reconstruction.

Med Image Anal. 2018 Jul;47:164-179. doi: 10.1016/j.media.2018.04.007. Epub 2018 May 2.

Inequality-Constrained 3D Morphable Face Model Fitting.

IEEE Trans Pattern Anal Mach Intell. 2024 Feb;46(2):1305-1318. doi: 10.1109/TPAMI.2023.3334948. Epub 2024 Jan 8.

Inequality-Constrained and Robust 3D Face Model Fitting.

Comput Vis ECCV. 2020;12354:433-449.

A Sparse and Locally Coherent Morphable Face Model for Dense Semantic Correspondence Across Heterogeneous 3D Faces.

IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):6667-6682. doi: 10.1109/TPAMI.2021.3090942. Epub 2022 Sep 14.

Cross-Domain and Disentangled Face Manipulation With 3D Guidance.

IEEE Trans Vis Comput Graph. 2023 Apr;29(4):2053-2066. doi: 10.1109/TVCG.2021.3139913. Epub 2023 Feb 28.

Histogram-Based CRC for 3D-Aided Pose-Invariant Face Recognition.

Sensors (Basel). 2019 Feb 13;19(4):759. doi: 10.3390/s19040759.

Large Scale 3D Morphable Models.

Int J Comput Vis. 2018;126(2):233-254. doi: 10.1007/s11263-017-1009-7. Epub 2017 Apr 8.

Self-supervised Learning of Detailed 3D Face Reconstruction.

IEEE Trans Image Process. 2020 Aug 27;PP. doi: 10.1109/TIP.2020.3017347.

Cited by

Building 3D Generative Models from Minimal Data.

Int J Comput Vis. 2024;132(2):555-580. doi: 10.1007/s11263-023-01870-2. Epub 2023 Sep 13.

Blood Pressure Estimation Based on PPG and ECG Signals Using Knowledge Distillation.

Cardiovasc Eng Technol. 2024 Feb;15(1):39-51. doi: 10.1007/s13239-023-00695-x. Epub 2024 Jan 8.

Inequality-Constrained 3D Morphable Face Model Fitting.

IEEE Trans Pattern Anal Mach Intell. 2024 Feb;46(2):1305-1318. doi: 10.1109/TPAMI.2023.3334948. Epub 2024 Jan 8.

A Lightweight Monocular 3D Face Reconstruction Method Based on Improved 3D Morphing Models.

Sensors (Basel). 2023 Jul 27;23(15):6713. doi: 10.3390/s23156713.

: Texture-Enhanced Deep Face Reconstruction in the Wild.

Sensors (Basel). 2023 Jul 19;23(14):6525. doi: 10.3390/s23146525.

A Preprocessing Manifold Learning Strategy Based on t-Distributed Stochastic Neighbor Embedding.

Entropy (Basel). 2023 Jul 14;25(7):1065. doi: 10.3390/e25071065.

Three Dimensional Shape Reconstruction via Polarization Imaging and Deep Learning.

Sensors (Basel). 2023 May 9;23(10):4592. doi: 10.3390/s23104592.

Adaptive 3D Model-Based Facial Expression Synthesis and Pose Frontalization.

Sensors (Basel). 2020 May 1;20(9):2578. doi: 10.3390/s20092578.

Three-Dimensional Face Reconstruction Using Multi-View-Based Bilinear Model.

Sensors (Basel). 2019 Jan 23;19(3):459. doi: 10.3390/s19030459.
