Tran Luan, Liu Xiaoming
IEEE Trans Pattern Anal Mach Intell. 2021 Jan;43(1):157-171. doi: 10.1109/TPAMI.2019.2927975. Epub 2020 Dec 4.
As a classic statistical model of 3D facial shape and albedo, 3D Morphable Model (3DMM) is widely used in facial analysis, e.g., model fitting, image synthesis. Conventional 3DMM is learned from a set of 3D face scans with associated well-controlled 2D face images, and represented by two sets of PCA basis functions. Due to the type and amount of training data, as well as, the linear bases, the representation power of 3DMM can be limited. To address these problems, this paper proposes an innovative framework to learn a nonlinear 3DMM model from a large set of in-the-wild face images, without collecting 3D face scans. Specifically, given a face image as input, a network encoder estimates the projection, lighting, shape and albedo parameters. Two decoders serve as the nonlinear 3DMM to map from the shape and albedo parameters to the 3D shape and albedo, respectively. With the projection parameter, lighting, 3D shape, and albedo, a novel analytically-differentiable rendering layer is designed to reconstruct the original input face. The entire network is end-to-end trainable with only weak supervision. We demonstrate the superior representation power of our nonlinear 3DMM over its linear counterpart, and its contribution to face alignment, 3D reconstruction, and face editing. Source code and additional results can be found at our project page: http://cvlab.cse.msu.edu/project-nonlinear-3dmm.html.
作为一种经典的三维面部形状和反照率统计模型,三维可变形模型(3DMM)在面部分析中得到了广泛应用,例如模型拟合、图像合成。传统的3DMM是从一组带有相关的、控制良好的二维面部图像的三维面部扫描数据中学习得到的,并用两组主成分分析(PCA)基函数来表示。由于训练数据的类型和数量以及线性基的原因,3DMM的表示能力可能会受到限制。为了解决这些问题,本文提出了一种创新框架,可从大量自然场景面部图像中学习非线性3DMM模型,而无需收集三维面部扫描数据。具体来说,给定一张面部图像作为输入,一个网络编码器估计投影、光照、形状和反照率参数。两个解码器作为非线性3DMM,分别从形状和反照率参数映射到三维形状和反照率。利用投影参数、光照、三维形状和反照率,设计了一个新颖的可解析微分渲染层来重建原始输入面部。整个网络在仅弱监督的情况下是端到端可训练的。我们展示了我们的非线性3DMM相对于其线性对应模型具有更强的表示能力,以及它在面部对齐、三维重建和面部编辑方面的贡献。源代码和其他结果可在我们的项目页面找到:http://cvlab.cse.msu.edu/project-nonlinear-3dmm.html