IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9269-9284. doi: 10.1109/TPAMI.2021.3125598. Epub 2022 Nov 7.
Over the last years, with the advent of Generative Adversarial Networks (GANs), many face analysis tasks have accomplished astounding performance, with applications including, but not limited to, face generation and 3D face reconstruction from a single "in-the-wild" image. Nevertheless, to the best of our knowledge, there is no method which can produce render-ready high-resolution 3D faces from "in-the-wild" images and this can be attributed to the: (a) scarcity of available data for training, and (b) lack of robust methodologies that can successfully be applied on very high-resolution data. In this paper, we introduce the first method that is able to reconstruct photorealistic render-ready 3D facial geometry and BRDF from a single "in-the-wild" image. To achieve this, we capture a large dataset of facial shape and reflectance, which we have made public. Moreover, we define a fast and photorealistic differentiable rendering methodology with accurate facial skin diffuse and specular reflection, self-occlusion and subsurface scattering approximation. With this, we train a network that disentangles the facial diffuse and specular reflectance components from a mesh and texture with baked illumination, scanned or reconstructed with a 3DMM fitting method. As we demonstrate in a series of qualitative and quantitative experiments, our method outperforms the existing arts by a significant margin and reconstructs authentic, 4K by 6K-resolution 3D faces from a single low-resolution image, that are ready to be rendered in various applications and bridge the uncanny valley.
在过去的几年中,随着生成对抗网络(GAN)的出现,许多面部分析任务都取得了惊人的性能,其应用包括但不限于人脸生成和从单张“野外”图像重建 3D 人脸。然而,据我们所知,还没有一种方法可以从“野外”图像生成可渲染的高分辨率 3D 人脸,这可以归因于:(a)用于训练的可用数据稀缺,以及(b)缺乏可以成功应用于超高分辨率数据的稳健方法。在本文中,我们介绍了第一种能够从单张“野外”图像重建逼真的可渲染 3D 面部几何形状和 BRDF 的方法。为了实现这一目标,我们采集了大量的面部形状和反射率数据集,并将其公开。此外,我们定义了一种快速且逼真的可微分渲染方法,具有准确的面部皮肤漫反射和镜面反射、自遮挡和次表面散射逼近。通过这种方法,我们训练了一个网络,该网络可以从带有烘焙照明的网格和纹理中分离出面部漫反射和镜面反射分量,这些网格和纹理是使用 3DMM 拟合方法扫描或重建的。正如我们在一系列定性和定量实验中所展示的,我们的方法在很大程度上优于现有技术,并且可以从单个低分辨率图像重建真实的、4K 乘 6K 分辨率的 3D 人脸,这些人脸已经准备好在各种应用中进行渲染,并弥合了恐怖谷。