Fei Fan, Cheng Yean, Zhu Yongjie, Zheng Qian, Li Si, Pan Gang, Shi Boxin
IEEE Trans Pattern Anal Mach Intell. 2024 Feb;46(2):1079-1092. doi: 10.1109/TPAMI.2023.3328453. Epub 2024 Jan 8.
This paper proposes a novel pipeline to estimate a non-parametric environment map with high dynamic range from a single human face image. Lighting-independent and -dependent intrinsic images of the face are first estimated separately in a cascaded network. The influence of face geometry on the two lighting-dependent intrinsics, diffuse shading and specular reflection, are further eliminated by distributing the intrinsics pixel-wise onto spherical representations using the surface normal as indices. This results in two representations simulating images of a diffuse sphere and a glossy sphere under the input scene lighting. Taking into account the distinctive nature of light sources and ambient terms, we further introduce a two-stage lighting estimator to predict both accurate and realistic lighting from these two representations. Our model is trained supervisedly on a large-scale and high-quality synthetic face image dataset. We demonstrate that our method allows accurate and detailed lighting estimation and intrinsic decomposition, outperforming state-of-the-art methods both qualitatively and quantitatively on real face images.
本文提出了一种新颖的流程,用于从单张人脸图像估计具有高动态范围的非参数环境地图。首先在级联网络中分别估计人脸的光照无关和光照相关本征图像。通过使用表面法线作为索引,将本征图像逐像素地分布到球面表示上,进一步消除了人脸几何形状对两种光照相关本征(漫反射阴影和镜面反射)的影响。这产生了两种表示,模拟了输入场景光照下的漫反射球体和光泽球体的图像。考虑到光源和环境项的独特性质,我们进一步引入了一个两阶段光照估计器,从这两种表示中预测准确且逼真的光照。我们的模型在大规模高质量合成人脸图像数据集上进行有监督训练。我们证明,我们的方法能够进行准确且详细的光照估计和本征分解,在真实人脸图像上的定性和定量表现均优于现有方法。