Xiang Xue-Kun, Yuan Yu-Jie, Hu Wen-Bo, Liu Yu-Tao, Ma Yue-Wen, Gao Lin
IEEE Trans Vis Comput Graph. 2025 Oct;31(10):9213-9224. doi: 10.1109/TVCG.2025.3590394.
3D reconstruction from multi-view images is a long-standing problem in computer graphic. Neural 3D reconstruction, especially NeuS and its variants, has improved reconstruction quality compared to traditional methods. However, it is still a challenge for these methods to reconstruct fine-grained geometric details since the spherical harmonic positional encoding lacks the ability to express high-frequency signals. In this paper, we propose a multi-resolution tri-plane feature encoding that leverages the detail reconstruction capabilities of high-resolution tri-plane while using the smoothness of low-resolution tri-plane to suppress high-frequency artifacts. Additionally, a progressive training strategy is introduced, gradually merging scene details from coarse to fine granularity, enhancing reconstruction quality while maintaining training stability and reducing difficulty. Furthermore, to address reconstruction challenges arising from sparse viewpoints and inconsistent lighting in image datasets, we introduce normal priors as supervision and propose consistency verification for multi-view normal priors, which assesses the accuracy of normal priors and effectively supervise the reconstructed surfaces. Moreover, we propose a perturbing and fine-tuning strategy on regions of unreliable normal priors to further improve the quality of geometric surface reconstruction.
从多视图图像进行3D重建是计算机图形学中一个长期存在的问题。神经3D重建,特别是NeuS及其变体,与传统方法相比提高了重建质量。然而,由于球谐位置编码缺乏表达高频信号的能力,这些方法重建细粒度几何细节仍然是一个挑战。在本文中,我们提出了一种多分辨率三平面特征编码,它利用高分辨率三平面的细节重建能力,同时利用低分辨率三平面的平滑性来抑制高频伪影。此外,引入了一种渐进训练策略,从粗粒度到细粒度逐步合并场景细节,在保持训练稳定性和降低难度的同时提高重建质量。此外,为了解决图像数据集中稀疏视点和不一致光照带来的重建挑战,我们引入法线先验作为监督,并提出多视图法线先验的一致性验证,以评估法线先验的准确性并有效地监督重建表面。此外,我们针对不可靠法线先验的区域提出了一种扰动和微调策略,以进一步提高几何表面重建的质量。