Liu Zheng-Ning, Cao Yan-Pei, Kuang Zheng-Fei, Kobbelt Leif, Hu Shi-Min
IEEE Trans Vis Comput Graph. 2021 Jan;27(1):83-97. doi: 10.1109/TVCG.2019.2937300. Epub 2020 Nov 24.
We present a learning-based approach to reconstructing high-resolution three-dimensional (3D) shapes with detailed geometry and high-fidelity textures. Albeit extensively studied, algorithms for 3D reconstruction from multi-view depth-and-color (RGB-D) scans are still prone to measurement noise and occlusions; limited scanning or capturing angles also often lead to incomplete reconstructions. Propelled by recent advances in 3D deep learning techniques, in this paper, we introduce a novel computation- and memory-efficient cascaded 3D convolutional network architecture, which learns to reconstruct implicit surface representations as well as the corresponding color information from noisy and imperfect RGB-D maps. The proposed 3D neural network performs reconstruction in a progressive and coarse-to-fine manner, achieving unprecedented output resolution and fidelity. Meanwhile, an algorithm for end-to-end training of the proposed cascaded structure is developed. We further introduce Human10, a newly created dataset containing both detailed and textured full-body reconstructions as well as corresponding raw RGB-D scans of 10 subjects. Qualitative and quantitative experimental results on both synthetic and real-world datasets demonstrate that the presented approach outperforms existing state-of-the-art work regarding visual quality and accuracy of reconstructed models.
我们提出了一种基于学习的方法,用于重建具有精细几何形状和高保真纹理的高分辨率三维(3D)形状。尽管对从多视图深度和颜色(RGB-D)扫描进行3D重建的算法进行了广泛研究,但这些算法仍然容易受到测量噪声和遮挡的影响;有限的扫描或捕获角度也常常导致重建不完整。受3D深度学习技术最新进展的推动,在本文中,我们引入了一种新颖的计算和内存高效的级联3D卷积网络架构,该架构学习从噪声和不完美的RGB-D地图中重建隐式表面表示以及相应的颜色信息。所提出的3D神经网络以渐进和粗到细的方式进行重建,实现了前所未有的输出分辨率和保真度。同时,开发了一种用于所提出的级联结构的端到端训练算法。我们进一步引入了Human10,这是一个新创建的数据集,包含10名受试者的详细和纹理化全身重建以及相应的原始RGB-D扫描。在合成数据集和真实世界数据集上的定性和定量实验结果表明,所提出的方法在重建模型的视觉质量和准确性方面优于现有的最先进工作。