IEEE Trans Vis Comput Graph. 2022 Jul;28(7):2668-2681. doi: 10.1109/TVCG.2020.3036868. Epub 2022 May 26.
We present a new framework for online dense 3D reconstruction of indoor scenes by using only depth sequences. This research is particularly useful in cases with a poor light condition or in a nearly featureless indoor environment. The lack of RGB information makes long-range camera pose estimation difficult in a large indoor environment. The key idea of our research is to take advantage of the geometric prior of Manhattan scenes in each stage of the reconstruction pipeline with the specific aim to reduce the cumulative registration error and overall odometry drift in a long sequence. This idea is further boosted by local Manhattan frame growing and the local-to-global strategy that leads to implicit loop closure handling for a large indoor scene. Our proposed pipeline, namely ManhattanFusion, starts with planar alignment and local pose optimization where the Manhattan constraints are imposed to create detailed local segments. These segments preserve intrinsic scene geometry by minimizing the odometry drift even under complex and long trajectories. The final model is generated by integrating all local segments into a global volumetric representation under the constraint of Manhattan frame-based registration across segments. Our algorithm outperforms others that use depth data only in terms of both the mean distance error and the absolute trajectory error, and it is also very competitive compared with RGB-D based reconstruction algorithms. Moreover, our algorithm outperforms the state-of-the-art in terms of the surface area coverage by 10-40 percent, largely due to the usefulness and effectiveness of the Manhattan assumption through the reconstruction pipeline.
我们提出了一种新的框架,用于仅使用深度序列在线密集重建室内场景的三维模型。这项研究在光照条件差或室内环境几乎没有特征的情况下特别有用。由于缺乏 RGB 信息,在大型室内环境中,远距离相机位姿估计变得困难。我们研究的关键思想是在重建管道的每个阶段利用曼哈顿场景的几何先验,目的是减少长序列中的累积注册误差和整体里程计漂移。这个想法通过局部曼哈顿帧增长和局部到全局的策略得到进一步增强,从而导致大室内场景的隐式闭环处理。我们提出的管道,即 ManhattanFusion,从平面对齐和局部姿态优化开始,在这些阶段施加曼哈顿约束来创建详细的局部段。这些段通过最小化里程计漂移来保留固有场景几何,即使在复杂和长轨迹下也是如此。最后,通过在段之间基于曼哈顿框架的注册约束下将所有局部段集成到全局体积表示中,生成最终模型。我们的算法在平均距离误差和绝对轨迹误差方面都优于仅使用深度数据的其他算法,与基于 RGB-D 的重建算法相比也非常有竞争力。此外,我们的算法在表面积覆盖率方面比最先进的算法高出 10-40%,这主要是由于通过重建管道的曼哈顿假设的有用性和有效性。