IEEE Trans Pattern Anal Mach Intell. 2017 Sep;39(9):1730-1743. doi: 10.1109/TPAMI.2016.2613051. Epub 2016 Sep 23.
Both image segmentation and dense 3D modeling from images represent an intrinsically ill-posed problem. Strong regularizers are therefore required to constrain the solutions from being 'too noisy'. These priors generally yield overly smooth reconstructions and/or segmentations in certain regions while they fail to constrain the solution sufficiently in other areas. In this paper, we argue that image segmentation and dense 3D reconstruction contribute valuable information to each other's task. As a consequence, we propose a mathematical framework to formulate and solve a joint segmentation and dense reconstruction problem. On the one hand knowing about the semantic class of the geometry provides information about the likelihood of the surface direction. On the other hand the surface direction provides information about the likelihood of the semantic class. Experimental results on several data sets highlight the advantages of our joint formulation. We show how weakly observed surfaces are reconstructed more faithfully compared to a geometry only reconstruction. Thanks to the volumetric nature of our formulation we also infer surfaces which cannot be directly observed for example the surface between the ground and a building. Finally, our method returns a semantic segmentation which is consistent across the whole dataset.
图像分割和密集的三维建模都代表了一个本质上不适定的问题。因此,需要强大的正则化器来约束解不“过于嘈杂”。这些先验通常会导致在某些区域产生过于平滑的重建和/或分割,而在其他区域则无法充分约束解。在本文中,我们认为图像分割和密集的三维重建可以相互提供有价值的信息。因此,我们提出了一个数学框架来公式化和解决联合分割和密集重建问题。一方面,了解几何形状的语义类提供了有关表面方向可能性的信息。另一方面,表面方向提供了有关语义类可能性的信息。在几个数据集上的实验结果突出了我们联合公式的优势。我们展示了与仅基于几何的重建相比,如何更忠实地重建观察到的弱表面。由于我们的公式具有体积性质,我们还可以推断出无法直接观察到的表面,例如地面和建筑物之间的表面。最后,我们的方法返回了整个数据集一致的语义分割。