利用单目图像序列的高分辨率重建进行 3-D 模型定位。

3-D model localization using high-resolution reconstruction of monocular image sequences.

机构信息

Service E/ETS/V, Aerospatiale, Magny-les-Hameaux.

出版信息

IEEE Trans Image Process. 1997;6(1):175-88. doi: 10.1109/83.552105.

Abstract

In this paper, we present a complete system for the recognition and localization of a three-dimensional (3-D) model from a sequence of monocular images with known motion. The originality of this system is twofold. First, it uses a purely 3-D approach, starting from the 3-D reconstruction of the scene and ending by the 3-D matching of the model. Second, unlike most monocular systems, we do not use token tracking to match successive images. Rather, subpixel contour matching is used to recover more precisely complete 3-D contours. In contrast with the token tracking approaches, which yield a representation of the 3-D scene based on disconnected segments or points, this approach provides us with a denser and higher level representation of the scene. The reconstructed contours are fused along successive images using a simple result derived from the Kalman filter theory. The fusion process increases the localization precision and the robustness of the 3-D reconstruction. Finally, corners are extracted from the 3-D contours. They are used to generate hypotheses of the model position, using a hypothesize-and-verify algorithm that is described in detail. This algorithm yields a robust recognition and precise localization of the model in the scene. Results are presented on infrared image sequences with different resolutions, demonstrating the precision of the localization as well as the robustness and the low computational complexity of the algorithms.

摘要

在本文中，我们提出了一个完整的系统，用于从具有已知运动的单目图像序列中识别和定位三维（3-D）模型。该系统的创新性有两点。首先，它采用纯粹的 3-D 方法，从场景的 3-D 重建开始，最终通过模型的 3-D 匹配结束。其次，与大多数单目系统不同，我们不使用标记跟踪来匹配连续的图像。相反，我们使用亚像素轮廓匹配来更精确地恢复完整的 3-D 轮廓。与基于不连续片段或点的标记跟踪方法相比，这种方法为我们提供了场景的更密集和更高层次的表示。使用从卡尔曼滤波器理论得出的简单结果，沿着连续的图像融合重建的轮廓。融合过程提高了 3-D 重建的定位精度和鲁棒性。最后，从 3-D 轮廓中提取角点。使用假设验证算法生成模型位置的假设，该算法详细描述。该算法实现了模型在场景中的鲁棒识别和精确定位。不同分辨率的红外图像序列的结果表明了定位的精度以及算法的鲁棒性和低计算复杂度。