IEEE Trans Pattern Anal Mach Intell. 2018 Jun;40(6):1465-1479. doi: 10.1109/TPAMI.2017.2708711. Epub 2017 May 26.
We present an algorithm for estimating the pose of a rigid object in real-time under challenging conditions. Our method effectively handles poorly textured objects in cluttered, changing environments, even when their appearance is corrupted by large occlusions, and it relies on grayscale images to handle metallic environments on which depth cameras would fail. As a result, our method is suitable for practical Augmented Reality applications including industrial environments. At the core of our approach is a novel representation for the 3D pose of object parts: We predict the 3D pose of each part in the form of the 2D projections of a few control points. The advantages of this representation is three-fold: We can predict the 3D pose of the object even when only one part is visible; when several parts are visible, we can easily combine them to compute a better pose of the object; the 3D pose we obtain is usually very accurate, even when only few parts are visible. We show how to use this representation in a robust 3D tracking framework. In addition to extensive comparisons with the state-of-the-art, we demonstrate our method on a practical Augmented Reality application for maintenance assistance in the ATLAS particle detector at CERN.
我们提出了一种在具有挑战性的条件下实时估计刚体物体姿态的算法。我们的方法能够有效地处理在杂乱、变化的环境中纹理较差的物体,即使它们的外观被大遮挡物损坏,并且它依赖于灰度图像来处理深度相机无法处理的金属环境。因此,我们的方法适用于包括工业环境在内的实际增强现实应用。我们方法的核心是一种用于物体部分 3D 姿态的新表示形式:我们以几个控制点的二维投影的形式预测每个部分的 3D 姿态。这种表示形式的优点有三:即使只有一个部分可见,我们也可以预测物体的 3D 姿态;当几个部分可见时,我们可以轻松地将它们组合起来计算物体的更好姿态;即使只有很少的部分可见,我们获得的 3D 姿态通常也非常准确。我们展示了如何在稳健的 3D 跟踪框架中使用这种表示形式。除了与最先进的方法进行广泛比较外,我们还在 ATLAS 粒子探测器在 CERN 的维护辅助的实际增强现实应用中展示了我们的方法。