Nicolescu Mircea, Medioni Gérard
Department of Computer Science, University of Nevada, 1664 N. Virginia St., Reno, NV 89557, USA.
IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):739-52. doi: 10.1109/TPAMI.2005.91.
Most approaches for motion analysis and interpretation rely on restrictive parametric models and involve iterative methods which depend heavily on initial conditions and are subject to instability. Further difficulties are encountered in image regions where motion is not smooth-typically around motion boundaries. This work addresses the problem of visual motion analysis and interpretation by formulating it as an inference of motion layers from a noisy and possibly sparse point set in a 4D space. The core of the method is based on a layered 4D representation of data and a voting scheme for affinity propagation. The inherent problem caused by the ambiguity of 2D to 3D interpretation is usually handled by adding additional constraints, such as rigidity. However, enforcing such a global constraint has been problematic in the combined presence of noise and multiple independent motions. By decoupling the processes of matching, outlier rejection, segmentation, and interpretation, we extract accurate motion layers based on the smoothness of image motion, then locally enforce rigidity for each layer in order to infer its 3D structure and motion. The proposed framework is noniterative and consistently handles both smooth moving regions and motion discontinuities without using any prior knowledge of the motion model.
大多数运动分析与解释方法依赖于严格的参数模型,且涉及迭代方法,这些方法严重依赖初始条件,还容易出现不稳定性。在运动不平稳的图像区域(通常是运动边界周围)会遇到更多困难。这项工作通过将视觉运动分析与解释问题表述为从4D空间中一个有噪声且可能稀疏的点集推断运动层,来解决该问题。该方法的核心基于数据的分层4D表示和用于亲和传播的投票方案。由2D到3D解释的模糊性所导致的固有问题通常通过添加额外约束(如刚性)来处理。然而,在噪声和多个独立运动同时存在的情况下,强制实施这样的全局约束一直存在问题。通过将匹配、离群值剔除、分割和解释过程解耦,我们基于图像运动的平滑性提取准确的运动层,然后对每层局部强制实施刚性,以推断其3D结构和运动。所提出的框架是非迭代的,并且在不使用任何运动模型先验知识的情况下,能够一致地处理平滑运动区域和运动不连续性。