Moreno-Noguer Francesc, Sanfeliu Alberto, Samaras Dimitris
Computer Vision Laboratory, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland.
IEEE Trans Pattern Anal Mach Intell. 2008 Apr;30(4):670-85. doi: 10.1109/TPAMI.2007.70727.
We propose a new technique for fusing multiple cues to robustly segment an object from its background in video sequences that suffer from abrupt changes of both illumination and position of the target. Robustness is achieved by the integration of appearance and geometric object features and by their estimation using Bayesian filters, such as Kalman or particle filters. In particular, each filter estimates the state of a specific object feature, conditionally dependent on another feature estimated by a distinct filter. This dependence provides improved target representations, permitting to segment it out from the background even in non-stationary sequences. Considering that the procedure of the Bayesian filters may be described by a "hypotheses generation--hypotheses correction" strategy, the major novelty of our methodology compared to previous approaches is that the mutual dependence between filters is considered during the feature observation, i.e, into the "hypotheses correction" stage,instead of considering it when generating the hypotheses. This proves to be much more effective in terms of accuracy and reliability. The proposed method is analytically justified and applied to develop a robust tracking system that adapts online and simultaneously the color space where the image points are represented, the color distributions, the contour of the object and its bounding box. Results with synthetic data and real video sequences demonstrate the robustness and versatility of our method.
我们提出了一种新技术,用于融合多个线索,以便在遭受目标光照和位置突然变化的视频序列中,从背景中稳健地分割出物体。通过整合外观和几何物体特征,并使用贝叶斯滤波器(如卡尔曼滤波器或粒子滤波器)对其进行估计,从而实现稳健性。具体而言,每个滤波器估计特定物体特征的状态,该状态有条件地依赖于由另一个不同滤波器估计的另一个特征。这种依赖性提供了改进的目标表示,即使在非平稳序列中也能将其从背景中分割出来。考虑到贝叶斯滤波器的过程可以用“假设生成——假设校正”策略来描述,与先前方法相比,我们方法的主要新颖之处在于,在特征观察期间,即在“假设校正”阶段考虑滤波器之间的相互依赖性,而不是在生成假设时考虑它。这在准确性和可靠性方面被证明要有效得多。所提出的方法经过分析论证,并应用于开发一个稳健的跟踪系统,该系统可以在线同时适应图像点所表示的颜色空间、颜色分布、物体的轮廓及其边界框。合成数据和真实视频序列的结果证明了我们方法的稳健性和通用性。