Xiao Jiangjian, Shah Mubarak
School of Computer Science, University of Central Florida, Orlando, FL 32816, USA.
IEEE Trans Pattern Anal Mach Intell. 2005 Oct;27(10):1644-59. doi: 10.1109/TPAMI.2005.202.
Extracting layers from video is important for video representation, analysis, compression, and synthesis. Assuming that a scene can be approximately described by multiple planar regions, this paper describes a robust and novel approach to automatically extract the set of affine or projective transformations induced by these regions, detect occluded pixels over multiple consecutive frames, and segment the scene into several motion layers. First, after determining a number of seed regions from correspondences between two frames, we expand the seed regions and reject outliers using the graph cuts method integrated with a level set representation. Second, these initial regions are merged into several initial layers according to motion similarity. Third, an occlusion order constraint over multiple frames is introduced; it enforces that the occluded area increases with temporal order over a short period, and it effectively maintains segmentation consistency across consecutive frames. Finally, the correct layer segmentation is obtained with a graph cuts algorithm, and the occlusions between overlapping layers are explicitly determined. Several experimental results demonstrate that the approach is effective and robust.
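As a rough illustration of two ideas named in the abstract, the sketch below fits an affine transformation to point correspondences of one region and then groups regions whose motions are similar. It is not the authors' implementation: the graph cuts expansion, the level set representation, and the occlusion order constraint are all omitted. The least-squares fit and the greedy grouping rule are stand-ins chosen for brevity, and every name here (`fit_affine`, `motion_distance`, `merge_regions`, the tolerance `tol`) is hypothetical.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]
# Affine parameters (a, b, tx, c, d, ty): x' = a*x + b*y + tx, y' = c*x + d*y + ty
Affine = Tuple[float, float, float, float, float, float]


def solve3(m: List[List[float]], rhs: List[float]) -> List[float]:
    """Solve a 3x3 linear system by Gauss-Jordan elimination with partial pivoting."""
    a = [row[:] + [r] for row, r in zip(m, rhs)]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            raise ValueError("degenerate correspondences (collinear points?)")
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(3):
            if r != col:
                f = a[r][col] / a[col][col]
                a[r] = [x - f * y for x, y in zip(a[r], a[col])]
    return [a[i][3] / a[i][i] for i in range(3)]


def fit_affine(src: Sequence[Point], dst: Sequence[Point]) -> Affine:
    """Least-squares affine map from matched points src[i] -> dst[i]."""
    if len(src) != len(dst) or len(src) < 3:
        raise ValueError("need at least 3 matched point pairs")
    # Both output coordinates share the same 3x3 normal-equation matrix.
    rows = [(x, y, 1.0) for x, y in src]
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    atx = [sum(r[i] * p[0] for r, p in zip(rows, dst)) for i in range(3)]
    aty = [sum(r[i] * p[1] for r, p in zip(rows, dst)) for i in range(3)]
    a, b, tx = solve3(ata, atx)
    c, d, ty = solve3(ata, aty)
    return (a, b, tx, c, d, ty)


def apply_affine(t: Affine, p: Point) -> Point:
    """Map a point through the affine transformation t."""
    a, b, tx, c, d, ty = t
    return (a * p[0] + b * p[1] + tx, c * p[0] + d * p[1] + ty)


def motion_distance(t1: Affine, t2: Affine, samples: Sequence[Point]) -> float:
    """Mean distance between where two motions send the same sample points."""
    total = 0.0
    for p in samples:
        q1, q2 = apply_affine(t1, p), apply_affine(t2, p)
        total += ((q1[0] - q2[0]) ** 2 + (q1[1] - q2[1]) ** 2) ** 0.5
    return total / len(samples)


def merge_regions(motions: Sequence[Affine], samples: Sequence[Point],
                  tol: float = 1.0) -> List[List[int]]:
    """Greedy grouping (an assumption, not the paper's rule): a region joins the
    first layer whose first member's motion is within tol pixels on average."""
    layers: List[List[int]] = []
    for i, t in enumerate(motions):
        for layer in layers:
            if motion_distance(motions[layer[0]], t, samples) <= tol:
                layer.append(i)
                break
        else:
            layers.append([i])
    return layers
```

With three regions whose motions are translations by (2, 3), (2.2, 3.1), and (-5, 0), `merge_regions` places the first two in one layer and the third in its own, since only the first pair differs by less than the one-pixel tolerance.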