Giraldo Jhony H, Javed Sajid, Bouwmans Thierry
IEEE Trans Pattern Anal Mach Intell. 2022 May;44(5):2485-2503. doi: 10.1109/TPAMI.2020.3042093. Epub 2022 Apr 1.
Moving Object Segmentation (MOS) is a fundamental task in computer vision. Due to undesirable variations in the background scene, MOS becomes very challenging for static and moving camera sequences. Several deep learning methods have been proposed for MOS with impressive performance. However, these methods show performance degradation in the presence of unseen videos; and usually, deep learning models require large amounts of data to avoid overfitting. Recently, graph learning has attracted significant attention in many computer vision applications since they provide tools to exploit the geometrical structure of data. In this work, concepts of graph signal processing are introduced for MOS. First, we propose a new algorithm that is composed of segmentation, background initialization, graph construction, unseen sampling, and a semi-supervised learning method inspired by the theory of recovery of graph signals. Second, theoretical developments are introduced, showing one bound for the sample complexity in semi-supervised learning, and two bounds for the condition number of the Sobolev norm. Our algorithm has the advantage of requiring less labeled data than deep learning methods while having competitive results on both static and moving camera videos. Our algorithm is also adapted for Video Object Segmentation (VOS) tasks and is evaluated on six publicly available datasets outperforming several state-of-the-art methods in challenging conditions.
运动目标分割(MOS)是计算机视觉中的一项基础任务。由于背景场景中存在不良变化,MOS对于静态和动态相机序列而言都极具挑战性。针对MOS已经提出了几种深度学习方法,其性能令人印象深刻。然而,这些方法在面对未见视频时会出现性能下降的情况;而且通常来说,深度学习模型需要大量数据以避免过拟合。近来,图学习在许多计算机视觉应用中引起了广泛关注,因为它们提供了利用数据几何结构的工具。在这项工作中,我们将图信号处理的概念引入到MOS中。首先,我们提出了一种新算法,该算法由分割、背景初始化、图构建、未见样本采样以及一种受图信号恢复理论启发的半监督学习方法组成。其次,介绍了理论进展,给出了半监督学习中样本复杂度的一个界,以及Sobolev范数条件数的两个界。我们的算法具有比深度学习方法所需标记数据更少的优势,同时在静态和动态相机视频上都取得了具有竞争力的结果。我们的算法还适用于视频目标分割(VOS)任务,并在六个公开可用数据集上进行了评估,在具有挑战性的条件下优于几种当前最先进的方法。