IEEE Trans Image Process. 2018 Oct;27(10):4810-4824. doi: 10.1109/TIP.2018.2845123.
Recovering the background and foreground parts from video frames has important applications in video surveillance. Under the assumption that the background parts are stationary and the foreground are sparse, most of existing methods are based on the framework of robust principal component analysis (RPCA), i.e., modeling the background and foreground parts as a low-rank and sparse matrices, respectively. However, in realistic complex scenarios, the conventional norm sparse regularizer often fails to well characterize the varying sparsity of the foreground components. How to select the sparsity regularizer parameters adaptively according to the local statistics is critical to the success of the RPCA framework for background subtraction task. In this paper, we propose to model the sparse component with a Gaussian scale mixture (GSM) model. Compared with the conventional norm, the GSM-based sparse model has the advantages of jointly estimating the variances of the sparse coefficients (and hence the regularization parameters) and the unknown sparse coefficients, leading to significant estimation accuracy improvements. Moreover, considering that the foreground parts are highly structured, a structured extension of the GSM model is further developed. Specifically, the input frame is divided into many homogeneous regions using superpixel segmentation. By characterizing the set of sparse coefficients in each homogeneous region with the same GSM prior, the local dependencies among the sparse coefficients can be effectively exploited, leading to further improvements for background subtraction. Experimental results on several challenging scenarios show that the proposed method performs much better than most of existing background subtraction methods in terms of both performance and speed.
从视频帧中恢复背景和前景部分在视频监控中有重要的应用。假设背景部分是静止的,前景部分是稀疏的,大多数现有的方法都是基于鲁棒主成分分析(RPCA)的框架,即分别将背景和前景部分建模为低秩和稀疏矩阵。然而,在现实复杂的场景中,传统的范数稀疏正则化器往往无法很好地描述前景分量的变化稀疏性。如何根据局部统计信息自适应地选择稀疏正则化参数对于 RPCA 框架在背景减除任务中的成功至关重要。在本文中,我们提出用高斯尺度混合(GSM)模型来建模稀疏分量。与传统的范数相比,基于 GSM 的稀疏模型具有联合估计稀疏系数(因此也是正则化参数)和未知稀疏系数的方差的优势,从而显著提高了估计精度。此外,考虑到前景部分高度结构化,进一步开发了 GSM 模型的结构扩展。具体来说,使用超像素分割将输入帧分成许多均匀区域。通过用相同的 GSM 先验来描述每个均匀区域中的稀疏系数集,可以有效地利用稀疏系数之间的局部依赖性,从而进一步提高背景减除的性能。在几个具有挑战性的场景上的实验结果表明,与大多数现有的背景减除方法相比,所提出的方法在性能和速度方面都有很大的提高。