IEEE Trans Neural Netw Learn Syst. 2015 Nov;26(11):2874-90. doi: 10.1109/TNNLS.2015.2399233. Epub 2015 Feb 26.
Various sparse-representation-based methods have been proposed to solve tracking problems, and most of them employ least squares (LSs) criteria to learn the sparse representation. In many tracking scenarios, traditional LS-based methods may not perform well owing to the presence of heavy-tailed noise. In this paper, we present a tracking approach using an approximate least absolute deviation (LAD)-based multitask multiview sparse learning method to enjoy robustness of LAD and take advantage of multiple types of visual features, such as intensity, color, and texture. The proposed method is integrated in a particle filter framework, where learning the sparse representation for each view of the single particle is regarded as an individual task. The underlying relationship between tasks across different views and different particles is jointly exploited in a unified robust multitask formulation based on LAD. In addition, to capture the frequently emerging outlier tasks, we decompose the representation matrix to two collaborative components that enable a more robust and accurate approximation. We show that the proposed formulation can be effectively approximated by Nesterov's smoothing method and efficiently solved using the accelerated proximal gradient method. The presented tracker is implemented using four types of features and is tested on numerous synthetic sequences and real-world video sequences, including the CVPR2013 tracking benchmark and ALOV++ data set. Both the qualitative and quantitative results demonstrate the superior performance of the proposed approach compared with several state-of-the-art trackers.
已经提出了各种基于稀疏表示的方法来解决跟踪问题,其中大多数方法采用最小二乘(LS)准则来学习稀疏表示。在许多跟踪场景中,由于存在重尾噪声,传统的基于 LS 的方法可能表现不佳。在本文中,我们提出了一种使用基于近似最小绝对偏差(LAD)的多任务多视图稀疏学习方法的跟踪方法,该方法既具有 LAD 的稳健性,又利用了多种类型的视觉特征,如强度、颜色和纹理。所提出的方法集成在粒子滤波器框架中,其中将每个单粒子的每个视图的稀疏表示学习视为单个任务。基于 LAD 的统一鲁棒多任务公式联合利用了不同视图和不同粒子之间任务之间的内在关系。此外,为了捕获经常出现的异常任务,我们将表示矩阵分解为两个协作分量,从而实现更稳健和准确的逼近。我们表明,所提出的公式可以通过 Nesterov 的平滑方法有效地近似,并使用加速近端梯度方法有效地求解。所提出的跟踪器使用四种类型的特征实现,并在许多合成序列和真实世界视频序列上进行了测试,包括 CVPR2013 跟踪基准和 ALOV++数据集。定性和定量结果都表明,与几种最先进的跟踪器相比,所提出的方法具有更好的性能。