IEEE Trans Image Process. 2018 Apr;27(4):2022-2037. doi: 10.1109/TIP.2017.2777183. Epub 2017 Nov 23.
The use of multiple features has been shown to be an effective strategy for visual tracking because of their complementary contributions to appearance modeling. The key problem is how to learn a fused representation from multiple features for appearance modeling. Different features extracted from the same object should share some commonalities in their representations while each feature should also have some feature-specific representation patterns which reflect its complementarity in appearance modeling. Different from existing multi-feature sparse trackers which only consider the commonalities among the sparsity patterns of multiple features, this paper proposes a novel multiple sparse representation framework for visual tracking which jointly exploits the shared and feature-specific properties of different features by decomposing multiple sparsity patterns. Moreover, we introduce a novel online multiple metric learning to efficiently and adaptively incorporate the appearance proximity constraint, which ensures that the learned commonalities of multiple features are more representative. Experimental results on tracking benchmark videos and other challenging videos demonstrate the effectiveness of the proposed tracker.
使用多种特征已被证明是视觉跟踪的一种有效策略,因为它们对外观建模有互补的贡献。关键问题是如何从多种特征中学习融合表示来进行外观建模。从同一物体提取的不同特征在表示上应该共享一些共性,而每个特征也应该具有一些特征特定的表示模式,反映其在外观建模中的互补性。与现有的仅考虑多个特征的稀疏表示模式之间共性的多特征稀疏跟踪器不同,本文提出了一种新的多稀疏表示框架,通过分解多个稀疏模式来联合利用不同特征的共享和特征特定属性。此外,我们引入了一种新的在线多度量学习方法,以有效地和自适应地合并外观相似性约束,从而确保学习到的多个特征的共性更具代表性。在跟踪基准视频和其他挑战性视频上的实验结果证明了所提出的跟踪器的有效性。