IEEE Trans Image Process. 2016 Oct;25(10):4555-64. doi: 10.1109/TIP.2016.2592701. Epub 2016 Jul 18.
Sparse representation has been successfully applied to visual tracking by finding the best candidate with a minimal reconstruction error using target templates. However, most sparse representation-based tracking methods only consider holistic rather than local appearance to discriminate between target and background regions, and hence may not perform well when target objects are heavily occluded. In this paper, we develop a simple yet robust tracking algorithm based on a coarse and fine structural local sparse appearance model. The proposed method exploits both partial and structural information of a target object based on sparse coding using the dictionary composed of patches from multiple target templates. The likelihood obtained by averaging and pooling operations exploits consistent appearance of object parts, thereby helping not only locate targets accurately but also handle partial occlusion. To update templates more accurately without introducing occluding regions, we introduce an occlusion detection scheme to account for pixels belonging to the target objects. The proposed method is evaluated on a large benchmark data set with three evaluation metrics. Experimental results demonstrate that the proposed tracking algorithm performs favorably against several state-of-the-art methods.
稀疏表示已成功应用于视觉跟踪,通过使用目标模板找到具有最小重建误差的最佳候选者。然而,大多数基于稀疏表示的跟踪方法仅考虑整体而不是局部外观来区分目标和背景区域,因此在目标物体被严重遮挡时可能无法很好地工作。在本文中,我们基于粗精结构局部稀疏外观模型开发了一种简单而鲁棒的跟踪算法。所提出的方法利用基于稀疏编码的字典,该字典由多个目标模板的补丁组成,从而利用目标对象的部分和结构信息。通过平均和池化操作获得的似然度利用了对象部分的一致外观,从而不仅有助于准确地定位目标,而且还可以处理部分遮挡。为了在不引入遮挡区域的情况下更准确地更新模板,我们引入了一种遮挡检测方案来考虑属于目标对象的像素。所提出的方法在具有三个评估指标的大型基准数据集上进行了评估。实验结果表明,所提出的跟踪算法在性能上优于几种最先进的方法。