IEEE Trans Image Process. 2016 Dec;25(12):5743-5756. doi: 10.1109/TIP.2016.2614135. Epub 2016 Sep 27.
Integrating multiple different yet complementary feature representations has been proved to be an effective way for boosting tracking performance. This paper investigates how to perform robust object tracking in challenging scenarios by adaptively incorporating information from grayscale and thermal videos, and proposes a novel collaborative algorithm for online tracking. In particular, an adaptive fusion scheme is proposed based on collaborative sparse representation in Bayesian filtering framework. We jointly optimize sparse codes and the reliable weights of different modalities in an online way. In addition, this paper contributes a comprehensive video benchmark, which includes 50 grayscale-thermal sequences and their ground truth annotations for tracking purpose. The videos are with high diversity and the annotations were finished by one single person to guarantee consistency. Extensive experiments against other state-of-the-art trackers with both grayscale and grayscale-thermal inputs demonstrate the effectiveness of the proposed tracking approach. Through analyzing quantitative results, we also provide basic insights and potential future research directions in grayscale-thermal tracking.
事实证明,整合多种不同但互补的特征表示是提高跟踪性能的有效方法。本文研究如何通过自适应融合来自灰度视频和热成像视频的信息,在具有挑战性的场景中进行鲁棒的目标跟踪,并提出了一种用于在线跟踪的新型协作算法。具体而言,在贝叶斯滤波框架下,基于协作稀疏表示提出了一种自适应融合方案。我们以在线方式联合优化不同模态的稀疏编码和可靠权重。此外,本文还提供了一个全面的视频基准,其中包括50个灰度-热成像序列及其用于跟踪目的的地面真值注释。这些视频具有高度的多样性,并且注释由一人完成以保证一致性。针对其他具有灰度和灰度-热成像输入的先进跟踪器进行的大量实验证明了所提出跟踪方法的有效性。通过分析定量结果,我们还提供了关于灰度-热成像跟踪的基本见解和潜在的未来研究方向。