IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2465-2480. doi: 10.1109/TPAMI.2016.2644963. Epub 2016 Dec 23.
This paper presents a method, called AOGTracker, for simultaneously tracking, learning and parsing (TLP) of unknown objects in video sequences with a hierarchical and compositional And-Or graph (AOG) representation. The TLP method is formulated in the Bayesian framework with a spatial and a temporal dynamic programming (DP) algorithms inferring object bounding boxes on-the-fly. During online learning, the AOG is discriminatively learned using latent SVM [1] to account for appearance (e.g., lighting and partial occlusion) and structural (e.g., different poses and viewpoints) variations of a tracked object, as well as distractors (e.g., similar objects) in background. Three key issues in online inference and learning are addressed: (i) maintaining purity of positive and negative examples collected online, (ii) controling model complexity in latent structure learning, and (iii) identifying critical moments to re-learn the structure of AOG based on its intrackability. The intrackability measures uncertainty of an AOG based on its score maps in a frame. In experiments, our AOGTracker is tested on two popular tracking benchmarks with the same parameter setting: the TB-100/50/CVPR2013 benchmarks , [3] , and the VOT benchmarks [4] -VOT 2013, 2014, 2015 and TIR2015 (thermal imagery tracking). In the former, our AOGTracker outperforms state-of-the-art tracking algorithms including two trackers based on deep convolutional network [5] , [6] . In the latter, our AOGTracker outperforms all other trackers in VOT2013 and is comparable to the state-of-the-art methods in VOT2014, 2015 and TIR2015.
本文提出了一种名为 AOGTracker 的方法,用于使用分层和组合的与或图 (AOG) 表示同时跟踪、学习和解析 (TLP) 视频序列中的未知对象。TLP 方法是在贝叶斯框架中制定的,使用空间和时间动态规划 (DP) 算法实时推断对象边界框。在线学习过程中,使用潜在 SVM 对 AOG 进行有判别力的学习,以解释跟踪对象的外观 (例如,光照和部分遮挡) 和结构 (例如,不同的姿势和视角) 变化,以及背景中的干扰物 (例如,相似的对象)。在线推理和学习中解决了三个关键问题:(i) 在线收集的正例和负例的纯度,(ii) 潜在结构学习中模型复杂性的控制,以及 (iii) 根据其可跟踪性识别重新学习 AOG 结构的关键时刻。可跟踪性根据其在一帧中的得分图来测量 AOG 的不确定性。在实验中,我们的 AOGTracker 在两个具有相同参数设置的流行跟踪基准上进行了测试:TB-100/50/CVPR2013 基准测试集 [3] 和 VOT 基准测试集 [4]-VOT2013、2014、2015 和 TIR2015 (热图像跟踪)。在前者中,我们的 AOGTracker 优于包括两个基于深度卷积网络的跟踪器在内的最先进的跟踪算法 [5]、[6]。在后一个基准测试中,我们的 AOGTracker 在 VOT2013 中优于所有其他跟踪器,并且在 VOT2014、2015 和 TIR2015 中与最先进的方法相当。