Li Xi, Zhao Liming, Ji Wei, Wu Yiming, Wu Fei, Yang Ming-Hsuan, Tao Dacheng, Reid Ian
IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):915-927. doi: 10.1109/TPAMI.2018.2818132. Epub 2018 Mar 22.
In the fields of computer vision and graphics, keypoint-based object tracking is a fundamental and challenging problem, which is typically formulated in a spatio-temporal context modeling framework. However, many existing keypoint trackers are incapable of effectively modeling and balancing the following three aspects in a simultaneous manner: temporal model coherence across frames, spatial model consistency within frames, and discriminative feature construction. To address this problem, we propose a robust keypoint tracker based on spatio-temporal multi-task structured output optimization driven by discriminative metric learning. Consequently, temporal model coherence is characterized by multi-task structured keypoint model learning over several adjacent frames; spatial model consistency is modeled by solving a geometric verification based structured learning problem; discriminative feature construction is enabled by metric learning to ensure the intra-class compactness and inter-class separability. To achieve the goal of effective object tracking, we jointly optimize the above three modules in a spatio-temporal multi-task learning scheme. Furthermore, we incorporate this joint learning scheme into both single-object and multi-object tracking scenarios, resulting in robust tracking results. Experiments over several challenging datasets have justified the effectiveness of our single-object and multi-object trackers against the state-of-the-art.
在计算机视觉和图形学领域,基于关键点的目标跟踪是一个基本且具有挑战性的问题,通常在时空上下文建模框架中进行表述。然而,许多现有的关键点跟踪器无法同时有效地对以下三个方面进行建模和平衡:跨帧的时间模型一致性、帧内的空间模型一致性以及判别性特征构建。为了解决这个问题,我们提出了一种基于判别度量学习驱动的时空多任务结构化输出优化的鲁棒关键点跟踪器。因此,时间模型一致性通过在几个相邻帧上进行多任务结构化关键点模型学习来表征;空间模型一致性通过解决基于几何验证的结构化学习问题来建模;判别性特征构建通过度量学习来实现,以确保类内紧凑性和类间可分离性。为了实现有效的目标跟踪,我们在时空多任务学习方案中联合优化上述三个模块。此外,我们将这种联合学习方案纳入单目标和多目标跟踪场景,从而得到鲁棒的跟踪结果。在几个具有挑战性的数据集上进行的实验证明了我们的单目标和多目标跟踪器相对于现有技术的有效性。