IEEE Trans Cybern. 2021 Dec;51(12):6305-6318. doi: 10.1109/TCYB.2020.2980618. Epub 2021 Dec 22.
A long-term visual object tracking performance evaluation methodology and a benchmark are proposed. Performance measures are designed by following a long-term tracking definition to maximize the analysis probing strength. The new measures outperform existing ones in interpretation potential and in better distinguishing between different tracking behaviors. We show that these measures generalize the short-term performance measures, thus linking the two tracking problems. Furthermore, the new measures are highly robust to temporal annotation sparsity and allow annotation of sequences hundreds of times longer than in the current datasets without increasing manual annotation labor. A new challenging dataset of carefully selected sequences with many target disappearances is proposed. A new tracking taxonomy is proposed to position trackers on the short-term/long-term spectrum. The benchmark contains an extensive evaluation of the largest number of long-term trackers and comparison to state-of-the-art short-term trackers. We analyze the influence of tracking architecture implementations to long-term performance and explore various redetection strategies as well as the influence of visual model update strategies to long-term tracking drift. The methodology is integrated in the VOT toolkit to automate experimental analysis and benchmarking and to facilitate the future development of long-term trackers.
提出了一种长期视觉目标跟踪性能评估方法和基准。通过遵循长期跟踪定义来设计性能指标,以最大程度地提高分析探测强度。这些新的指标在解释能力上优于现有的指标,并能更好地区分不同的跟踪行为。我们表明,这些指标推广了短期性能指标,从而将两个跟踪问题联系起来。此外,这些新指标对时间注释稀疏性具有高度鲁棒性,允许对比当前数据集长数百倍的序列进行注释,而无需增加手动注释工作量。提出了一个新的具有许多目标消失的精心挑选序列的具有挑战性的数据集。提出了一种新的跟踪分类法,将跟踪器置于短期/长期范围内。基准包含对数量最多的长期跟踪器的广泛评估,并与最新的短期跟踪器进行比较。我们分析了跟踪架构实现对长期性能的影响,并探索了各种重新检测策略以及视觉模型更新策略对长期跟踪漂移的影响。该方法已集成到 VOT 工具包中,以实现实验分析和基准测试的自动化,并促进长期跟踪器的未来发展。