Vision and Learning Laboratory, Department of Computer Engineering, Inha University, Incheon 22212, Korea.
Sensors (Basel). 2022 Oct 18;22(20):7943. doi: 10.3390/s22207943.
Effective multi-object tracking is still challenging due to the trade-off between tracking accuracy and speed. Because the recent multi-object tracking (MOT) methods leverage object appearance and motion models so as to associate detections between consecutive frames, the key for effective multi-object tracking is to reduce the computational complexity of learning both models. To this end, this work proposes global appearance and motion models to discriminate multiple objects instead of learning local object-specific models. In concrete detail, it learns a global appearance model using contrastive learning between object appearances. In addition, we learn a global relation motion model using relative motion learning between objects. Moreover, this paper proposes object constraint learning for improving tracking efficiency. This study considers the discriminability of the models as a constraint, and learns both models when inconsistency with the constraint occurs. Therefore, object constraint learning differs from the conventional online learning for multi-object tracking which updates learnable parameters per frame. This work incorporates global models and object constraint learning into the confidence-based association method, and compare our tracker with the state-of-the-art methods on public available MOT Challenge datasets. As a result, we achieve 64.5% MOTA (multi-object tracking accuracy) and 6.54 Hz tracking speed on the MOT16 test dataset. The comparison results show that our methods can contribute to improve tracking accuracy and tracking speed together.
由于跟踪精度和速度之间的权衡,有效的多目标跟踪仍然具有挑战性。由于最近的多目标跟踪 (MOT) 方法利用目标外观和运动模型来关联连续帧之间的检测,因此有效多目标跟踪的关键是降低学习这两种模型的计算复杂度。为此,这项工作提出了全局外观和运动模型来区分多个对象,而不是学习特定于对象的局部模型。具体来说,它使用对象外观之间的对比学习来学习全局外观模型。此外,我们使用对象之间的相对运动学习来学习全局关系运动模型。此外,本文提出了目标约束学习来提高跟踪效率。本研究将模型的可辨别性视为约束条件,并在出现不一致时学习这两种模型。因此,目标约束学习与传统的多目标跟踪在线学习不同,后者每帧更新可学习参数。这项工作将全局模型和目标约束学习纳入置信度关联方法中,并在公共可用的 MOT 挑战赛数据集上与最先进的方法进行比较。结果,我们在 MOT16 测试数据集上实现了 64.5%的 MOTA(多目标跟踪精度)和 6.54Hz 的跟踪速度。比较结果表明,我们的方法可以有助于提高跟踪精度和跟踪速度。