IEEE Trans Image Process. 2023;32:2147-2159. doi: 10.1109/TIP.2023.3263104. Epub 2023 Apr 6.
The supervised one-shot multi-object tracking (MOT) algorithms have achieved satisfactory performance benefiting from a large amount of labeled data. However, in real applications, acquiring plenty of laborious manual annotations is not practical. It is necessary to adapt the one-shot MOT model trained on a labeled domain to an unlabeled domain, yet such domain adaptation is a challenging problem. The main reason is that it has to detect and associate multiple moving objects distributed in various spatial locations, but there are obvious discrepancies in style, object identity, quantity, and scale among different domains. Motivated by this, we propose a novel inference-domain network evolution to enhance the generalization ability of the one-shot MOT model. Specifically, we design a spatial topology-based one-shot network (STONet) to perform the one-shot MOT task, where a self-supervision mechanism is employed to stimulate the feature extractor to learn the spatial contexts without any annotated information. Furthermore, a temporal identity aggregation (TIA) module is proposed to assist STONet to weaken the adverse effects of noisy labels in the network evolution. This designed TIA aggregates historical embeddings with the same identity to learn cleaner and more reliable pseudo labels. In the inference domain, the proposed STONet with TIA performs pseudo label collection and parameter update progressively to realize the network evolution from the labeled source domain to an unlabeled inference domain. Extensive experiments and ablation studies conducted on MOT15, MOT17, and MOT20, demonstrate the effectiveness of our proposed model.
有监督的单次多目标跟踪 (MOT) 算法得益于大量的标记数据,取得了令人满意的性能。然而,在实际应用中,获取大量费力的人工标注是不切实际的。有必要将在标记域中训练的单次 MOT 模型适应当前无标记的域,但这种域自适应是一个具有挑战性的问题。主要原因是它必须检测和关联分布在不同空间位置的多个移动目标,但不同域之间的风格、对象身份、数量和比例存在明显差异。受此启发,我们提出了一种新的推理域网络进化方法,以增强单次 MOT 模型的泛化能力。具体来说,我们设计了一种基于空间拓扑的单次网络 (STONet) 来执行单次 MOT 任务,其中采用了自监督机制来激励特征提取器学习空间上下文,而无需任何标注信息。此外,还提出了一个时间身份聚合 (TIA) 模块来辅助 STONet 减轻网络进化中噪声标签的不利影响。所设计的 TIA 聚合具有相同身份的历史嵌入,以学习更干净和更可靠的伪标签。在推理域中,所提出的具有 TIA 的 STONet 执行伪标签收集和参数更新,以逐步实现从有标记源域到无标记推理域的网络进化。在 MOT15、MOT17 和 MOT20 上进行的广泛实验和消融研究表明了我们所提出模型的有效性。