Department of Computer Science, Sichuan University, Chengdu 610017, China.
Sensors (Basel). 2022 Sep 1;22(17):6597. doi: 10.3390/s22176597.
As a prevailing solution for visual tracking, Siamese networks manifest high performance via convolution neural networks and weight-sharing schemes. Most existing Siamese networks have adopted various offline training strategies to realize precise tracking by comparing the extracted target features with template features. However, their performances may degrade when dealing with unknown targets. The tracker is unable to learn background information through offline training, and it is susceptible to background interference, which finally leads to tracking failure. In this paper, we propose a twin-branch architecture (dubbed SiamOT) to mitigate the above problem in existing Siamese networks, wherein one branch is a classical Siamese network, and the other branch is an online training branch. Especially, the proposed online branch utilizes feature fusion and attention mechanism, which is able to capture and update both the target and the background information so as to refine the description of the target. Extensive experiments have been carried out on three mainstream benchmarks, along with an ablation study, to validate the effectiveness of SiamOT. It turns out that SiamOT achieves superior performance with stronger target discrimination abilities.
作为视觉跟踪的一种主流解决方案,孪生网络通过卷积神经网络和权重共享方案表现出了很高的性能。大多数现有的孪生网络已经采用了各种离线训练策略,通过将提取的目标特征与模板特征进行比较来实现精确跟踪。然而,当处理未知目标时,它们的性能可能会下降。跟踪器无法通过离线训练学习背景信息,并且容易受到背景干扰,最终导致跟踪失败。在本文中,我们提出了一种孪生分支结构(称为 SiamOT),以减轻现有孪生网络中的上述问题,其中一个分支是经典的孪生网络,另一个分支是在线训练分支。特别是,所提出的在线分支利用特征融合和注意力机制,能够捕获和更新目标和背景信息,从而细化对目标的描述。我们在三个主流基准上进行了广泛的实验,并进行了消融研究,以验证 SiamOT 的有效性。结果表明,SiamOT 具有更强的目标判别能力,因此性能更优。