Yao Siyuan, Han Xiaoguang, Zhang Hua, Wang Xiao, Cao Xiaochun
IEEE Trans Image Process. 2021;30:4814-4827. doi: 10.1109/TIP.2021.3076272. Epub 2021 May 7.
In most recent years, Siamese trackers have drawn great attention because of their well-balanced accuracy and efficiency. Although these approaches have achieved great success, the discriminative power of the conventional Siamese trackers is still limited by the insufficient template-candidate representation. Most of the existing approaches take non-aligned features to learn a similarity function for template-candidate matching, while the target object's geometrical transformation is seldom explored. To address this problem, we propose a novel Siamese tracking framework, which enables to dynamically transform the template-candidate features to a more discriminative viewpoint for similarity matching. Specifically, we reformulate the template-candidate matching problem of the conventional Siamese tracker from the perspective of Lucas-Kanade (LK) image alignment approach. A Lucas-Kanade network (LKNet) is proposed and incorporated to the Siamese architecture to learn aligned feature representations in data-driven trainable manner, which is able to enhance the model adaptability in challenging scenarios. Within this framework, we propose two Siamese trackers named LK-Siam and LK-SiamRPN to validate the effectiveness. Extensive experiments conducted on the prevalent datasets show that the proposed method is more competitive over a number of state-of-the-art methods.
近年来,暹罗跟踪器因其平衡的准确性和效率而备受关注。尽管这些方法取得了巨大成功,但传统暹罗跟踪器的判别能力仍然受到模板-候选表示不足的限制。现有的大多数方法采用未对齐的特征来学习用于模板-候选匹配的相似性函数,而很少探索目标对象的几何变换。为了解决这个问题,我们提出了一种新颖的暹罗跟踪框架,该框架能够将模板-候选特征动态变换到更具判别力的视角进行相似性匹配。具体来说,我们从卢卡斯-卡纳德(LK)图像对齐方法的角度重新构建了传统暹罗跟踪器的模板-候选匹配问题。提出了一个卢卡斯-卡纳德网络(LKNet)并将其纳入暹罗架构,以数据驱动的可训练方式学习对齐的特征表示,这能够增强模型在具有挑战性场景中的适应性。在此框架内,我们提出了两个名为LK-Siam和LK-SiamRPN的暹罗跟踪器来验证其有效性。在流行数据集上进行的大量实验表明,所提出的方法比许多现有方法更具竞争力。