• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过独立的目标无关检测和有效的暹罗跨任务交互实现鲁棒视觉目标跟踪

Toward Robust Visual Object Tracking With Independent Target-Agnostic Detection and Effective Siamese Cross-Task Interaction.

作者信息

Xu Tianyang, Feng Zhenhua, Wu Xiao-Jun, Kittler Josef

出版信息

IEEE Trans Image Process. 2023;32:1541-1554. doi: 10.1109/TIP.2023.3246800. Epub 2023 Mar 6.

DOI:10.1109/TIP.2023.3246800
PMID:37027596
Abstract

Advanced Siamese visual object tracking architectures are jointly trained using pair-wise input images to perform target classification and bounding box regression. They have achieved promising results in recent benchmarks and competitions. However, the existing methods suffer from two limitations: First, though the Siamese structure can estimate the target state in an instance frame, provided the target appearance does not deviate too much from the template, the detection of the target in an image cannot be guaranteed in the presence of severe appearance variations. Second, despite the classification and regression tasks sharing the same output from the backbone network, their specific modules and loss functions are invariably designed independently, without promoting any interaction. Yet, in a general tracking task, the centre classification and bounding box regression tasks are collaboratively working to estimate the final target location. To address the above issues, it is essential to perform target-agnostic detection so as to promote cross-task interactions in a Siamese-based tracking framework. In this work, we endow a novel network with a target-agnostic object detection module to complement the direct target inference, and to avoid or minimise the misalignment of the key cues of potential template-instance matches. To unify the multi-task learning formulation, we develop a cross-task interaction module to ensure consistent supervision of the classification and regression branches, improving the synergy of different branches. To eliminate potential inconsistencies that may arise within a multi-task architecture, we assign adaptive labels, rather than fixed hard labels, to supervise the network training more effectively. The experimental results obtained on several benchmarks, i.e., OTB100, UAV123, VOT2018, VOT2019, and LaSOT, demonstrate the effectiveness of the advanced target detection module, as well as the cross-task interaction, exhibiting superior tracking performance as compared with the state-of-the-art tracking methods.

摘要

先进的连体视觉目标跟踪架构通过成对输入图像进行联合训练,以执行目标分类和边界框回归。它们在最近的基准测试和竞赛中取得了令人瞩目的成果。然而,现有方法存在两个局限性:第一,尽管连体结构可以在实例帧中估计目标状态,前提是目标外观与模板的偏差不太大,但在存在严重外观变化的情况下,无法保证在图像中检测到目标。第二,尽管分类和回归任务共享骨干网络的相同输出,但它们的特定模块和损失函数总是独立设计的,没有促进任何交互。然而,在一般的跟踪任务中,中心分类和边界框回归任务协同工作以估计最终目标位置。为了解决上述问题,在基于连体的跟踪框架中执行目标无关检测以促进跨任务交互至关重要。在这项工作中,我们为一个新颖的网络赋予了一个目标无关的目标检测模块,以补充直接目标推理,并避免或最小化潜在模板-实例匹配的关键线索的错位。为了统一多任务学习公式,我们开发了一个跨任务交互模块,以确保对分类和回归分支进行一致的监督,提高不同分支的协同作用。为了消除多任务架构中可能出现的潜在不一致性,我们分配自适应标签,而不是固定的硬标签,以更有效地监督网络训练。在几个基准测试(即OTB100、UAV123、VOT2018、VOT2019和LaSOT)上获得的实验结果证明了先进目标检测模块以及跨任务交互的有效性,与最先进的跟踪方法相比,展现出卓越的跟踪性能。

相似文献

1
Toward Robust Visual Object Tracking With Independent Target-Agnostic Detection and Effective Siamese Cross-Task Interaction.通过独立的目标无关检测和有效的暹罗跨任务交互实现鲁棒视觉目标跟踪
IEEE Trans Image Process. 2023;32:1541-1554. doi: 10.1109/TIP.2023.3246800. Epub 2023 Mar 6.
2
Robust Template Adjustment Siamese Network for Object Visual Tracking.用于目标视觉跟踪的鲁棒模板调整暹罗网络
Sensors (Basel). 2021 Feb 20;21(4):1466. doi: 10.3390/s21041466.
3
Learning Geometry Information of Target for Visual Object Tracking with Siamese Networks.利用连体网络学习视觉目标跟踪中目标的几何信息。
Sensors (Basel). 2021 Nov 23;21(23):7790. doi: 10.3390/s21237790.
4
HROM: Learning High-Resolution Representation and Object-Aware Masks for Visual Object Tracking.HROM:用于视觉目标跟踪的学习高分辨率表示和对象感知掩模。
Sensors (Basel). 2020 Aug 26;20(17):4807. doi: 10.3390/s20174807.
5
SiamBAN: Target-Aware Tracking With Siamese Box Adaptive Network.暹罗边界感知网络:基于暹罗框自适应网络的目标感知跟踪
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):5158-5173. doi: 10.1109/TPAMI.2022.3195759. Epub 2023 Mar 7.
6
SiamCAN: Real-Time Visual Tracking Based on Siamese Center-Aware Network.暹罗CAN:基于暹罗中心感知网络的实时视觉跟踪
IEEE Trans Image Process. 2021;30:3597-3609. doi: 10.1109/TIP.2021.3060905. Epub 2021 Mar 17.
7
Object Relocation Visual Tracking Based on Histogram Filter and Siamese Network in Intelligent Transportation.基于直方图滤波和孪生网络的智能交通中目标重定位视觉跟踪。
Sensors (Basel). 2022 Nov 8;22(22):8591. doi: 10.3390/s22228591.
8
Target-aware transformer tracking with hard occlusion instance generation.基于硬遮挡实例生成的目标感知Transformer跟踪
Front Neurorobot. 2024 Jan 10;17:1323188. doi: 10.3389/fnbot.2023.1323188. eCollection 2023.
9
Learning to Rank Proposals for Siamese Visual Tracking.学习对孪生视觉跟踪的提案进行排序。
IEEE Trans Image Process. 2021;30:8785-8796. doi: 10.1109/TIP.2021.3120305. Epub 2021 Oct 27.
10
Dual Aligned Siamese Dense Regression Tracker.双对齐暹罗密集回归跟踪器
IEEE Trans Image Process. 2022;31:3630-3643. doi: 10.1109/TIP.2022.3166638. Epub 2022 May 26.