TDIOT: Target-Driven Inference for Deep Video Object Tracking.

Author information

Gurkan Filiz, Cerkezi Llukman, Cirakman Ozgun, Gunsel Bilge

Publication information

IEEE Trans Image Process. 2021;30:7938-7951. doi: 10.1109/TIP.2021.3112010. Epub 2021 Sep 22.

DOI:10.1109/TIP.2021.3112010
PMID:34534080
Abstract

Recent tracking-by-detection approaches use deep object detectors as the target detection baseline because of their high performance on still images. For effective video object tracking, object detection is integrated with a data association step performed either by a custom-designed inference architecture or by end-to-end joint training for tracking. In this work, we adopt the former approach and use the pre-trained Mask R-CNN deep object detector as the baseline. We introduce a novel inference architecture placed on top of the FPN-ResNet101 backbone of Mask R-CNN to jointly perform detection and tracking, without requiring additional training for tracking. The proposed single object tracker, TDIOT, applies appearance similarity-based temporal matching for data association. To tackle tracking discontinuities, we incorporate a local search and matching module that exploits SiamFC into the inference head layer. Moreover, to improve robustness to scale changes, we introduce a scale-adaptive region proposal network that searches for the target within an adaptively enlarged spatial neighborhood specified by the trace of the target. To meet long-term tracking requirements, a low-cost verification layer is incorporated into the inference architecture to monitor the presence of the target based on its LBP histogram model. Performance evaluation on videos from the VOT2016, VOT2018, and VOT-LT2018 datasets demonstrates that TDIOT achieves higher accuracy than state-of-the-art short-term trackers while providing comparable performance in long-term tracking. We also evaluate our tracker on the LaSOT dataset, where we observe that TDIOT performs comparably to other methods that are trained on LaSOT. The source code and TDIOT output videos are accessible at https://github.com/msprITU/TDIOT.
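The abstract mentions a low-cost verification layer that monitors target presence through an LBP histogram model, but gives no implementation details. The following is only a minimal sketch of that general idea, not the authors' code: it assumes the basic 8-neighbour LBP operator, histogram intersection as the similarity measure, and an arbitrary threshold of 0.6. The function names `lbp_histogram`, `histogram_intersection`, and `target_present` are hypothetical and chosen for illustration.

```python
import numpy as np


def lbp_histogram(gray: np.ndarray) -> np.ndarray:
    """Return a normalized 256-bin histogram of basic 8-neighbour LBP codes.

    Assumption: `gray` is a 2-D grayscale patch of at least 3x3 pixels.
    """
    g = gray.astype(np.int32)
    h, w = g.shape
    center = g[1:-1, 1:-1]
    codes = np.zeros(center.shape, dtype=np.int32)
    # Offsets of the 8 neighbours, clockwise starting at the top-left pixel.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    for bit, (dy, dx) in enumerate(offsets):
        neighbour = g[1 + dy:h - 1 + dy, 1 + dx:w - 1 + dx]
        # Set this bit wherever the neighbour is at least as bright as the center.
        codes |= (neighbour >= center).astype(np.int32) << bit
    hist = np.bincount(codes.ravel(), minlength=256).astype(np.float64)
    return hist / hist.sum()


def histogram_intersection(p: np.ndarray, q: np.ndarray) -> float:
    """Similarity in [0, 1] between two normalized histograms."""
    return float(np.minimum(p, q).sum())


def target_present(model_hist: np.ndarray, candidate_patch: np.ndarray,
                   threshold: float = 0.6) -> bool:
    """Hypothetical presence check: the threshold is an illustrative assumption."""
    similarity = histogram_intersection(model_hist, lbp_histogram(candidate_patch))
    return similarity >= threshold
```

In such a scheme the model histogram would be built from the target patch in an initial frame, and each later candidate patch would be checked against it. A failed check could signal that the target has left the view, which is the kind of situation long-term tracking has to handle.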

Similar articles

1
TDIOT: Target-Driven Inference for Deep Video Object Tracking.
IEEE Trans Image Process. 2021;30:7938-7951. doi: 10.1109/TIP.2021.3112010. Epub 2021 Sep 22.
2
Effective Local and Global Search for Fast Long-Term Tracking.
IEEE Trans Pattern Anal Mach Intell. 2023 Jan;45(1):460-474. doi: 10.1109/TPAMI.2022.3153645. Epub 2022 Dec 5.
3
Training-Based Methods for Comparison of Object Detection Methods for Visual Object Tracking.
Sensors (Basel). 2018 Nov 16;18(11):3994. doi: 10.3390/s18113994.
4
A Visual Tracker Offering More Solutions.
Sensors (Basel). 2020 Sep 19;20(18):5374. doi: 10.3390/s20185374.
5
SiamHYPER: Learning a Hyperspectral Object Tracker From an RGB-Based Tracker.
IEEE Trans Image Process. 2022;31:7116-7129. doi: 10.1109/TIP.2022.3216995. Epub 2022 Nov 16.
6
A Discriminative Single-Shot Segmentation Network for Visual Object Tracking.
IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9742-9755. doi: 10.1109/TPAMI.2021.3137933. Epub 2022 Nov 7.
7
Cascaded Correlation Refinement for Robust Deep Tracking.
IEEE Trans Neural Netw Learn Syst. 2021 Mar;32(3):1276-1288. doi: 10.1109/TNNLS.2020.2984256. Epub 2021 Mar 1.
8
Toward Robust Visual Object Tracking With Independent Target-Agnostic Detection and Effective Siamese Cross-Task Interaction.
IEEE Trans Image Process. 2023;32:1541-1554. doi: 10.1109/TIP.2023.3246800. Epub 2023 Mar 6.
9
Learning Self-Corrective Network via Adaptive Self-Labeling and Dynamic NMS for High-Performance Long-Term Tracking.
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):653-664. doi: 10.1109/TNNLS.2023.3327486. Epub 2025 Jan 7.
10
Learning Dynamic Compact Memory Embedding for Deformable Visual Object Tracking.
IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):5656-5670. doi: 10.1109/TNNLS.2022.3208605. Epub 2024 Apr 4.