• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自适应集成感知跟踪。

Adaptive ensemble perception tracking.

机构信息

School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, China.

School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, China; Peng Cheng Laboratory, Shenzhen, China.

出版信息

Neural Netw. 2021 Oct;142:316-328. doi: 10.1016/j.neunet.2021.05.003. Epub 2021 May 15.

DOI:10.1016/j.neunet.2021.05.003
PMID:34082287
Abstract

Recently, tracking models based on bounding box regression (such as region proposal networks), built on the Siamese network, have attracted much attention. Despite their promising performance, these trackers are less effective in perceiving the target information in the following two aspects. First, existing regression models cannot take a global view of a large-scale target since the effective receptive field of a neuron is too small to cover the target with a large scale. Second, the neurons with a fixed receptive field (RF) size in these models cannot adapt to the scale and aspect ratio changes of the target. In this paper, we propose an adaptive ensemble perception tracking framework to address these issues. Specifically, we first construct a per-pixel prediction model, which predicts the target state at each pixel of the correlated feature. On top of the per-pixel prediction model, we then develop a confidence-guided ensemble prediction mechanism. The ensemble mechanism adaptively fuses the predictions of multiple pixels with the guidance of confidence maps, which enlarges the perception range and enhances the adaptive perception ability at the object-level. In addition, we introduce a receptive field adaption model to enhance the adaptive perception ability at the neuron-level, which adjusts the RF by adaptively integrating the features with different RFs. Extensive experimental results on the VOT2018, VOT2016, UAV123, LaSOT, and TC128 datasets demonstrate that the proposed algorithm performs favorably against the state-of-the-art methods in terms of accuracy and speed.

摘要

最近,基于边界框回归(如区域提议网络)的跟踪模型,构建在孪生网络上,引起了广泛关注。尽管这些跟踪器性能很有前景,但在以下两个方面,它们对目标信息的感知能力较差。首先,现有回归模型不能全局观察大目标,因为神经元的有效感受野太小,无法覆盖大尺度的目标。其次,这些模型中具有固定感受野(RF)大小的神经元无法适应目标的尺度和纵横比变化。在本文中,我们提出了一种自适应集成感知跟踪框架来解决这些问题。具体来说,我们首先构建了一个逐像素预测模型,该模型预测相关特征中每个像素的目标状态。在逐像素预测模型的基础上,我们开发了一种置信度引导的集成预测机制。该集成机制自适应地融合多个像素的预测,并在置信图的指导下,扩大感知范围,增强对象级别的自适应感知能力。此外,我们引入了一个感受野自适应模型,通过自适应地整合具有不同 RF 的特征来增强神经元级别的自适应感知能力。在 VOT2018、VOT2016、UAV123、LaSOT 和 TC128 数据集上的广泛实验结果表明,所提出的算法在准确性和速度方面均优于最先进的方法。

相似文献

1
Adaptive ensemble perception tracking.自适应集成感知跟踪。
Neural Netw. 2021 Oct;142:316-328. doi: 10.1016/j.neunet.2021.05.003. Epub 2021 May 15.
2
Robust Template Adjustment Siamese Network for Object Visual Tracking.用于目标视觉跟踪的鲁棒模板调整暹罗网络
Sensors (Basel). 2021 Feb 20;21(4):1466. doi: 10.3390/s21041466.
3
TGAN: A simple model update strategy for visual tracking via template-guidance attention network.TGAN:一种基于模板引导注意力网络的简单视觉跟踪模型更新策略。
Neural Netw. 2021 Dec;144:61-74. doi: 10.1016/j.neunet.2021.08.010. Epub 2021 Aug 16.
4
Improving Object Tracking by Added Noise and Channel Attention.添加噪声和通道注意力以改进目标跟踪。
Sensors (Basel). 2020 Jul 6;20(13):3780. doi: 10.3390/s20133780.
5
SiamCAN: Real-Time Visual Tracking Based on Siamese Center-Aware Network.暹罗CAN:基于暹罗中心感知网络的实时视觉跟踪
IEEE Trans Image Process. 2021;30:3597-3609. doi: 10.1109/TIP.2021.3060905. Epub 2021 Mar 17.
6
Siamese anchor-free object tracking with multiscale spatial attentions.基于多尺度空间注意力的暹罗无锚点目标跟踪
Sci Rep. 2021 Nov 25;11(1):22908. doi: 10.1038/s41598-021-02095-4.
7
Siamese Implicit Region Proposal Network With Compound Attention for Visual Tracking.基于复合注意力的暹罗隐式区域提案网络的视觉跟踪。
IEEE Trans Image Process. 2022;31:1882-1894. doi: 10.1109/TIP.2022.3148876. Epub 2022 Feb 16.
8
HROM: Learning High-Resolution Representation and Object-Aware Masks for Visual Object Tracking.HROM:用于视觉目标跟踪的学习高分辨率表示和对象感知掩模。
Sensors (Basel). 2020 Aug 26;20(17):4807. doi: 10.3390/s20174807.
9
Learning to Rank Proposals for Siamese Visual Tracking.学习对孪生视觉跟踪的提案进行排序。
IEEE Trans Image Process. 2021;30:8785-8796. doi: 10.1109/TIP.2021.3120305. Epub 2021 Oct 27.
10
Toward Robust Visual Object Tracking With Independent Target-Agnostic Detection and Effective Siamese Cross-Task Interaction.通过独立的目标无关检测和有效的暹罗跨任务交互实现鲁棒视觉目标跟踪
IEEE Trans Image Process. 2023;32:1541-1554. doi: 10.1109/TIP.2023.3246800. Epub 2023 Mar 6.