• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于提案的视觉跟踪:使用空间级联变换区域提案网络。

Proposal-Based Visual Tracking Using Spatial Cascaded Transformed Region Proposal Network.

机构信息

Faculty of Space, Xi'an Institute of Optics and Precision Mechanics of CAS, Xi'an 710119, China.

School of Astronautics, Northwestern Polytechnical Universty, Xi'an 710072, China.

出版信息

Sensors (Basel). 2020 Aug 26;20(17):4810. doi: 10.3390/s20174810.

DOI:10.3390/s20174810
PMID:32858907
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7506765/
Abstract

Region proposal network (RPN) based trackers employ the classification and regression block to generate the proposals, the proposal that contains the highest similarity score is formulated to be the groundtruth candidate of next frame. However, region proposal network based trackers cannot make the best of the features from different convolutional layers, and the original loss function cannot alleviate the data imbalance issue of the training procedure. We propose the Spatial Cascaded Transformed RPN to combine the RPN and STN (spatial transformer network) together, in order to successfully obtain the proposals of high quality, which can simultaneously improves the robustness. The STN can transfer the spatial transformed features though different stages, which extends the spatial representation capability of such networks handling complex scenarios such as scale variation and affine transformation. We break the restriction though an easy samples penalization loss (shrinkage loss) instead of smooth L1 function. Moreover, we perform the multi-cue proposals re-ranking to guarantee the accuracy of the proposed tracker. We extensively prove the effectiveness of our proposed method on the ablation studies of the tracking datasets, which include OTB-2015 (Object Tracking Benchmark 2015), VOT-2018 (Visual Object Tracking 2018), LaSOT (Large Scale Single Object Tracking), TrackingNet (A Large-Scale Dataset and Benchmark for Object Tracking in the Wild) and UAV123 (UAV Tracking Dataset).

摘要

基于区域提议网络(RPN)的跟踪器使用分类和回归块来生成提议,具有最高相似度得分的提议被制定为下一个帧的地面真实候选者。然而,基于区域提议网络的跟踪器无法充分利用来自不同卷积层的特征,并且原始的损失函数无法缓解训练过程中的数据不平衡问题。我们提出了空间级联变换 RPN,将 RPN 和 STN(空间变换网络)结合在一起,以成功获得高质量的提议,同时提高鲁棒性。STN 可以通过不同的阶段传输空间变换特征,从而扩展了这些网络处理复杂场景(如尺度变化和仿射变换)的空间表示能力。我们通过简单的样本惩罚损失(收缩损失)而不是平滑 L1 函数来打破这种限制。此外,我们执行多线索提议重新排序,以保证所提出的跟踪器的准确性。我们在跟踪数据集的消融研究中广泛证明了我们提出的方法的有效性,包括 OTB-2015(Object Tracking Benchmark 2015)、VOT-2018(Visual Object Tracking 2018)、LaSOT(大规模单目标跟踪)、TrackingNet(野外目标跟踪的大规模数据集和基准)和 UAV123(UAV 跟踪数据集)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/2b1b5b1982fd/sensors-20-04810-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/f7058604b422/sensors-20-04810-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/dd7419b2f179/sensors-20-04810-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/7dfa128ab638/sensors-20-04810-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/5033b2fce5dd/sensors-20-04810-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/b9f6c6b93517/sensors-20-04810-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/3d6e9a42c4d2/sensors-20-04810-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/7ba8d254e8de/sensors-20-04810-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/47b25b113bc5/sensors-20-04810-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/9ac0faad7c21/sensors-20-04810-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/347c078cf9b7/sensors-20-04810-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/c7538a4c7067/sensors-20-04810-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/2b1b5b1982fd/sensors-20-04810-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/f7058604b422/sensors-20-04810-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/dd7419b2f179/sensors-20-04810-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/7dfa128ab638/sensors-20-04810-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/5033b2fce5dd/sensors-20-04810-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/b9f6c6b93517/sensors-20-04810-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/3d6e9a42c4d2/sensors-20-04810-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/7ba8d254e8de/sensors-20-04810-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/47b25b113bc5/sensors-20-04810-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/9ac0faad7c21/sensors-20-04810-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/347c078cf9b7/sensors-20-04810-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/c7538a4c7067/sensors-20-04810-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/648b/7506765/2b1b5b1982fd/sensors-20-04810-g012.jpg

相似文献

1
Proposal-Based Visual Tracking Using Spatial Cascaded Transformed Region Proposal Network.基于提案的视觉跟踪:使用空间级联变换区域提案网络。
Sensors (Basel). 2020 Aug 26;20(17):4810. doi: 10.3390/s20174810.
2
Learning to Rank Proposals for Siamese Visual Tracking.学习对孪生视觉跟踪的提案进行排序。
IEEE Trans Image Process. 2021;30:8785-8796. doi: 10.1109/TIP.2021.3120305. Epub 2021 Oct 27.
3
Deep Object Tracking With Shrinkage Loss.基于收缩损失的深度目标跟踪
IEEE Trans Pattern Anal Mach Intell. 2022 May;44(5):2386-2401. doi: 10.1109/TPAMI.2020.3041332. Epub 2022 Apr 1.
4
Siamese Implicit Region Proposal Network With Compound Attention for Visual Tracking.基于复合注意力的暹罗隐式区域提案网络的视觉跟踪。
IEEE Trans Image Process. 2022;31:1882-1894. doi: 10.1109/TIP.2022.3148876. Epub 2022 Feb 16.
5
EMAT: Efficient feature fusion network for visual tracking via optimized multi-head attention.EMAT:通过优化多头注意力进行视觉跟踪的高效特征融合网络。
Neural Netw. 2024 Apr;172:106110. doi: 10.1016/j.neunet.2024.106110. Epub 2024 Jan 6.
6
Siamese anchor-free object tracking with multiscale spatial attentions.基于多尺度空间注意力的暹罗无锚点目标跟踪
Sci Rep. 2021 Nov 25;11(1):22908. doi: 10.1038/s41598-021-02095-4.
7
DeforT: Deformable transformer for visual tracking.DeforT:用于视觉跟踪的可变形 Transformer。
Neural Netw. 2024 Aug;176:106380. doi: 10.1016/j.neunet.2024.106380. Epub 2024 May 8.
8
Distractor-Aware Deep Regression for Visual Tracking.基于分心物感知的深度回归视觉跟踪方法
Sensors (Basel). 2019 Jan 18;19(2):387. doi: 10.3390/s19020387.
9
Weighted Kernel Filter Based Anti-Air Object Tracking for Thermal Infrared Systems.基于加权核滤波的热红外系统反空中目标跟踪。
Sensors (Basel). 2020 Jul 22;20(15):4081. doi: 10.3390/s20154081.
10
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.

本文引用的文献

1
Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking.基于软掩模的特征融合与通道和空间注意力的鲁棒视觉目标跟踪
Sensors (Basel). 2020 Jul 20;20(14):4021. doi: 10.3390/s20144021.
2
Cascade R-CNN: High Quality Object Detection and Instance Segmentation.级联 R-CNN:高质量目标检测和实例分割。
IEEE Trans Pattern Anal Mach Intell. 2021 May;43(5):1483-1498. doi: 10.1109/TPAMI.2019.2956516. Epub 2021 Apr 1.
3
Shrinkage estimation of non-negative mean vector with unknown covariance under balance loss.
平衡损失下协方差未知的非负均值向量的收缩估计
J Inequal Appl. 2018;2018(1):331. doi: 10.1186/s13660-018-1919-0. Epub 2018 Dec 3.
4
Focal Loss for Dense Object Detection.用于密集目标检测的焦散损失
IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):318-327. doi: 10.1109/TPAMI.2018.2858826. Epub 2018 Jul 23.
5
Robust Visual Tracking Based on Adaptive Convolutional Features and Offline Siamese Tracker.基于自适应卷积特征和离线孪生跟踪器的鲁棒视觉跟踪
Sensors (Basel). 2018 Jul 20;18(7):2359. doi: 10.3390/s18072359.
6
Cost-Sensitive Learning of Deep Feature Representations From Imbalanced Data.从不平衡数据中进行深度特征表示的成本敏感学习。
IEEE Trans Neural Netw Learn Syst. 2018 Aug;29(8):3573-3587. doi: 10.1109/TNNLS.2017.2732482. Epub 2017 Aug 17.
7
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
8
Visual Tracking: An Experimental Survey.视觉跟踪:实验综述。
IEEE Trans Pattern Anal Mach Intell. 2014 Jul;36(7):1442-68. doi: 10.1109/TPAMI.2013.230.
9
Object Tracking Benchmark.目标跟踪基准测试。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1834-48. doi: 10.1109/TPAMI.2014.2388226.