IEEE Trans Cybern. 2017 Oct;47(10):3172-3183. doi: 10.1109/TCYB.2017.2705345.
Motion models have proved to be a crucial part of the visual tracking process. In recent trackers, particle filter-based and sliding window-based motion models have been widely used. By treating the motion model as a sequence prediction problem, we can estimate the motion of objects from their trajectories. Moreover, it is possible to transfer the knowledge learned from annotated trajectories to new objects. Inspired by recent advances in deep learning for visual feature extraction and sequence prediction, we propose a trajectory predictor that learns prior knowledge from annotated trajectories and transfers it to predict the motion of target objects. In this predictor, convolutional neural networks extract the visual features of target objects, and a long short-term memory (LSTM) model leverages the annotated trajectory priors together with sequential visual information, including the tracked features and center locations of the target object, to predict the motion. Furthermore, to extend this method to videos in which annotated trajectories are difficult to obtain, we propose a dynamic weighted motion model that combines the trajectory predictor with a random sampler. To evaluate the transfer performance of the proposed trajectory predictor, we annotated a real-world vehicle dataset. Experimental results on both this real-world vehicle dataset and an online tracker benchmark dataset indicate that the proposed method outperforms several state-of-the-art trackers.
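The dynamic weighted motion model described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes candidate locations are drawn partly around the trajectory predictor's output and partly around the previous target center (the random-sampler branch), with a scalar weight that shifts toward whichever branch produced the best-scoring candidate in the last frame. All function and parameter names are hypothetical.

```python
import random

def propose_candidates(prev_center, predicted_center, w_pred,
                       n=10, radius=5.0, rng=None):
    """Blend predictor-based and random-sampler proposals.

    w_pred in [0, 1] is the dynamic weight on the trajectory-predictor
    branch; the remaining candidates come from Gaussian sampling around
    the previous center. Names and the blending scheme are illustrative.
    """
    rng = rng or random.Random(0)
    n_pred = round(w_pred * n)
    cands = []
    # Candidates near the trajectory predictor's output.
    for _ in range(n_pred):
        cands.append((predicted_center[0] + rng.gauss(0, radius),
                      predicted_center[1] + rng.gauss(0, radius)))
    # Random-sampler candidates near the previous target center.
    for _ in range(n - n_pred):
        cands.append((prev_center[0] + rng.gauss(0, radius),
                      prev_center[1] + rng.gauss(0, radius)))
    return cands

def update_weight(w_pred, predictor_won, lr=0.2):
    """Move the weight toward the branch whose candidate scored best."""
    target = 1.0 if predictor_won else 0.0
    return (1.0 - lr) * w_pred + lr * target
```

In an actual tracker, each candidate would be scored by the appearance model, and `predictor_won` would record whether the winning candidate came from the predictor branch, so the weight adapts as the predictor's reliability changes over the sequence.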