• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SimpleTrackV2:重新思考多目标跟踪的时间特征

SimpleTrackV2: Rethinking the Timing Characteristics for Multi-Object Tracking.

作者信息

Ding Yan, Ling Yuchen, Zhang Bozhi, Li Jiaxin, Guo Lingxi, Yang Zhe

机构信息

Key Laboratory of Dynamics and Control of Flight Vehicle, Ministry of Education, School of Aerospace Engineering, Beijing Institute of Technology, Beijing 100081, China.

Science and Technology on Space Physics Laboratory, Beijing 100076, China.

出版信息

Sensors (Basel). 2024 Sep 17;24(18):6015. doi: 10.3390/s24186015.

DOI:10.3390/s24186015
PMID:39338760
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11436168/
Abstract

Multi-object tracking tasks aim to assign unique trajectory codes to targets in video frames. Most detection-based tracking methods use Kalman filtering algorithms for trajectory prediction, directly utilizing associated target features for trajectory updates. However, this approach often fails, with camera jitter and transient target loss in real-world scenarios. This paper rethinks state prediction and fusion based on target temporal features to address these issues and proposes the SimpleTrackV2 algorithm, building on the previously designed SimpleTrack. Firstly, to address the poor prediction performance of linear motion models in complex scenes, we designed a target state prediction algorithm called LSTM-MP, based on long short-term memory (LSTM). This algorithm encodes the target's historical motion information using LSTM and decodes it with a multilayer perceptron (MLP) to achieve target state prediction. Secondly, to mitigate the effect of occlusion on target state saliency, we designed a spatiotemporal attention-based target appearance feature fusion (TSA-FF) target state fusion algorithm based on the attention mechanism. TSA-FF calculates adaptive fusion coefficients to enhance target state fusion, thereby improving the accuracy of subsequent data association. To demonstrate the effectiveness of the proposed method, we compared SimpleTrackV2 with the baseline model SimpleTrack on the MOT17 dataset. We also conducted ablation experiments on TSA-FF and LSTM-MP for SimpleTrackV2, exploring the optimal number of fusion frames and the impact of different loss functions on model performance. The experimental results show that SimpleTrackV2 handles camera jitter and target occlusion better, achieving improvements of 1.6%, 3.2%, and 6.1% in MOTA, IDF1, and HOTA, respectively, compared to the SimpleTrack algorithm.

摘要

多目标跟踪任务旨在为视频帧中的目标分配唯一的轨迹代码。大多数基于检测的跟踪方法使用卡尔曼滤波算法进行轨迹预测,直接利用相关目标特征进行轨迹更新。然而,在现实场景中,由于相机抖动和目标的短暂丢失,这种方法常常失败。本文基于目标的时间特征重新思考状态预测和融合,以解决这些问题,并在先前设计的SimpleTrack基础上提出了SimpleTrackV2算法。首先,为了解决复杂场景中线性运动模型预测性能不佳的问题,我们设计了一种基于长短期记忆(LSTM)的目标状态预测算法,称为LSTM-MP。该算法使用LSTM对目标的历史运动信息进行编码,并用多层感知器(MLP)进行解码,以实现目标状态预测。其次,为了减轻遮挡对目标状态显著性 的影响,我们基于注意力机制设计了一种基于时空注意力的目标外观特征融合(TSA-FF)目标状态融合算法。TSA-FF计算自适应融合系数以增强目标状态融合,从而提高后续数据关联的准确性。为了证明所提方法的有效性,我们在MOT17数据集上将SimpleTrackV2与基线模型SimpleTrack进行了比较。我们还对SimpleTrackV2的TSA-FF和LSTM-MP进行了消融实验,探索融合帧数的最佳值以及不同损失函数对模型性能的影响。实验结果表明,SimpleTrackV2在处理相机抖动和目标遮挡方面表现更好,与SimpleTrack算法相比,MOTA、IDF1和HOTA分别提高了1.6%、3.2%和6.1%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c18/11436168/ab8383a3d427/sensors-24-06015-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c18/11436168/c0182e711250/sensors-24-06015-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c18/11436168/31bfee1025cd/sensors-24-06015-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c18/11436168/e30da554690e/sensors-24-06015-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c18/11436168/33f6f98e06b3/sensors-24-06015-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c18/11436168/ab8383a3d427/sensors-24-06015-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c18/11436168/c0182e711250/sensors-24-06015-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c18/11436168/31bfee1025cd/sensors-24-06015-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c18/11436168/e30da554690e/sensors-24-06015-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c18/11436168/33f6f98e06b3/sensors-24-06015-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c18/11436168/ab8383a3d427/sensors-24-06015-g005.jpg

相似文献

1
SimpleTrackV2: Rethinking the Timing Characteristics for Multi-Object Tracking.SimpleTrackV2:重新思考多目标跟踪的时间特征
Sensors (Basel). 2024 Sep 17;24(18):6015. doi: 10.3390/s24186015.
2
SimpleTrack: Rethinking and Improving the JDE Approach for Multi-Object Tracking.SimpleTrack:重新思考并改进用于多目标跟踪的JDE方法
Sensors (Basel). 2022 Aug 5;22(15):5863. doi: 10.3390/s22155863.
3
STMMOT: Advancing multi-object tracking through spatiotemporal memory networks and multi-scale attention pyramids.STMMOT:通过时空记忆网络和多尺度注意力金字塔推进多目标跟踪
Neural Netw. 2023 Nov;168:363-379. doi: 10.1016/j.neunet.2023.09.047. Epub 2023 Sep 29.
4
A Long-Term Video Tracking Method for Group-Housed Pigs.一种用于群养猪的长期视频跟踪方法。
Animals (Basel). 2024 May 19;14(10):1505. doi: 10.3390/ani14101505.
5
An electric bicycle tracking algorithm for improved traffic management.一种用于改善交通管理的电动自行车跟踪算法。
Heliyon. 2024 Jun 8;10(13):e32708. doi: 10.1016/j.heliyon.2024.e32708. eCollection 2024 Jul 15.
6
Multi-Target Tracking Based on a Combined Attention Mechanism and Occlusion Sensing in a Behavior-Analysis System.基于行为分析系统中的联合注意力机制和遮挡感知的多目标跟踪。
Sensors (Basel). 2023 Mar 8;23(6):2956. doi: 10.3390/s23062956.
7
Robust Object Tracking Based on Motion Consistency.基于运动一致性的鲁棒目标跟踪
Sensors (Basel). 2018 Feb 13;18(2):572. doi: 10.3390/s18020572.
8
Adaptive Tracking of High-Maneuvering Targets Based on Multi-Feature Fusion Trajectory Clustering: LPI's Purpose.基于多特征融合轨迹聚类的高机动目标自适应跟踪:低截获概率(LPI)的目的。
Sensors (Basel). 2022 Jun 22;22(13):4713. doi: 10.3390/s22134713.
9
Multi-Feature Single Target Robust Tracking Fused with Particle Filter.融合粒子滤波器的多特征单目标鲁棒跟踪
Sensors (Basel). 2022 Feb 27;22(5):1879. doi: 10.3390/s22051879.
10
Multi-objective pedestrian tracking method based on YOLOv8 and improved DeepSORT.基于YOLOv8和改进的DeepSORT的多目标行人跟踪方法
Math Biosci Eng. 2024 Jan 3;21(2):1791-1805. doi: 10.3934/mbe.2024077.

本文引用的文献

1
SimpleTrack: Rethinking and Improving the JDE Approach for Multi-Object Tracking.SimpleTrack:重新思考并改进用于多目标跟踪的JDE方法
Sensors (Basel). 2022 Aug 5;22(15):5863. doi: 10.3390/s22155863.
2
Rethinking the Competition Between Detection and ReID in Multiobject Tracking.多目标跟踪中检测与重识别竞争关系的再思考
IEEE Trans Image Process. 2022;31:3182-3196. doi: 10.1109/TIP.2022.3165376. Epub 2022 Apr 25.
3
HOTA: A Higher Order Metric for Evaluating Multi-object Tracking.HOTA:一种用于评估多目标跟踪的高阶度量
Int J Comput Vis. 2021;129(2):548-578. doi: 10.1007/s11263-020-01375-2. Epub 2020 Oct 8.
4
Deep Learning for Person Re-Identification: A Survey and Outlook.用于行人重识别的深度学习:综述与展望
IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):2872-2893. doi: 10.1109/TPAMI.2021.3054775. Epub 2022 May 5.
5
Multi-Task Learning for Dense Prediction Tasks: A Survey.用于密集预测任务的多任务学习:一项综述。
IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3614-3633. doi: 10.1109/TPAMI.2021.3054719. Epub 2022 Jun 3.
6
Multiple target tracking by learning-based hierarchical association of detection responses.基于学习的检测响应分层关联的多目标跟踪。
IEEE Trans Pattern Anal Mach Intell. 2013 Apr;35(4):898-910. doi: 10.1109/TPAMI.2012.159.