• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多目标跟踪的检测、数据关联与分割。

On Detection, Data Association and Segmentation for Multi-Target Tracking.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2019 Sep;41(9):2146-2160. doi: 10.1109/TPAMI.2018.2849374. Epub 2018 Jun 21.

DOI:10.1109/TPAMI.2018.2849374
PMID:29994110
Abstract

In this work, we propose a tracker that differs from most existing multi-target trackers in two major ways. First, our tracker does not rely on a pre-trained object detector to get the initial object hypotheses. Second, our tracker's final output is the fine contours of the targets rather than traditional bounding boxes. Therefore, our tracker simultaneously solves three main problems: detection, data association and segmentation. This is especially important because the output of each of those three problems are highly correlated and the solution of one can greatly help improve the others. The proposed algorithm consists of two main components: structured learning and Lagrange dual decomposition. Our structured learning based tracker learns a model for each target and infers the best locations of all targets simultaneously in a video clip. The inference of our structured learning is achieved through a new Target Identity-aware Network Flow (TINF), where each node in the network encodes the probability of each target identity belonging to that node. The probabilities are obtained by training target specific models using a global structured learning technique. This is followed by proposed Lagrangian relaxation optimization to find the high quality solution to the network. This forms the first component of our tracker. The second component is Lagrange dual decomposition, which combines the structured learning tracker with a segmentation algorithm. For segmentation, multi-label Conditional Random Field (CRF) is applied to a superpixel based spatio-temporal graph in a segment of video, in order to assign background or target labels to every superpixel. We show how the multi-label CRF is combined with the structured learning tracker through our dual decomposition formulation. This leads to more accurate segmentation results and also helps better resolve typical difficulties in multiple target tracking, such as occlusion handling, ID-switch and track drifting. The experiments on diverse and challenging sequences show that our method achieves superior results compared to competitive approaches for detection, multiple target tracking as well as segmentation.

摘要

在这项工作中,我们提出了一种与大多数现有的多目标跟踪器在两个主要方面不同的跟踪器。首先,我们的跟踪器不依赖于预先训练的目标检测器来获取初始目标假设。其次,我们的跟踪器的最终输出是目标的精细轮廓,而不是传统的边界框。因此,我们的跟踪器同时解决了三个主要问题:检测、数据关联和分割。这一点尤为重要,因为这三个问题的输出高度相关,解决其中一个问题可以极大地帮助改善其他问题。所提出的算法由两个主要组成部分组成:结构学习和拉格朗日对偶分解。我们基于结构学习的跟踪器为每个目标学习一个模型,并在视频片段中同时推断所有目标的最佳位置。我们的结构学习推断是通过一个新的目标身份感知网络流(TINF)实现的,其中网络中的每个节点编码每个目标身份属于该节点的概率。这些概率是通过使用全局结构学习技术训练特定于目标的模型获得的。随后是提出的拉格朗日松弛优化,以找到网络的高质量解。这构成了我们跟踪器的第一个组成部分。第二个组成部分是拉格朗日对偶分解,它将结构学习跟踪器与分割算法结合在一起。对于分割,多标签条件随机场(CRF)应用于视频片段中的基于超像素的时空图,以便将背景或目标标签分配给每个超像素。我们展示了如何通过我们的对偶分解公式将多标签 CRF 与结构学习跟踪器结合起来。这导致更准确的分割结果,并有助于更好地解决多目标跟踪中的典型困难,如遮挡处理、ID 切换和跟踪漂移。在多样化和具有挑战性的序列上的实验表明,与竞争方法相比,我们的方法在检测、多目标跟踪和分割方面都取得了优越的结果。

相似文献

1
On Detection, Data Association and Segmentation for Multi-Target Tracking.多目标跟踪的检测、数据关联与分割。
IEEE Trans Pattern Anal Mach Intell. 2019 Sep;41(9):2146-2160. doi: 10.1109/TPAMI.2018.2849374. Epub 2018 Jun 21.
2
MBT3D: Deep learning based multi-object tracker for bumblebee 3D flight path estimation.基于深度学习的大黄蜂 3D 飞行轨迹估计的多目标跟踪器
PLoS One. 2023 Sep 22;18(9):e0291415. doi: 10.1371/journal.pone.0291415. eCollection 2023.
3
Binary Quadratic Programing for Online Tracking of Hundreds of People in Extremely Crowded Scenes.用于在极其拥挤的场景中对数百人进行在线跟踪的二进制二次规划。
IEEE Trans Pattern Anal Mach Intell. 2018 Mar;40(3):568-581. doi: 10.1109/TPAMI.2017.2687462. Epub 2017 Mar 24.
4
Spatio-temporal auxiliary particle filtering with l1-norm-based appearance model learning for robust visual tracking.基于 l1 范数的外观模型学习的时空辅助粒子滤波用于鲁棒视觉跟踪。
IEEE Trans Image Process. 2013 Feb;22(2):511-22. doi: 10.1109/TIP.2012.2218824. Epub 2012 Sep 13.
5
Efficient joint model learning, segmentation and model updating for visual tracking.高效联合模型学习、分割和模型更新的视觉跟踪。
Neural Netw. 2022 Mar;147:175-185. doi: 10.1016/j.neunet.2021.12.018. Epub 2022 Jan 1.
6
Conditional Random Field (CRF)-Boosting: Constructing a Robust Online Hybrid Boosting Multiple Object Tracker Facilitated by CRF Learning.条件随机场(CRF)增强:构建一个由CRF学习促进的强大在线混合增强多目标跟踪器。
Sensors (Basel). 2017 Mar 17;17(3):617. doi: 10.3390/s17030617.
7
Multi-Task Structure-Aware Context Modeling for Robust Keypoint-Based Object Tracking.用于基于关键点的稳健目标跟踪的多任务结构感知上下文建模
IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):915-927. doi: 10.1109/TPAMI.2018.2818132. Epub 2018 Mar 22.
8
Robust Individual-Cell/Object Tracking via PCANet Deep Network in Biomedicine and Computer Vision.通过PCANet深度网络在生物医学和计算机视觉中实现稳健的单细胞/物体跟踪
Biomed Res Int. 2016;2016:8182416. doi: 10.1155/2016/8182416. Epub 2016 Aug 25.
9
A Discriminative Single-Shot Segmentation Network for Visual Object Tracking.一种用于视觉目标跟踪的判别式单阶段分割网络。
IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9742-9755. doi: 10.1109/TPAMI.2021.3137933. Epub 2022 Nov 7.
10
Oversaturated part-based visual tracking via spatio-temporal context learning.基于时空上下文学习的过饱和局部视觉跟踪
Appl Opt. 2016 Sep 1;55(25):6960-8. doi: 10.1364/AO.55.006960.

引用本文的文献

1
Animal re-identification in video through track clustering.通过轨迹聚类实现视频中的动物重新识别。
Pattern Anal Appl. 2025;28(3):125. doi: 10.1007/s10044-025-01497-8. Epub 2025 Jun 19.
2
Deep Efficient Data Association for Multi-Object Tracking: Augmented with SSIM-Based Ambiguity Elimination.用于多目标跟踪的深度高效数据关联:基于结构相似性指数的模糊消除增强
J Imaging. 2024 Jul 16;10(7):171. doi: 10.3390/jimaging10070171.
3
A Generative Adversarial Network Fused with Dual-Attention Mechanism and Its Application in Multitarget Image Fine Segmentation.
基于生成对抗网络融合双注意力机制及其在多目标图像精细分割中的应用。
Comput Intell Neurosci. 2021 Dec 18;2021:2464648. doi: 10.1155/2021/2464648. eCollection 2021.
4
A Novel Multi-Feature Fusion Method in Merging Information of Heterogenous-View Data for Oil Painting Image Feature Extraction and Recognition.一种用于油画图像特征提取与识别的融合异构视图数据信息的新型多特征融合方法。
Front Neurorobot. 2021 Jul 12;15:709043. doi: 10.3389/fnbot.2021.709043. eCollection 2021.
5
Reinforcement Learning-Based Data Association for Multiple Target Tracking in Clutter.基于强化学习的数据关联在杂波中的多目标跟踪。
Sensors (Basel). 2020 Nov 18;20(22):6595. doi: 10.3390/s20226595.
6
Spatial-Semantic and Temporal Attention Mechanism-Based Online Multi-Object Tracking.基于空间-语义和时间注意机制的在线多目标跟踪。
Sensors (Basel). 2020 Mar 16;20(6):1653. doi: 10.3390/s20061653.