• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

揭示自我监督在多视角多人关联与跟踪中的力量。

Unveiling the Power of Self-Supervision for Multi-View Multi-Human Association and Tracking.

作者信息

Feng Wei, Wang Feifan, Han Ruize, Gan Yiyang, Qian Zekun, Hou Junhui, Wang Song

出版信息

IEEE Trans Pattern Anal Mach Intell. 2025 Jan;47(1):351-368. doi: 10.1109/TPAMI.2024.3463966. Epub 2024 Dec 4.

DOI:10.1109/TPAMI.2024.3463966
PMID:39298301
Abstract

Multi-view multi-human association and tracking (MvMHAT), is an emerging yet important problem for multi-person scene video surveillance, aiming to track a group of people over time in each view, as well as to identify the same person across different views at the same time, which is different from previous MOT and multi-camera MOT tasks only considering the over-time human tracking. This way, the videos for MvMHAT require more complex annotations while containing more information for self-learning. In this work, we tackle this problem with an end-to-end neural network in a self-supervised learning manner. Specifically, we propose to take advantage of the spatial-temporal self-consistency rationale by considering three properties of reflexivity, symmetry, and transitivity. Besides the reflexivity property that naturally holds, we design the self-supervised learning losses based on the properties of symmetry and transitivity, for both appearance feature learning and assignment matrix optimization, to associate multiple humans over time and across views. Furthermore, to promote the research on MvMHAT, we build two new large-scale benchmarks for the network training and testing of different algorithms. Extensive experiments on the proposed benchmarks verify the effectiveness of our method. We have released the benchmark and code to the public.

摘要

多视角多人关联与跟踪(MvMHAT)是多人场景视频监控中一个新兴但重要的问题,旨在在每个视角下随时间跟踪一群人,同时在同一时间跨不同视角识别同一个人,这与之前仅考虑随时间进行人体跟踪的多目标跟踪(MOT)和多摄像机多目标跟踪任务不同。通过这种方式,MvMHAT的视频需要更复杂的标注,同时包含更多用于自学习的信息。在这项工作中,我们以自监督学习的方式用端到端神经网络解决这个问题。具体来说,我们建议通过考虑自反性、对称性和传递性这三个属性来利用时空自一致性原理。除了自然成立的自反性属性外,我们基于对称性和传递性属性设计自监督学习损失,用于外观特征学习和分配矩阵优化,以随时间和跨视角关联多个人。此外,为了推动对MvMHAT的研究,我们为不同算法的网络训练和测试构建了两个新的大规模基准。在所提出的基准上进行的大量实验验证了我们方法的有效性。我们已将基准和代码公开发布。

相似文献

1
Unveiling the Power of Self-Supervision for Multi-View Multi-Human Association and Tracking.揭示自我监督在多视角多人关联与跟踪中的力量。
IEEE Trans Pattern Anal Mach Intell. 2025 Jan;47(1):351-368. doi: 10.1109/TPAMI.2024.3463966. Epub 2024 Dec 4.
2
Multiple Human Association and Tracking From Egocentric and Complementary Top Views.基于单目和互补顶视角的多人关联与跟踪。
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5225-5242. doi: 10.1109/TPAMI.2021.3070562. Epub 2022 Aug 4.
3
Multi-View Multi-Human Association With Deep Assignment Network.基于深度分配网络的多视角多人关联。
IEEE Trans Image Process. 2022;31:1830-1840. doi: 10.1109/TIP.2021.3139178. Epub 2022 Feb 16.
4
DANet: Semi-supervised differentiated auxiliaries guided network for video action recognition.DANet:用于视频动作识别的半监督差异化辅助引导网络。
Neural Netw. 2023 Jan;158:121-131. doi: 10.1016/j.neunet.2022.11.009. Epub 2022 Nov 17.
5
A modality-collaborative convolution and transformer hybrid network for unpaired multi-modal medical image segmentation with limited annotations.一种用于具有有限标注的未配对多模态医学图像分割的模态协作卷积与Transformer混合网络。
Med Phys. 2023 Sep;50(9):5460-5478. doi: 10.1002/mp.16338. Epub 2023 Mar 15.
6
High-Precision Multi-Object Tracking in Satellite Videos via Pixel-Wise Adaptive Feature Enhancement.通过逐像素自适应特征增强实现卫星视频中的高精度多目标跟踪
Sensors (Basel). 2024 Oct 9;24(19):6489. doi: 10.3390/s24196489.
7
Cross-view motion consistent self-supervised video inter-intra contrastive for action representation understanding.跨视图运动一致的自我监督视频内-外对比动作表示理解。
Neural Netw. 2024 Nov;179:106578. doi: 10.1016/j.neunet.2024.106578. Epub 2024 Jul 26.
8
Self-Supervised Multi-View Person Association and its Applications.自监督多视角人物关联及其应用。
IEEE Trans Pattern Anal Mach Intell. 2021 Aug;43(8):2794-2808. doi: 10.1109/TPAMI.2020.2974726. Epub 2021 Jul 1.
9
Deep Graph Metric Learning for Weakly Supervised Person Re-Identification.深度图度量学习在弱监督行人再识别中的应用。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):6074-6093. doi: 10.1109/TPAMI.2021.3084613. Epub 2022 Sep 14.
10
Inference-Domain Network Evolution: A New Perspective for One-Shot Multi-Object Tracking.推理域网络进化:一种新的单-shot 多目标跟踪视角。
IEEE Trans Image Process. 2023;32:2147-2159. doi: 10.1109/TIP.2023.3263104. Epub 2023 Apr 6.