• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于对比评估网络的弱监督时间动作定位

Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks.

作者信息

Liu Ziyi, Wang Le, Zhang Qilin, Tang Wei, Zheng Nanning, Hua Gang

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5886-5902. doi: 10.1109/TPAMI.2021.3078798. Epub 2022 Aug 4.

DOI:10.1109/TPAMI.2021.3078798
PMID:33974541
Abstract

Given only video-level action categorical labels during training, weakly-supervised temporal action localization (WS-TAL) learns to detect action instances and locates their temporal boundaries in untrimmed videos. Compared to its fully supervised counterpart, WS-TAL is more cost-effective in data labeling and thus favorable in practical applications. However, the coarse video-level supervision inevitably incurs ambiguities in action localization, especially in untrimmed videos containing multiple action instances. To overcome this challenge, we observe that significant temporal contrasts among video snippets, e.g., caused by temporal discontinuities and sudden changes, often occur around true action boundaries. This motivates us to introduce a Contrast-based Localization EvaluAtioN Network (CleanNet), whose core is a new temporal action proposal evaluator, which provides fine-grained pseudo supervision by leveraging the temporal contrasts among snippet-level classification predictions. As a result, the uncertainty in locating action instances can be resolved via evaluating their temporal contrast scores. Moreover, the new action localization module is an integral part of CleanNet which enables end-to-end training. This is in contrast to many existing WS-TAL methods where action localization is merely a post-processing step. Besides, we also explore the usage of temporal contrast on temporal action proposal (TAP) generation task, which we believe is the first attempt with the weak supervision setting. Experiments on the THUMOS14, ActivityNet v1.2 and v1.3 datasets validate the efficacy of our method against existing state-of-the-art WS-TAL algorithms.

摘要

在训练过程中仅给定视频级别的动作类别标签,弱监督时间动作定位(WS-TAL)旨在学习检测动作实例并在未修剪的视频中定位其时间边界。与完全监督的方法相比,WS-TAL在数据标注方面更具成本效益,因此在实际应用中更具优势。然而,粗略的视频级监督不可避免地会在动作定位中产生模糊性,尤其是在包含多个动作实例的未修剪视频中。为了克服这一挑战,我们观察到视频片段之间显著的时间对比,例如由时间不连续性和突然变化引起的对比,通常发生在真实动作边界周围。这促使我们引入基于对比的定位评估网络(CleanNet),其核心是一个新的时间动作提议评估器,它通过利用片段级分类预测之间的时间对比来提供细粒度的伪监督。结果,通过评估动作实例的时间对比分数,可以解决定位动作实例时的不确定性。此外,新的动作定位模块是CleanNet的一个组成部分,它支持端到端训练。这与许多现有的WS-TAL方法形成对比,在这些方法中动作定位仅仅是一个后处理步骤。此外,我们还探索了时间对比在时间动作提议(TAP)生成任务中的应用,我们认为这是在弱监督设置下的首次尝试。在THUMOS14、ActivityNet v1.2和v1.3数据集上的实验验证了我们的方法相对于现有最先进的WS-TAL算法的有效性。

相似文献

1
Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks.基于对比评估网络的弱监督时间动作定位
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5886-5902. doi: 10.1109/TPAMI.2021.3078798. Epub 2022 Aug 4.
2
Adaptive Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization.自适应双流共识网络的弱监督时间动作定位。
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4136-4151. doi: 10.1109/TPAMI.2022.3189662. Epub 2023 Mar 7.
3
StochasticFormer: Stochastic Modeling for Weakly Supervised Temporal Action Localization.随机Former:弱监督时间动作定位的随机建模
IEEE Trans Image Process. 2023;32:1379-1389. doi: 10.1109/TIP.2023.3244411. Epub 2023 Feb 23.
4
Ensemble Prototype Network For Weakly Supervised Temporal Action Localization.用于弱监督时间动作定位的集成原型网络
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4560-4574. doi: 10.1109/TNNLS.2024.3377468. Epub 2025 Feb 28.
5
ContextLoc++: A Unified Context Model for Temporal Action Localization.ContextLoc++:用于时间动作定位的统一上下文模型。
IEEE Trans Pattern Anal Mach Intell. 2023 Aug;45(8):9504-9519. doi: 10.1109/TPAMI.2023.3237597. Epub 2023 Jun 30.
6
Weakly supervised temporal action localization with actionness-guided false positive suppression.基于动作引导型假阳性抑制的弱监督时间动作定位。
Neural Netw. 2024 Jul;175:106307. doi: 10.1016/j.neunet.2024.106307. Epub 2024 Apr 15.
7
FineAction: A Fine-Grained Video Dataset for Temporal Action Localization.精细动作:用于时间动作定位的细粒度视频数据集
IEEE Trans Image Process. 2022;31:6937-6950. doi: 10.1109/TIP.2022.3217368. Epub 2022 Nov 8.
8
Vectorized Evidential Learning for Weakly-Supervised Temporal Action Localization.
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):15949-15963. doi: 10.1109/TPAMI.2023.3311447. Epub 2023 Nov 3.
9
Deep Motion Prior for Weakly-Supervised Temporal Action Localization.用于弱监督时间动作定位的深度运动先验
IEEE Trans Image Process. 2022;31:5203-5213. doi: 10.1109/TIP.2022.3193752. Epub 2022 Aug 4.
10
Improving Weakly Supervised Temporal Action Localization by Exploiting Multi-Resolution Information in Temporal Domain.通过利用时域中的多分辨率信息改进弱监督时间动作定位
IEEE Trans Image Process. 2021;30:6659-6672. doi: 10.1109/TIP.2021.3089355. Epub 2021 Jul 26.