PCG-TAL: Progressive Cross-Granularity Cooperation for Temporal Action Localization.

Authors

Su Rui, Xu Dong, Sheng Lu, Ouyang Wanli

Publication information

IEEE Trans Image Process. 2021;30:2103-2113. doi: 10.1109/TIP.2020.3044218. Epub 2021 Jan 25.

DOI: 10.1109/TIP.2020.3044218
PMID: 33332270

Abstract

There are two major lines of work in temporal action localization: anchor-based and frame-based approaches. However, each line is inherently limited to a certain detection granularity and cannot simultaneously achieve high recall and accurate action boundaries. In this work, we propose a progressive cross-granularity cooperation (PCG-TAL) framework that exploits the complementarity between the anchor-based and frame-based paradigms, as well as between two-view clues (i.e., appearance and motion). Specifically, our new Anchor-Frame Cooperation (AFC) module integrates both two-granularity and two-stream knowledge at the feature and proposal levels, both within each AFC module and across adjacent AFC modules. The RGB-stream AFC module and the flow-stream AFC module are stacked sequentially to form a progressive localization framework. The whole framework can be learned end to end, while temporal action localization performance improves progressively from stage to stage. Our framework outperforms state-of-the-art methods on three benchmark datasets, THUMOS14, ActivityNet v1.3 and UCF-101-24, which demonstrates its effectiveness.
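
The complementarity between the two granularities can be illustrated with a toy example. The sketch below is not the AFC module from the paper; it is a minimal illustration, under our own assumptions, of why coarse anchor segments and per-frame scores help each other. The function names (`temporal_iou`, `frame_based_segments`, `refine_anchors`), the actionness threshold, and the IoU cutoff are all hypothetical choices made for this example.

```python
from typing import List, Tuple

Segment = Tuple[float, float]  # (start, end) in frame indices, end exclusive


def temporal_iou(a: Segment, b: Segment) -> float:
    """Intersection-over-union of two 1-D temporal segments."""
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0


def frame_based_segments(scores: List[float], threshold: float = 0.5) -> List[Segment]:
    """Group consecutive frames with score >= threshold into segments.

    This stands in for a frame-based detector: fine boundaries, but a
    single low-scoring frame can split or drop an action.
    """
    segments: List[Segment] = []
    start = None
    for i, score in enumerate(scores):
        if score >= threshold and start is None:
            start = i
        elif score < threshold and start is not None:
            segments.append((float(start), float(i)))
            start = None
    if start is not None:
        segments.append((float(start), float(len(scores))))
    return segments


def refine_anchors(anchors: List[Segment], frame_segs: List[Segment],
                   min_iou: float = 0.5) -> List[Segment]:
    """Snap each anchor to its best-overlapping frame-based segment.

    Hypothetical fusion rule for illustration only: an anchor keeps its
    own boundaries unless some frame-based segment overlaps it with
    IoU >= min_iou, in which case the finer boundaries are adopted.
    """
    refined: List[Segment] = []
    for anchor in anchors:
        best = max(frame_segs, key=lambda seg: temporal_iou(anchor, seg), default=None)
        if best is not None and temporal_iou(anchor, best) >= min_iou:
            refined.append(best)
        else:
            refined.append(anchor)
    return refined


# Toy input: per-frame actionness scores and two coarse anchors.
scores = [0.1, 0.2, 0.9, 0.8, 0.9, 0.7, 0.1, 0.1]
anchors = [(0.0, 4.0), (1.0, 6.0)]
frame_segs = frame_based_segments(scores)
refined = refine_anchors(anchors, frame_segs)
```

Here the anchors supply candidate coverage (recall) and the frame-level scores supply sharper boundaries. The framework described in the abstract exchanges information at both the feature and proposal levels and across the RGB and flow streams, which this toy rule does not attempt to model.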

