• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重新审视用于时域动作定位的锚定机制。

Revisiting Anchor Mechanisms for Temporal Action Localization.

作者信息

Yang Le, Peng Houwen, Zhang Dingwen, Fu Jianlong, Han Junwei

出版信息

IEEE Trans Image Process. 2020 Aug 19;PP. doi: 10.1109/TIP.2020.3016486.

DOI:10.1109/TIP.2020.3016486
PMID:32813656
Abstract

Most of the current action localization methods follow an anchor-based pipeline: depicting action instances by pre-defined anchors, learning to select the anchors closest to the ground truth, and predicting the confidence of anchors with refinements. Pre-defined anchors set prior about the location and duration for action instances, which facilitates the localization for common action instances but limits the flexibility for tackling action instances with drastic varieties, especially for extremely short or extremely long ones. To address this problem, this paper proposes a novel anchor-free action localization module that assists action localization by temporal points. Specifically, this module represents an action instance as a point with its distances to the starting boundary and ending boundary, alleviating the pre-defined anchor restrictions in terms of action localization and duration. The proposed anchor-free module is capable of predicting the action instances whose duration is either extremely short or extremely long. By combining the proposed anchor-free module with a conventional anchor-based module, we propose a novel action localization framework, called A2Net. The cooperation between anchor-free and anchor-based modules achieves superior performance to the state-of-the-art on THUMOS14 (45.5% vs. 42.8%). Furthermore, comprehensive experiments demonstrate the complementarity between the anchor-free and the anchor-based module, making A2Net simple but effective.

摘要

当前大多数动作定位方法都遵循基于锚点的流程

通过预定义的锚点描绘动作实例,学习选择最接近真实值的锚点,并通过细化预测锚点的置信度。预定义的锚点为动作实例的位置和持续时间设置了先验信息,这有助于常见动作实例的定位,但限制了处理具有极大变化的动作实例的灵活性,特别是对于极短或极长的动作实例。为了解决这个问题,本文提出了一种新颖的无锚点动作定位模块,该模块通过时间点辅助动作定位。具体来说,该模块将动作实例表示为一个点,以及它到起始边界和结束边界的距离,从而在动作定位和持续时间方面减轻了预定义锚点的限制。所提出的无锚点模块能够预测持续时间极短或极长的动作实例。通过将所提出的无锚点模块与传统的基于锚点的模块相结合,我们提出了一种新颖的动作定位框架,称为A2Net。无锚点模块和基于锚点的模块之间的协作在THUMOS14上取得了优于现有技术的性能(45.5%对42.8%)。此外,全面的实验证明了无锚点模块和基于锚点的模块之间的互补性,使得A2Net简单而有效。

相似文献

1
Revisiting Anchor Mechanisms for Temporal Action Localization.重新审视用于时域动作定位的锚定机制。
IEEE Trans Image Process. 2020 Aug 19;PP. doi: 10.1109/TIP.2020.3016486.
2
A Temporal-Aware Relation and Attention Network for Temporal Action Localization.用于时间动作定位的时间感知关系与注意力网络。
IEEE Trans Image Process. 2022;31:4746-4760. doi: 10.1109/TIP.2022.3182866. Epub 2022 Jul 14.
3
PCG-TAL: Progressive Cross-Granularity Cooperation for Temporal Action Localization.PCG-TAL:用于时间动作定位的渐进式跨粒度合作
IEEE Trans Image Process. 2021;30:2103-2113. doi: 10.1109/TIP.2020.3044218. Epub 2021 Jan 25.
4
Compact Representation and Reliable Classification Learning for Point-Level Weakly-Supervised Action Localization.用于点级弱监督动作定位的紧凑表示与可靠分类学习
IEEE Trans Image Process. 2022;31:7363-7377. doi: 10.1109/TIP.2022.3222623. Epub 2022 Nov 30.
5
Structured Attention Composition for Temporal Action Localization.用于时间动作定位的结构化注意力合成
IEEE Trans Image Process. 2022 Jun 13;PP. doi: 10.1109/TIP.2022.3180925.
6
End-to-End Temporal Action Detection With Transformer.基于Transformer的端到端时域动作检测
IEEE Trans Image Process. 2022;31:5427-5441. doi: 10.1109/TIP.2022.3195321. Epub 2022 Aug 17.
7
PointPainting: 3D Object Detection Aided by Semantic Image Information.点画:基于语义图像信息辅助的 3D 目标检测。
Sensors (Basel). 2023 Mar 6;23(5):2868. doi: 10.3390/s23052868.
8
Consistency-Aware Anchor Pyramid Network for Crowd Localization.用于人群定位的一致性感知锚点金字塔网络。
IEEE Trans Pattern Anal Mach Intell. 2024 Apr 29;PP. doi: 10.1109/TPAMI.2024.3392013.
9
Adaptive Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization.自适应双流共识网络的弱监督时间动作定位。
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4136-4151. doi: 10.1109/TPAMI.2022.3189662. Epub 2023 Mar 7.
10
StochasticFormer: Stochastic Modeling for Weakly Supervised Temporal Action Localization.随机Former:弱监督时间动作定位的随机建模
IEEE Trans Image Process. 2023;32:1379-1389. doi: 10.1109/TIP.2023.3244411. Epub 2023 Feb 23.

引用本文的文献

1
MCMNET: Multi-Scale Context Modeling Network for Temporal Action Detection.MCMNET:用于时态动作检测的多尺度上下文建模网络。
Sensors (Basel). 2023 Aug 31;23(17):7563. doi: 10.3390/s23177563.
2
Non-Local Temporal Difference Network for Temporal Action Detection.非局部时域差分网络的时域动作检测。
Sensors (Basel). 2022 Nov 1;22(21):8396. doi: 10.3390/s22218396.
3
Football Game Video Analysis Method with Deep Learning.基于深度学习的足球比赛视频分析方法。
Comput Intell Neurosci. 2022 Jun 8;2022:3284156. doi: 10.1155/2022/3284156. eCollection 2022.