• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于弱监督时间动作定位的双分支关系原型网络

Two-Branch Relational Prototypical Network for Weakly Supervised Temporal Action Localization.

作者信息

Huang Linjiang, Huang Yan, Ouyang Wanli, Wang Liang

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5729-5746. doi: 10.1109/TPAMI.2021.3076172. Epub 2022 Aug 4.

DOI:10.1109/TPAMI.2021.3076172
PMID:33909560
Abstract

As a challenging task of high-level video understanding, weakly supervised temporal action localization has attracted more attention recently. With only video-level category labels, this task should indistinguishably identify the background and action categories frame by frame. However, it is non-trivial to achieve this in untrimmed videos, due to the unconstrained background, complex and multi-label actions. With the observation that these difficulties are mainly brought by the large variations within background and actions, we propose to address these challenges from the perspective of modeling variations. Moreover, it is desired to further reduce the variations, or learn compact features, so as to cast the problem of background identification as rejecting background and alleviate the contradiction between classification and detection. Accordingly, in this paper, we propose a two-branch relational prototypical network. The first branch, namely action-branch, adopts class-wise prototypes and mainly acts as an auxiliary to introduce priori knowledge about label dependencies and be a guide for the second branch. Meanwhile, the second branch, namely sub-branch, starts with multiple prototypes, namely sub-prototypes, to enable a powerful ability of modeling variations. As a further benefit, we elaborately design a multi-label clustering loss based on the sub-prototypes to learn compact features under the multi-label setting. The two branches are associated using the correspondences between two types of prototypes, leading to a special two-stage classifier in the s-branch, on the other hand, the two branches serve as regularization terms to each other, improving the final performance. Ablation studies find that the proposed model is capable of modeling classes with large variations and learning compact features. Extensive experimental evaluations on Thumos14, MultiThumos and ActivityNet datasets demonstrate the effectiveness of the proposed method and superior performance over state-of-the-art approaches.

摘要

作为高级视频理解中的一项具有挑战性的任务,弱监督时间动作定位最近受到了更多关注。仅使用视频级别的类别标签,此任务需要逐帧无差别地识别背景和动作类别。然而,在未修剪的视频中实现这一点并非易事,因为背景不受约束、动作复杂且具有多标签。通过观察发现这些困难主要是由背景和动作中的巨大变化带来的,我们建议从建模变化的角度来应对这些挑战。此外,期望进一步减少变化或学习紧凑特征,以便将背景识别问题转化为拒绝背景,并缓解分类与检测之间的矛盾。因此,在本文中,我们提出了一种双分支关系原型网络。第一个分支,即动作分支,采用类别原型,主要作为辅助来引入关于标签依赖的先验知识,并为第二个分支提供指导。同时,第二个分支,即子分支,从多个原型(即子原型)开始,以具备强大的建模变化能力。作为进一步的优势,我们基于子原型精心设计了一种多标签聚类损失,以在多标签设置下学习紧凑特征。两个分支通过两种原型之间的对应关系相关联,在子分支中形成了一个特殊的两阶段分类器,另一方面,两个分支相互作为正则化项,提高了最终性能。消融研究发现,所提出的模型能够对具有大变化的类别进行建模并学习紧凑特征。在Thumos14、MultiThumos和ActivityNet数据集上进行的广泛实验评估证明了所提出方法的有效性以及相对于现有方法的优越性能。

相似文献

1
Two-Branch Relational Prototypical Network for Weakly Supervised Temporal Action Localization.用于弱监督时间动作定位的双分支关系原型网络
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5729-5746. doi: 10.1109/TPAMI.2021.3076172. Epub 2022 Aug 4.
2
Modeling Sub-Actions for Weakly Supervised Temporal Action Localization.用于弱监督时间动作定位的子动作建模
IEEE Trans Image Process. 2021;30:5154-5167. doi: 10.1109/TIP.2021.3078324. Epub 2021 May 24.
3
Ensemble Prototype Network For Weakly Supervised Temporal Action Localization.用于弱监督时间动作定位的集成原型网络
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4560-4574. doi: 10.1109/TNNLS.2024.3377468. Epub 2025 Feb 28.
4
Multi-Hierarchical Category Supervision for Weakly-Supervised Temporal Action Localization.用于弱监督时间动作定位的多分层类别监督
IEEE Trans Image Process. 2021;30:9332-9344. doi: 10.1109/TIP.2021.3124671. Epub 2021 Nov 12.
5
Adaptive Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization.自适应双流共识网络的弱监督时间动作定位。
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4136-4151. doi: 10.1109/TPAMI.2022.3189662. Epub 2023 Mar 7.
6
Adaptive Prototype Learning for Weakly-Supervised Temporal Action Localization.用于弱监督时间动作定位的自适应原型学习
IEEE Trans Image Process. 2025;34:3154-3168. doi: 10.1109/TIP.2024.3431915.
7
Weakly supervised temporal action localization with actionness-guided false positive suppression.基于动作引导型假阳性抑制的弱监督时间动作定位。
Neural Netw. 2024 Jul;175:106307. doi: 10.1016/j.neunet.2024.106307. Epub 2024 Apr 15.
8
Improving Weakly Supervised Temporal Action Localization by Exploiting Multi-Resolution Information in Temporal Domain.通过利用时域中的多分辨率信息改进弱监督时间动作定位
IEEE Trans Image Process. 2021;30:6659-6672. doi: 10.1109/TIP.2021.3089355. Epub 2021 Jul 26.
9
StochasticFormer: Stochastic Modeling for Weakly Supervised Temporal Action Localization.随机Former:弱监督时间动作定位的随机建模
IEEE Trans Image Process. 2023;32:1379-1389. doi: 10.1109/TIP.2023.3244411. Epub 2023 Feb 23.
10
Breaking Winner-Takes-All: Iterative-Winners-Out Networks for Weakly Supervised Temporal Action Localization.打破赢家通吃:用于弱监督时间动作定位的迭代赢家出局网络
IEEE Trans Image Process. 2019 Dec;28(12):5797-5808. doi: 10.1109/TIP.2019.2922108. Epub 2019 Jun 17.

引用本文的文献

1
Aerial scene understanding in the wild: Multi-scene recognition via prototype-based memory networks.野外空中场景理解:通过基于原型的记忆网络进行多场景识别。
ISPRS J Photogramm Remote Sens. 2021 Jul;177:89-102. doi: 10.1016/j.isprsjprs.2021.04.006.