• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于弱监督时间动作定位的语义和时间上下文关联学习

Semantic and Temporal Contextual Correlation Learning for Weakly-Supervised Temporal Action Localization.

作者信息

Fu Jie, Gao Junyu, Xu Changsheng

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):12427-12443. doi: 10.1109/TPAMI.2023.3287208. Epub 2023 Sep 5.

DOI:10.1109/TPAMI.2023.3287208
PMID:37335790
Abstract

Weakly-supervised temporal action localization (WSTAL) aims to automatically identify and localize action instances in untrimmed videos with only video-level labels as supervision. In this task, there exist two challenges: (1) how to accurately discover the action categories in an untrimmed video (what to discover); (2) how to elaborately focus on the integral temporal interval of each action instance (where to focus). Empirically, to discover the action categories, discriminative semantic information should be extracted, while robust temporal contextual information is beneficial for complete action localization. However, most existing WSTAL methods ignore to explicitly and jointly model the semantic and temporal contextual correlation information for the above two challenges. In this article, a Semantic and Temporal Contextual Correlation Learning Network (STCL-Net) with the semantic (SCL) and temporal contextual correlation learning (TCL) modules is proposed, which achieves both accurate action discovery and complete action localization by modeling the semantic and temporal contextual correlation information for each snippet in the inter- and intra-video manners respectively. It is noteworthy that the two proposed modules are both designed in a unified dynamic correlation-embedding paradigm. Extensive experiments are performed on different benchmarks. On all the benchmarks, our proposed method exhibits superior or comparable performance in comparison to the existing state-of-the-art models, especially achieving gains as high as 7.2% in terms of the average mAP on THUMOS-14. In addition, comprehensive ablation studies also verify the effectiveness and robustness of each component in our model.

摘要

弱监督时间动作定位(WSTAL)旨在仅以视频级标签作为监督,在未修剪的视频中自动识别和定位动作实例。在这项任务中,存在两个挑战:(1)如何在未修剪的视频中准确发现动作类别(发现什么);(2)如何精心聚焦于每个动作实例的完整时间间隔(聚焦何处)。根据经验,为了发现动作类别,应提取有区分性的语义信息,而强大的时间上下文信息有助于完整的动作定位。然而,大多数现有的WSTAL方法都忽略了针对上述两个挑战,显式地联合建模语义和时间上下文相关信息。在本文中,提出了一种具有语义(SCL)和时间上下文相关学习(TCL)模块的语义和时间上下文相关学习网络(STCL-Net),该网络通过分别以视频间和视频内的方式为每个片段建模语义和时间上下文相关信息,实现了准确的动作发现和完整的动作定位。值得注意的是,所提出的两个模块均采用统一的动态相关嵌入范式设计。在不同的基准上进行了广泛的实验。在所有基准上,与现有的最先进模型相比,我们提出的方法表现出优异或相当的性能,特别是在THUMOS-14上,平均mAP方面实现了高达7.2%的提升。此外,全面的消融研究也验证了我们模型中每个组件的有效性和鲁棒性。

相似文献

1
Semantic and Temporal Contextual Correlation Learning for Weakly-Supervised Temporal Action Localization.用于弱监督时间动作定位的语义和时间上下文关联学习
IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):12427-12443. doi: 10.1109/TPAMI.2023.3287208. Epub 2023 Sep 5.
2
Deep Motion Prior for Weakly-Supervised Temporal Action Localization.用于弱监督时间动作定位的深度运动先验
IEEE Trans Image Process. 2022;31:5203-5213. doi: 10.1109/TIP.2022.3193752. Epub 2022 Aug 4.
3
StochasticFormer: Stochastic Modeling for Weakly Supervised Temporal Action Localization.随机Former:弱监督时间动作定位的随机建模
IEEE Trans Image Process. 2023;32:1379-1389. doi: 10.1109/TIP.2023.3244411. Epub 2023 Feb 23.
4
Compact Representation and Reliable Classification Learning for Point-Level Weakly-Supervised Action Localization.用于点级弱监督动作定位的紧凑表示与可靠分类学习
IEEE Trans Image Process. 2022;31:7363-7377. doi: 10.1109/TIP.2022.3222623. Epub 2022 Nov 30.
5
Bilateral Relation Distillation for Weakly Supervised Temporal Action Localization.用于弱监督时间动作定位的双边关系蒸馏
IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):11458-11471. doi: 10.1109/TPAMI.2023.3284853. Epub 2023 Sep 5.
6
Ensemble Prototype Network For Weakly Supervised Temporal Action Localization.用于弱监督时间动作定位的集成原型网络
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4560-4574. doi: 10.1109/TNNLS.2024.3377468. Epub 2025 Feb 28.
7
Breaking Winner-Takes-All: Iterative-Winners-Out Networks for Weakly Supervised Temporal Action Localization.打破赢家通吃:用于弱监督时间动作定位的迭代赢家出局网络
IEEE Trans Image Process. 2019 Dec;28(12):5797-5808. doi: 10.1109/TIP.2019.2922108. Epub 2019 Jun 17.
8
Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks.基于对比评估网络的弱监督时间动作定位
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5886-5902. doi: 10.1109/TPAMI.2021.3078798. Epub 2022 Aug 4.
9
Multi-Hierarchical Category Supervision for Weakly-Supervised Temporal Action Localization.用于弱监督时间动作定位的多分层类别监督
IEEE Trans Image Process. 2021;30:9332-9344. doi: 10.1109/TIP.2021.3124671. Epub 2021 Nov 12.
10
TCGL: Temporal Contrastive Graph for Self-Supervised Video Representation Learning.TCGL:用于自监督视频表征学习的时间对比图
IEEE Trans Image Process. 2022;31:1978-1993. doi: 10.1109/TIP.2022.3147032. Epub 2022 Feb 18.