• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AdapNet:用于弱监督动作识别和定位的可适应分解编解码器网络。

AdapNet: Adaptability Decomposing Encoder-Decoder Network for Weakly Supervised Action Recognition and Localization.

出版信息

IEEE Trans Neural Netw Learn Syst. 2023 Apr;34(4):1852-1863. doi: 10.1109/TNNLS.2019.2962815. Epub 2023 Apr 4.

DOI:10.1109/TNNLS.2019.2962815
PMID:31995502
Abstract

The point process is a solid framework to model sequential data, such as videos, by exploring the underlying relevance. As a challenging problem for high-level video understanding, weakly supervised action recognition and localization in untrimmed videos have attracted intensive research attention. Knowledge transfer by leveraging the publicly available trimmed videos as external guidance is a promising attempt to make up for the coarse-grained video-level annotation and improve the generalization performance. However, unconstrained knowledge transfer may bring about irrelevant noise and jeopardize the learning model. This article proposes a novel adaptability decomposing encoder-decoder network to transfer reliable knowledge between the trimmed and untrimmed videos for action recognition and localization by bidirectional point process modeling, given only video-level annotations. By decomposing the original features into the domain-adaptable and domain-specific ones based on their adaptability, trimmed-untrimmed knowledge transfer can be safely confined within a more coherent subspace. An encoder-decoder-based structure is carefully designed and jointly optimized to facilitate effective action classification and temporal localization. Extensive experiments are conducted on two benchmark data sets (i.e., THUMOS14 and ActivityNet1.3), and the experimental results clearly corroborate the efficacy of our method.

摘要

点过程是一种用于建模顺序数据(例如视频)的稳健框架,通过探索潜在的相关性来实现。作为高级视频理解的一个具有挑战性的问题,未修剪视频中的弱监督动作识别和定位引起了广泛的研究关注。通过利用公开的修剪视频作为外部指导进行知识迁移是一种很有前途的尝试,可以弥补粗粒度的视频级注释并提高泛化性能。然而,无约束的知识迁移可能会带来不相关的噪声,并危及学习模型。本文提出了一种新颖的适应性分解编码器-解码器网络,通过双向点过程建模,仅使用视频级注释,在修剪和未修剪的视频之间进行可靠知识的迁移,用于动作识别和定位。通过基于适应性将原始特征分解为域自适应和特定于域的特征,可以将修剪-未修剪的知识迁移安全地限制在更一致的子空间内。精心设计了基于编码器-解码器的结构,并进行联合优化,以促进有效的动作分类和时间定位。在两个基准数据集(即 THUMOS14 和 ActivityNet1.3)上进行了广泛的实验,实验结果清楚地证实了我们方法的有效性。

相似文献

1
AdapNet: Adaptability Decomposing Encoder-Decoder Network for Weakly Supervised Action Recognition and Localization.AdapNet:用于弱监督动作识别和定位的可适应分解编解码器网络。
IEEE Trans Neural Netw Learn Syst. 2023 Apr;34(4):1852-1863. doi: 10.1109/TNNLS.2019.2962815. Epub 2023 Apr 4.
2
StochasticFormer: Stochastic Modeling for Weakly Supervised Temporal Action Localization.随机Former:弱监督时间动作定位的随机建模
IEEE Trans Image Process. 2023;32:1379-1389. doi: 10.1109/TIP.2023.3244411. Epub 2023 Feb 23.
3
Deep Learning-Based Action Detection in Untrimmed Videos: A Survey.基于深度学习的未修剪视频动作检测:一项综述。
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4302-4320. doi: 10.1109/TPAMI.2022.3193611. Epub 2023 Mar 7.
4
Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks.基于对比评估网络的弱监督时间动作定位
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5886-5902. doi: 10.1109/TPAMI.2021.3078798. Epub 2022 Aug 4.
5
Bilateral Relation Distillation for Weakly Supervised Temporal Action Localization.用于弱监督时间动作定位的双边关系蒸馏
IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):11458-11471. doi: 10.1109/TPAMI.2023.3284853. Epub 2023 Sep 5.
6
Breaking Winner-Takes-All: Iterative-Winners-Out Networks for Weakly Supervised Temporal Action Localization.打破赢家通吃:用于弱监督时间动作定位的迭代赢家出局网络
IEEE Trans Image Process. 2019 Dec;28(12):5797-5808. doi: 10.1109/TIP.2019.2922108. Epub 2019 Jun 17.
7
Weakly supervised temporal action localization with actionness-guided false positive suppression.基于动作引导型假阳性抑制的弱监督时间动作定位。
Neural Netw. 2024 Jul;175:106307. doi: 10.1016/j.neunet.2024.106307. Epub 2024 Apr 15.
8
Adaptive Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization.自适应双流共识网络的弱监督时间动作定位。
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4136-4151. doi: 10.1109/TPAMI.2022.3189662. Epub 2023 Mar 7.
9
Multi-Hierarchical Category Supervision for Weakly-Supervised Temporal Action Localization.用于弱监督时间动作定位的多分层类别监督
IEEE Trans Image Process. 2021;30:9332-9344. doi: 10.1109/TIP.2021.3124671. Epub 2021 Nov 12.
10
Two-Branch Relational Prototypical Network for Weakly Supervised Temporal Action Localization.用于弱监督时间动作定位的双分支关系原型网络
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5729-5746. doi: 10.1109/TPAMI.2021.3076172. Epub 2022 Aug 4.

引用本文的文献

1
Image Representation Method Based on Relative Layer Entropy for Insulator Recognition.基于相对层熵的绝缘子识别图像表示方法
Entropy (Basel). 2020 Apr 8;22(4):419. doi: 10.3390/e22040419.
2
Enhanced Action Recognition Using Multiple Stream Deep Learning with Optical Flow and Weighted Sum.基于光流和加权和的多流深度学习增强动作识别
Sensors (Basel). 2020 Jul 13;20(14):3894. doi: 10.3390/s20143894.
3
A Novel LiDAR Data Classification Algorithm Combined CapsNet with ResNet.一种将胶囊网络(CapsNet)与残差网络(ResNet)相结合的新型激光雷达数据分类算法。
Sensors (Basel). 2020 Feb 19;20(4):1151. doi: 10.3390/s20041151.