• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于生成时间动作提议的时间间隙感知注意力模型。

Temporal Gap-Aware Attention Model for Temporal Action Proposal Generation.

作者信息

Sooksatra Sorn, Watcharapinchai Sitapa

机构信息

National Electronic and Computer Technology Center, National Science and Technology Development Agency, Khlong Nueng, Khlong Luang District, Pathum Thani 12120, Thailand.

出版信息

J Imaging. 2024 Nov 29;10(12):307. doi: 10.3390/jimaging10120307.

DOI:10.3390/jimaging10120307
PMID:39728204
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11678434/
Abstract

Temporal action proposal generation is a method for extracting temporal action instances or proposals from untrimmed videos. Existing methods often struggle to segment contiguous action proposals, which are a group of action boundaries with small temporal gaps. To address this limitation, we propose incorporating an attention mechanism to weigh the importance of each proposal within a contiguous group. This mechanism leverages the gap displacement between proposals to calculate attention scores, enabling a more accurate localization of action boundaries. We evaluate our method against a state-of-the-art boundary-based baseline on ActivityNet v1.3 and Thumos 2014 datasets. The experimental results demonstrate that our approach significantly improves the performance of short-duration and contiguous action proposals, achieving an average recall of 78.22%.

摘要

时态动作提议生成是一种从未经剪辑的视频中提取时态动作实例或提议的方法。现有方法常常难以分割连续的动作提议,连续的动作提议是一组具有小时空间隙的动作边界。为解决这一局限性,我们建议纳入一种注意力机制,以权衡连续组内每个提议的重要性。该机制利用提议之间的间隙位移来计算注意力分数,从而更准确地定位动作边界。我们在ActivityNet v1.3和Thumos 2014数据集上,将我们的方法与基于边界的最新基线进行了评估。实验结果表明,我们的方法显著提高了短时长和连续动作提议的性能,平均召回率达到78.22%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70dd/11678434/19fea7916b49/jimaging-10-00307-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70dd/11678434/8cf663beef8a/jimaging-10-00307-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70dd/11678434/1815bfb6aa4c/jimaging-10-00307-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70dd/11678434/13f250b829de/jimaging-10-00307-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70dd/11678434/8a449b9c84b3/jimaging-10-00307-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70dd/11678434/19fea7916b49/jimaging-10-00307-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70dd/11678434/8cf663beef8a/jimaging-10-00307-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70dd/11678434/1815bfb6aa4c/jimaging-10-00307-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70dd/11678434/13f250b829de/jimaging-10-00307-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70dd/11678434/8a449b9c84b3/jimaging-10-00307-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70dd/11678434/19fea7916b49/jimaging-10-00307-g005.jpg

相似文献

1
Temporal Gap-Aware Attention Model for Temporal Action Proposal Generation.用于生成时间动作提议的时间间隙感知注意力模型。
J Imaging. 2024 Nov 29;10(12):307. doi: 10.3390/jimaging10120307.
2
Multi-Level Content-Aware Boundary Detection for Temporal Action Proposal Generation.用于生成时间动作建议的多级内容感知边界检测
IEEE Trans Image Process. 2023;32:6090-6101. doi: 10.1109/TIP.2023.3328471. Epub 2023 Nov 8.
3
Online action proposal generation using spatio-temporal attention network.基于时空注意力网络的在线动作建议生成。
Neural Netw. 2022 Sep;153:518-529. doi: 10.1016/j.neunet.2022.06.032. Epub 2022 Jun 30.
4
Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks.基于对比评估网络的弱监督时间动作定位
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5886-5902. doi: 10.1109/TPAMI.2021.3078798. Epub 2022 Aug 4.
5
A Comprehensive Review on Temporal-Action Proposal Generation.关于时间动作建议生成的综合综述
J Imaging. 2022 Jul 23;8(8):207. doi: 10.3390/jimaging8080207.
6
Adaptive Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization.自适应双流共识网络的弱监督时间动作定位。
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4136-4151. doi: 10.1109/TPAMI.2022.3189662. Epub 2023 Mar 7.
7
Context-Aware Proposal-Boundary Network With Structural Consistency for Audiovisual Event Localization.
IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):15872-15882. doi: 10.1109/TNNLS.2023.3290083. Epub 2024 Oct 29.
8
RecapNet: Action Proposal Generation Mimicking Human Cognitive Process.RecapNet:模仿人类认知过程的动作提案生成。
IEEE Trans Cybern. 2021 Dec;51(12):6017-6028. doi: 10.1109/TCYB.2020.2965196. Epub 2021 Dec 22.
9
Segment-Tube: Spatio-Temporal Action Localization in Untrimmed Videos with Per-Frame Segmentation.Segment-Tube:基于逐帧分割的非修剪视频中的时空动作定位。
Sensors (Basel). 2018 May 22;18(5):1657. doi: 10.3390/s18051657.
10
MCMNET: Multi-Scale Context Modeling Network for Temporal Action Detection.MCMNET:用于时态动作检测的多尺度上下文建模网络。
Sensors (Basel). 2023 Aug 31;23(17):7563. doi: 10.3390/s23177563.

本文引用的文献

1
Multi-Level Content-Aware Boundary Detection for Temporal Action Proposal Generation.用于生成时间动作建议的多级内容感知边界检测
IEEE Trans Image Process. 2023;32:6090-6101. doi: 10.1109/TIP.2023.3328471. Epub 2023 Nov 8.
2
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.