Suppr超能文献

用于生成时间动作提议的时间间隙感知注意力模型。

Temporal Gap-Aware Attention Model for Temporal Action Proposal Generation.

作者信息

Sooksatra Sorn, Watcharapinchai Sitapa

机构信息

National Electronic and Computer Technology Center, National Science and Technology Development Agency, Khlong Nueng, Khlong Luang District, Pathum Thani 12120, Thailand.

出版信息

J Imaging. 2024 Nov 29;10(12):307. doi: 10.3390/jimaging10120307.

Abstract

Temporal action proposal generation is a method for extracting temporal action instances or proposals from untrimmed videos. Existing methods often struggle to segment contiguous action proposals, which are a group of action boundaries with small temporal gaps. To address this limitation, we propose incorporating an attention mechanism to weigh the importance of each proposal within a contiguous group. This mechanism leverages the gap displacement between proposals to calculate attention scores, enabling a more accurate localization of action boundaries. We evaluate our method against a state-of-the-art boundary-based baseline on ActivityNet v1.3 and Thumos 2014 datasets. The experimental results demonstrate that our approach significantly improves the performance of short-duration and contiguous action proposals, achieving an average recall of 78.22%.

摘要

时态动作提议生成是一种从未经剪辑的视频中提取时态动作实例或提议的方法。现有方法常常难以分割连续的动作提议,连续的动作提议是一组具有小时空间隙的动作边界。为解决这一局限性,我们建议纳入一种注意力机制,以权衡连续组内每个提议的重要性。该机制利用提议之间的间隙位移来计算注意力分数,从而更准确地定位动作边界。我们在ActivityNet v1.3和Thumos 2014数据集上,将我们的方法与基于边界的最新基线进行了评估。实验结果表明,我们的方法显著提高了短时长和连续动作提议的性能,平均召回率达到78.22%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/70dd/11678434/8cf663beef8a/jimaging-10-00307-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验