• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于时空拉普拉斯金字塔的动作识别。

Spatio-temporal Laplacian pyramid coding for action recognition.

出版信息

IEEE Trans Cybern. 2014 Jun;44(6):817-27. doi: 10.1109/TCYB.2013.2273174. Epub 2013 Jul 31.

DOI:10.1109/TCYB.2013.2273174
PMID:23912503
Abstract

We present a novel descriptor, called spatio-temporal Laplacian pyramid coding (STLPC), for holistic representation of human actions. In contrast to sparse representations based on detected local interest points, STLPC regards a video sequence as a whole with spatio-temporal features directly extracted from it, which prevents the loss of information in sparse representations. Through decomposing each sequence into a set of band-pass-filtered components, the proposed pyramid model localizes features residing at different scales, and therefore is able to effectively encode the motion information of actions. To make features further invariant and resistant to distortions as well as noise, a bank of 3-D Gabor filters is applied to each level of the Laplacian pyramid, followed by max pooling within filter bands and over spatio-temporal neighborhoods. Since the convolving and pooling are performed spatio-temporally, the coding model can capture structural and motion information simultaneously and provide an informative representation of actions. The proposed method achieves superb recognition rates on the KTH, the multiview IXMAS, the challenging UCF Sports, and the newly released HMDB51 datasets. It outperforms state of the art methods showing its great potential on action recognition.

摘要

我们提出了一种新的描述符,称为时空拉普拉斯金字塔编码(STLPC),用于整体表示人类动作。与基于检测到的局部兴趣点的稀疏表示不同,STLPC 将视频序列视为整体,直接从中提取时空特征,从而防止稀疏表示中的信息丢失。通过将每个序列分解为一组带通滤波分量,所提出的金字塔模型将特征定位在不同的尺度上,因此能够有效地编码动作的运动信息。为了使特征进一步不变且能够抵抗失真和噪声,对拉普拉斯金字塔的每一级应用一组 3D Gabor 滤波器,然后在滤波器带和时空邻域内进行最大池化。由于卷积和池化是在时空上进行的,因此编码模型可以同时捕获结构和运动信息,并提供动作的信息表示。该方法在 KTH、多视图 IXMAS、具有挑战性的 UCF Sports 和新发布的 HMDB51 数据集上实现了出色的识别率。它优于最先进的方法,显示了其在动作识别方面的巨大潜力。

相似文献

1
Spatio-temporal Laplacian pyramid coding for action recognition.基于时空拉普拉斯金字塔的动作识别。
IEEE Trans Cybern. 2014 Jun;44(6):817-27. doi: 10.1109/TCYB.2013.2273174. Epub 2013 Jul 31.
2
Learning sparse representations for human action recognition.学习人类动作识别的稀疏表示。
IEEE Trans Pattern Anal Mach Intell. 2012 Aug;34(8):1576-88. doi: 10.1109/TPAMI.2011.253.
3
Evaluation of color spatio-temporal interest points for human action recognition.用于人体动作识别的彩色时空兴趣点评估。
IEEE Trans Image Process. 2014 Apr;23(4):1569-80. doi: 10.1109/TIP.2014.2302677.
4
Learning Spatio-Temporal Representations for Action Recognition: A Genetic Programming Approach.学习时空表示进行动作识别:一种遗传编程方法。
IEEE Trans Cybern. 2016 Jan;46(1):158-70. doi: 10.1109/TCYB.2015.2399172. Epub 2015 Feb 13.
5
Dynamic Spatio-Temporal Bag of Expressions (D-STBoE) Model for Human Action Recognition.用于人体动作识别的动态时空词袋(D-STBoE)模型。
Sensors (Basel). 2019 Jun 21;19(12):2790. doi: 10.3390/s19122790.
6
Learning Human Actions by Combining Global Dynamics and Local Appearance.通过组合全局动态和局部外观来学习人类动作。
IEEE Trans Pattern Anal Mach Intell. 2014 Dec;36(12):2466-82. doi: 10.1109/TPAMI.2014.2329301.
7
Action recognition using mined hierarchical compound features.基于挖掘的层次化组合特征的动作识别。
IEEE Trans Pattern Anal Mach Intell. 2011 May;33(5):883-97. doi: 10.1109/TPAMI.2010.144.
8
Modeling Geometric-Temporal Context With Directional Pyramid Co-Occurrence for Action Recognition.基于方向金字塔共现的时空上下文建模方法及其在动作识别中的应用
IEEE Trans Image Process. 2014 Feb;23(2):658-72. doi: 10.1109/TIP.2013.2291319.
9
Action Spotting and Recognition Based on a Spatiotemporal Orientation Analysis.基于时空方向分析的动作定位与识别。
IEEE Trans Pattern Anal Mach Intell. 2013 Mar;35(3):527-40. doi: 10.1109/TPAMI.2012.141.
10
Learning discriminative key poses for action recognition.学习用于动作识别的判别关键姿势。
IEEE Trans Cybern. 2013 Dec;43(6):1860-70. doi: 10.1109/TSMCB.2012.2231959.

引用本文的文献

1
A Comprehensive Methodological Survey of Human Activity Recognition Across Diverse Data Modalities.跨多种数据模态的人类活动识别综合方法学综述
Sensors (Basel). 2025 Jun 27;25(13):4028. doi: 10.3390/s25134028.
2
A Comprehensive Review of Recent Deep Learning Techniques for Human Activity Recognition.深度学习技术在人体活动识别中的研究进展综述
Comput Intell Neurosci. 2022 Apr 20;2022:8323962. doi: 10.1155/2022/8323962. eCollection 2022.
3
A union of deep learning and swarm-based optimization for 3D human action recognition.
基于深度学习和群体智能优化的三维人体动作识别方法。
Sci Rep. 2022 Mar 31;12(1):5494. doi: 10.1038/s41598-022-09293-8.
4
An Effective Approach for Human Activity Classification Using Feature Fusion and Machine Learning Methods.一种基于特征融合和机器学习方法的人类活动分类有效方法。
Appl Bionics Biomech. 2022 Feb 2;2022:7931729. doi: 10.1155/2022/7931729. eCollection 2022.
5
A Novel Parameter Initialization Technique Using RBM-NN for Human Action Recognition.利用 RBM-NN 的新颖参数初始化技术进行人体动作识别。
Comput Intell Neurosci. 2020 Sep 10;2020:8852404. doi: 10.1155/2020/8852404. eCollection 2020.
6
Robust automated reading of the skin prick test via 3D imaging and parametric surface fitting.通过 3D 成像和参数曲面拟合实现皮肤点刺试验的稳健自动化读取。
PLoS One. 2019 Oct 21;14(10):e0223623. doi: 10.1371/journal.pone.0223623. eCollection 2019.
7
A Comprehensive Survey of Vision-Based Human Action Recognition Methods.基于视觉的人体动作识别方法综述。
Sensors (Basel). 2019 Feb 27;19(5):1005. doi: 10.3390/s19051005.
8
Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network.基于注意力感知时间加权卷积神经网络的动作识别。
Sensors (Basel). 2018 Jun 21;18(7):1979. doi: 10.3390/s18071979.
9
Multi-surface analysis for human action recognition in video.视频中人类动作识别的多表面分析
Springerplus. 2016 Aug 2;5(1):1226. doi: 10.1186/s40064-016-2876-z. eCollection 2016.