• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于特征协方差矩阵的视频动作识别。

Action recognition from video using feature covariance matrices.

机构信息

Department of Electrical and Computer Engineering, Boston University, Boston, MA 02215, USA.

出版信息

IEEE Trans Image Process. 2013 Jun;22(6):2479-94. doi: 10.1109/TIP.2013.2252622.

DOI:10.1109/TIP.2013.2252622
PMID:23508265
Abstract

We propose a general framework for fast and accurate recognition of actions in video using empirical covariance matrices of features. A dense set of spatio-temporal feature vectors are computed from video to provide a localized description of the action, and subsequently aggregated in an empirical covariance matrix to compactly represent the action. Two supervised learning methods for action recognition are developed using feature covariance matrices. Common to both methods is the transformation of the classification problem in the closed convex cone of covariance matrices into an equivalent problem in the vector space of symmetric matrices via the matrix logarithm. The first method applies nearest-neighbor classification using a suitable Riemannian metric for covariance matrices. The second method approximates the logarithm of a query covariance matrix by a sparse linear combination of the logarithms of training covariance matrices. The action label is then determined from the sparse coefficients. Both methods achieve state-of-the-art classification performance on several datasets, and are robust to action variability, viewpoint changes, and low object resolution. The proposed framework is conceptually simple and has low storage and computational requirements making it attractive for real-time implementation.

摘要

我们提出了一种使用特征经验协方差矩阵快速准确识别视频中动作的通用框架。从视频中计算出密集的时空特征向量集,提供动作的局部描述,然后在经验协方差矩阵中聚合,以紧凑地表示动作。使用特征协方差矩阵开发了两种用于动作识别的监督学习方法。这两种方法的共同点是通过矩阵对数将协方差矩阵的闭凸锥中的分类问题转换为对称矩阵向量空间中的等效问题。第一种方法使用适合协方差矩阵的黎曼度量应用最近邻分类。第二种方法通过训练协方差矩阵的对数的稀疏线性组合来近似查询协方差矩阵的对数。然后从稀疏系数中确定动作标签。这两种方法在多个数据集上都实现了最先进的分类性能,并且对动作变化、视角变化和低对象分辨率具有鲁棒性。所提出的框架概念简单,存储和计算要求低,因此非常适合实时实现。

相似文献

1
Action recognition from video using feature covariance matrices.基于特征协方差矩阵的视频动作识别。
IEEE Trans Image Process. 2013 Jun;22(6):2479-94. doi: 10.1109/TIP.2013.2252622.
2
Jensen-Bregman LogDet divergence with application to efficient similarity search for covariance matrices.Jensen-Bregman 对数差异与协方差矩阵高效相似度搜索的应用。
IEEE Trans Pattern Anal Mach Intell. 2013 Sep;35(9):2161-74. doi: 10.1109/TPAMI.2012.259.
3
Modeling Geometric-Temporal Context With Directional Pyramid Co-Occurrence for Action Recognition.基于方向金字塔共现的时空上下文建模方法及其在动作识别中的应用
IEEE Trans Image Process. 2014 Feb;23(2):658-72. doi: 10.1109/TIP.2013.2291319.
4
Semisupervised feature selection via spline regression for video semantic recognition.基于样条回归的半监督特征选择在视频语义识别中的应用。
IEEE Trans Neural Netw Learn Syst. 2015 Feb;26(2):252-64. doi: 10.1109/TNNLS.2014.2314123.
5
Multiclass brain-computer interface classification by Riemannian geometry.基于黎曼几何的多类脑-机接口分类。
IEEE Trans Biomed Eng. 2012 Apr;59(4):920-8. doi: 10.1109/TBME.2011.2172210. Epub 2011 Oct 14.
6
Action recognition using mined hierarchical compound features.基于挖掘的层次化组合特征的动作识别。
IEEE Trans Pattern Anal Mach Intell. 2011 May;33(5):883-97. doi: 10.1109/TPAMI.2010.144.
7
Canonical correlation analysis of video volume tensors for action categorization and detection.用于动作分类与检测的视频体积张量典型相关分析。
IEEE Trans Pattern Anal Mach Intell. 2009 Aug;31(8):1415-28. doi: 10.1109/TPAMI.2008.167.
8
Sparse Coding on Symmetric Positive Definite Manifolds Using Bregman Divergences.基于 Bregman 散度的对称正定流形上的稀疏编码。
IEEE Trans Neural Netw Learn Syst. 2016 Jun;27(6):1294-306. doi: 10.1109/TNNLS.2014.2387383. Epub 2015 Jan 27.
9
Learning sparse representations for human action recognition.学习人类动作识别的稀疏表示。
IEEE Trans Pattern Anal Mach Intell. 2012 Aug;34(8):1576-88. doi: 10.1109/TPAMI.2011.253.
10
Single and multiple object tracking using log-euclidean Riemannian subspace and block-division appearance model.基于对数欧几里得黎曼子空间和块划分表观模型的单目标和多目标跟踪
IEEE Trans Pattern Anal Mach Intell. 2012 Dec;34(12):2420-40. doi: 10.1109/TPAMI.2012.42.

引用本文的文献

1
Riemannian Spatio-Temporal Features of Locomotion for Individual Recognition.用于个体识别的运动的黎曼时空特征。
Sensors (Basel). 2018 Dec 23;19(1):56. doi: 10.3390/s19010056.