• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类及其活动的速率不变识别。

Rate-invariant recognition of humans and their activities.

作者信息

Veeraraghavan Ashok, Srivastava Anuj, Roy-Chowdhury Amit K, Chellappa Rama

机构信息

Centre for Automation Research and Electrical and Computer Engineering Department, University of Maryland, College Park, MD 20742, USA.

出版信息

IEEE Trans Image Process. 2009 Jun;18(6):1326-39. doi: 10.1109/TIP.2009.2017143. Epub 2009 Apr 24.

DOI:10.1109/TIP.2009.2017143
PMID:19398409
Abstract

Pattern recognition in video is a challenging task because of the multitude of spatio-temporal variations that occur in different videos capturing the exact same event. While traditional pattern-theoretic approaches account for the spatial changes that occur due to lighting and pose, very little has been done to address the effect of temporal rate changes in the executions of an event. In this paper, we provide a systematic model-based approach to learn the nature of such temporal variations (time warps) while simultaneously allowing for the spatial variations in the descriptors. We illustrate our approach for the problem of action recognition and provide experimental justification for the importance of accounting for rate variations in action recognition. The model is composed of a nominal activity trajectory and a function space capturing the probability distribution of activity-specific time warping transformations. We use the square-root parameterization of time warps to derive geodesics, distance measures, and probability distributions on the space of time warping functions. We then design a Bayesian algorithm which treats the execution rate function as a nuisance variable and integrates it out using Monte Carlo sampling, to generate estimates of class posteriors. This approach allows us to learn the space of time warps for each activity while simultaneously capturing other intra- and interclass variations. Next, we discuss a special case of this approach which assumes a uniform distribution on the space of time warping functions and show how computationally efficient inference algorithms may be derived for this special case. We discuss the relative advantages and disadvantages of both approaches and show their efficacy using experiments on gait-based person identification and activity recognition.

摘要

视频中的模式识别是一项具有挑战性的任务,因为在捕捉完全相同事件的不同视频中会出现大量的时空变化。虽然传统的模式理论方法考虑了由于光照和姿势引起的空间变化,但在处理事件执行过程中时间速率变化的影响方面做得很少。在本文中,我们提供了一种基于系统模型的方法来学习这种时间变化(时间扭曲)的本质,同时考虑描述符中的空间变化。我们说明了我们针对动作识别问题的方法,并为在动作识别中考虑速率变化的重要性提供了实验依据。该模型由一个标称活动轨迹和一个捕获特定活动时间扭曲变换概率分布的函数空间组成。我们使用时间扭曲的平方根参数化来推导时间扭曲函数空间上的测地线、距离度量和概率分布。然后,我们设计了一种贝叶斯算法,将执行速率函数视为一个讨厌变量,并使用蒙特卡罗采样将其积分掉,以生成类后验估计。这种方法使我们能够学习每个活动的时间扭曲空间,同时捕获其他类内和类间变化。接下来,我们讨论这种方法的一种特殊情况,即假设时间扭曲函数空间上的均匀分布,并展示如何为这种特殊情况推导计算效率高的推理算法。我们讨论了这两种方法的相对优缺点,并通过基于步态的人员识别和活动识别实验展示了它们的有效性。

相似文献

1
Rate-invariant recognition of humans and their activities.人类及其活动的速率不变识别。
IEEE Trans Image Process. 2009 Jun;18(6):1326-39. doi: 10.1109/TIP.2009.2017143. Epub 2009 Apr 24.
2
Observing human-object interactions: using spatial and functional compatibility for recognition.观察人与物体的交互:利用空间和功能兼容性进行识别。
IEEE Trans Pattern Anal Mach Intell. 2009 Oct;31(10):1775-89. doi: 10.1109/TPAMI.2009.83.
3
Distribution-based dimensionality reduction applied to articulated motion recognition.基于分布的降维方法在关节运动识别中的应用。
IEEE Trans Pattern Anal Mach Intell. 2009 May;31(5):795-810. doi: 10.1109/TPAMI.2008.80.
4
Activity modeling using event probability sequences.使用事件概率序列进行活动建模。
IEEE Trans Image Process. 2008 Apr;17(4):594-607. doi: 10.1109/TIP.2008.916991.
5
Abrupt motion tracking via intensively adaptive Markov-chain Monte Carlo sampling.基于密集自适应马尔可夫链蒙特卡罗采样的快速运动跟踪。
IEEE Trans Image Process. 2012 Feb;21(2):789-801. doi: 10.1109/TIP.2011.2168414. Epub 2011 Sep 19.
6
Latent-space variational bayes.潜在空间变分贝叶斯
IEEE Trans Pattern Anal Mach Intell. 2008 Dec;30(12):2236-42. doi: 10.1109/TPAMI.2008.157.
7
Empirical Markov Chain Monte Carlo Bayesian analysis of fMRI data.功能磁共振成像数据的经验马尔可夫链蒙特卡罗贝叶斯分析。
Neuroimage. 2008 Aug 1;42(1):99-111. doi: 10.1016/j.neuroimage.2008.04.235. Epub 2008 Apr 29.
8
Looking for shapes in two-dimensional cluttered point clouds.在二维杂乱点云中寻找形状。
IEEE Trans Pattern Anal Mach Intell. 2009 Sep;31(9):1616-29. doi: 10.1109/TPAMI.2008.223.
9
PADS: a probabilistic activity detection framework for video data.PADS:一种用于视频数据的概率活动检测框架。
IEEE Trans Pattern Anal Mach Intell. 2010 Dec;32(12):2246-61. doi: 10.1109/TPAMI.2010.33.
10
Deriving evidence theoretical functions in multivariate data spaces: a systematic approach.推导多元数据空间中的证据理论函数:一种系统方法。
IEEE Trans Syst Man Cybern B Cybern. 2008 Apr;38(2):455-65. doi: 10.1109/TSMCB.2007.913593.

引用本文的文献

1
Time-warping analysis for biological signals: methodology and application.生物信号的时间扭曲分析:方法与应用
Sci Rep. 2025 Apr 5;15(1):11718. doi: 10.1038/s41598-025-95108-5.
2
Shape Analysis of Functional Data With Elastic Partial Matching.基于弹性局部匹配的函数型数据形状分析
IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9589-9602. doi: 10.1109/TPAMI.2021.3130535. Epub 2022 Nov 7.
3
A geometric approach for computing tolerance bounds for elastic functional data.一种用于计算弹性函数型数据公差界限的几何方法。
J Appl Stat. 2019 Sep 24;47(3):481-505. doi: 10.1080/02664763.2019.1645818. Epub 2019 Jul 23.
4
Multiview Layer Fusion Model for Action Recognition Using RGBD Images.基于 RGBD 图像的动作识别的多视图层融合模型。
Comput Intell Neurosci. 2018 Jun 20;2018:9032945. doi: 10.1155/2018/9032945. eCollection 2018.