Suppr超能文献

基于时空行为的相关性——或者说如何在不计算的情况下判断两个潜在运动场是否相似?

Space-time behavior-based correlation-Or-how to tell if two underlying motion fields are similar without computing them?

作者信息

Shechtman Eli, Irani Michal

机构信息

Department of Computer Science and Applied Mathematics, The Weizmann Institute of Science, 76100 Rehovot, Isreal.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2007 Nov;29(11):2045-56. doi: 10.1109/TPAMI.2007.1119.

Abstract

We introduce a behavior-based similarity measure which tells us whether two different space-time intensity patterns of two different video segments could have resulted from a similar underlying motion field. This is done directly from the intensity information, without explicitly computing the underlying motions. Such a measure allows us to detect similarity between video segments of differently dressed people performing the same type of activity. It requires no foreground/background segmentation, no prior learning of activities, and no motion estimation or tracking. Using this behavior-based similarity measure, we extend the notion of 2-dimensional image correlation into the 3-dimensional space-time volume, thus allowing to correlate dynamic behaviors and actions. Small space-time video segments (small video clips) are "correlated" against entire video sequences in all three dimensions (x,y, and t). Peak correlation values correspond to video locations with similar dynamic behaviors. Our approach can detect very complex behaviors in video sequences (e.g., ballet movements, pool dives, running water), even when multiple complex activities occur simultaneously within the field-of-view of the camera. We further show its robustness to small changes in scale and orientation of the correlated behavior.

摘要

我们引入了一种基于行为的相似性度量,它能告诉我们两个不同视频片段的不同时空强度模式是否可能源于相似的潜在运动场。这是直接根据强度信息完成的,无需显式计算潜在运动。这样的度量使我们能够检测穿着不同的人执行相同类型活动的视频片段之间的相似性。它不需要前景/背景分割,不需要事先学习活动,也不需要运动估计或跟踪。使用这种基于行为的相似性度量,我们将二维图像相关性的概念扩展到三维时空体积中,从而能够关联动态行为和动作。小的时空视频片段(小视频剪辑)在所有三个维度(x、y和t)上与整个视频序列进行“相关性”比较。峰值相关性值对应于具有相似动态行为的视频位置。我们的方法能够检测视频序列中非常复杂的行为(例如,芭蕾舞动作、跳水、流水),即使在摄像机视野内同时发生多个复杂活动时也是如此。我们还展示了它对相关行为的比例和方向的小变化的鲁棒性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验