• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PADS:一种用于视频数据的概率活动检测框架。

PADS: a probabilistic activity detection framework for video data.

机构信息

Department of Computer Science, Institute for Advanced Computer Studies, University of Maryland, College Park, MD 20742, USA.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2010 Dec;32(12):2246-61. doi: 10.1109/TPAMI.2010.33.

DOI:10.1109/TPAMI.2010.33
PMID:20975121
Abstract

There is now a growing need to identify various kinds of activities that occur in videos. In this paper, we first present a logical language called Probabilistic Activity Description Language (PADL) in which users can specify activities of interest. We then develop a probabilistic framework which assigns to any subvideo of a given video sequence a probability that the subvideo contains the given activity, and we finally develop two fast algorithms to detect activities within this framework. OffPad finds all minimal segments of a video that contain a given activity with a probability exceeding a given threshold. In contrast, the OnPad algorithm examines a video during playout (rather than afterwards as OffPad does) and computes the probability that a given activity is occurring (even if the activity is only partially complete). Our prototype Probabilistic Activity Detection System (PADS) implements the framework and the two algorithms, building on top of existing image processing algorithms. We have conducted detailed experiments and compared our approach to four different approaches presented in the literature. We show that-for complex activity definitions-our approach outperforms all the other approaches.

摘要

现在越来越需要识别视频中发生的各种活动。在本文中,我们首先提出了一种称为概率活动描述语言(PADL)的逻辑语言,用户可以使用该语言指定感兴趣的活动。然后,我们开发了一个概率框架,该框架为给定视频序列的任何子视频分配一个包含给定活动的子视频的概率,最后我们开发了两个在该框架内检测活动的快速算法。OffPad 找到包含给定活动的视频的所有最小片段,其概率超过给定阈值。相比之下,OnPad 算法在播放期间检查视频(而不是像 OffPad 那样在之后检查),并计算给定活动正在发生的概率(即使活动只是部分完成)。我们的概率活动检测系统(PADS)原型实现了该框架和两个算法,构建在现有的图像处理算法之上。我们进行了详细的实验,并将我们的方法与文献中提出的四种不同方法进行了比较。我们表明,对于复杂的活动定义,我们的方法优于所有其他方法。

相似文献

1
PADS: a probabilistic activity detection framework for video data.PADS:一种用于视频数据的概率活动检测框架。
IEEE Trans Pattern Anal Mach Intell. 2010 Dec;32(12):2246-61. doi: 10.1109/TPAMI.2010.33.
2
Probabilistic image modeling with an extended chain graph for human activity recognition and image segmentation.基于扩展链式图的人体活动识别和图像分割的概率图像建模。
IEEE Trans Image Process. 2011 Sep;20(9):2401-13. doi: 10.1109/TIP.2011.2128332. Epub 2011 Mar 17.
3
Activity modeling using event probability sequences.使用事件概率序列进行活动建模。
IEEE Trans Image Process. 2008 Apr;17(4):594-607. doi: 10.1109/TIP.2008.916991.
4
Observing human-object interactions: using spatial and functional compatibility for recognition.观察人与物体的交互:利用空间和功能兼容性进行识别。
IEEE Trans Pattern Anal Mach Intell. 2009 Oct;31(10):1775-89. doi: 10.1109/TPAMI.2009.83.
5
Building models of animals from video.通过视频构建动物模型。
IEEE Trans Pattern Anal Mach Intell. 2006 Aug;28(8):1319-34. doi: 10.1109/TPAMI.2006.155.
6
Cross-domain human action recognition.跨域人类动作识别
IEEE Trans Syst Man Cybern B Cybern. 2012 Apr;42(2):298-307. doi: 10.1109/TSMCB.2011.2166761. Epub 2011 Sep 26.
7
Probabilistic space-time video modeling via piecewise GMM.基于分段高斯混合模型的概率时空视频建模
IEEE Trans Pattern Anal Mach Intell. 2004 Mar;26(3):384-96. doi: 10.1109/TPAMI.2004.1262334.
8
A new approach for overlay text detection and extraction from complex video scene.一种从复杂视频场景中检测和提取叠加文本的新方法。
IEEE Trans Image Process. 2009 Feb;18(2):401-11. doi: 10.1109/TIP.2008.2008225. Epub 2008 Dec 16.
9
Handling movement epenthesis and hand segmentation ambiguities in continuous sign language recognition using nested dynamic programming.使用嵌套动态规划处理连续手语识别中的运动插入和手部分割歧义。
IEEE Trans Pattern Anal Mach Intell. 2010 Mar;32(3):462-77. doi: 10.1109/TPAMI.2009.26.
10
Active and dynamic information fusion for facial expression understanding from image sequences.用于从图像序列理解面部表情的主动动态信息融合
IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):699-714. doi: 10.1109/TPAMI.2005.93.