Recognizing Actions Through Action-Specific Person Detection.

Publication information

IEEE Trans Image Process. 2015 Nov;24(11):4422-32. doi: 10.1109/TIP.2015.2465147. Epub 2015 Aug 5.

DOI:10.1109/TIP.2015.2465147
PMID:26259079
Abstract

Action recognition in still images is a challenging problem in computer vision. To facilitate comparative evaluation independently of person detection, the standard evaluation protocol for action recognition uses an oracle person detector to obtain perfect bounding box information at both training and test time. The assumption is that, in practice, a general person detector will provide candidate bounding boxes for action recognition. In this paper, we argue that this paradigm is suboptimal and that action class labels should already be considered during the detection stage. Motivated by the observation that body pose is strongly conditioned on action class, we show that: 1) existing state-of-the-art generic person detectors are not adequate for proposing candidate bounding boxes for action classification; 2) due to limited training examples, direct training of action-specific person detectors is also inadequate; and 3) using only a small number of labeled action examples, transfer learning can adapt an existing detector to propose higher-quality bounding boxes for subsequent action classification. To the best of our knowledge, we are the first to investigate transfer learning for the task of action-specific person detection in still images. We perform extensive experiments on two benchmark data sets: 1) Stanford-40 and 2) PASCAL VOC 2012. For the action detection task (i.e., both person localization and classification of the action performed), our approach outperforms methods based on general person detection by 5.7% mean average precision (MAP) on Stanford-40 and 2.1% MAP on PASCAL VOC 2012. Our approach also significantly outperforms the state of the art with a MAP of 45.4% on Stanford-40 and 31.4% on PASCAL VOC 2012. We also evaluate our action detection approach for the task of action classification (i.e., recognizing actions without localizing them). For this task, our approach uses no ground-truth person localization at test time, yet on both data sets it outperforms state-of-the-art methods that do use person locations.
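
The abstract reports results as mean average precision (MAP) for action detection, where a detection must both localize the person and name the action. The sketch below illustrates one common way such a score can be computed; it is not the paper's evaluation code. The function names (`iou`, `average_precision`, `mean_average_precision`) are hypothetical, the average precision is the simple non-interpolated form, and the official benchmark protocols may differ in details such as interpolation and the overlap threshold.

```python
from typing import List, Tuple

# Assumed box layout for this sketch: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def average_precision(scored_hits: List[Tuple[float, bool]],
                      num_positives: int) -> float:
    """Non-interpolated AP for one action class.

    scored_hits holds (confidence, is_true_positive) per detection, where a
    detection would count as a true positive if it overlaps an unmatched
    ground-truth person of that class (an IoU of at least 0.5 is a common
    choice, assumed here rather than taken from the paper).
    """
    ranked = sorted(scored_hits, key=lambda s: s[0], reverse=True)
    true_positives = 0
    precision_sum = 0.0
    for rank, (_, is_hit) in enumerate(ranked, start=1):
        if is_hit:
            true_positives += 1
            precision_sum += true_positives / rank  # precision at this recall step
    return precision_sum / num_positives if num_positives else 0.0


def mean_average_precision(per_class_ap: List[float]) -> float:
    """MAP is the unweighted mean of the per-class AP values."""
    return sum(per_class_ap) / len(per_class_ap) if per_class_ap else 0.0
```

Under these assumptions, a class with two ground-truth people and three ranked detections (hit, miss, hit) scores an AP of (1/1 + 2/3) / 2, and the per-class values are then averaged over all action classes to give the MAP.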

Similar articles

1
Recognizing Actions Through Action-Specific Person Detection.
IEEE Trans Image Process. 2015 Nov;24(11):4422-32. doi: 10.1109/TIP.2015.2465147. Epub 2015 Aug 5.
2
Semantic pyramids for gender and action recognition.
IEEE Trans Image Process. 2014 Aug;23(8):3633-45. doi: 10.1109/TIP.2014.2331759. Epub 2014 Jun 18.
3
Training Robust Object Detectors From Noisy Category Labels and Imprecise Bounding Boxes.
IEEE Trans Image Process. 2021;30:5782-5792. doi: 10.1109/TIP.2021.3085208. Epub 2021 Jun 23.
4
Weakly Supervised Large Scale Object Localization with Multiple Instance Learning and Bag Splitting.
IEEE Trans Pattern Anal Mach Intell. 2016 Feb;38(2):405-16. doi: 10.1109/TPAMI.2015.2456908.
5
Augmented multiple instance regression for inferring object contours in bounding boxes.
IEEE Trans Image Process. 2014 Apr;23(4):1722-36. doi: 10.1109/TIP.2014.2307436.
6
Action Recognition in Still Images With Minimum Annotation Efforts.
IEEE Trans Image Process. 2016 Nov;25(11):5479-5490. doi: 10.1109/TIP.2016.2605305. Epub 2016 Sep 1.
7
Self Paced Deep Learning for Weakly Supervised Object Detection.
IEEE Trans Pattern Anal Mach Intell. 2019 Mar;41(3):712-725. doi: 10.1109/TPAMI.2018.2804907. Epub 2018 Feb 12.
8
Coarse-to-Fine Adaptive People Detection for Video Sequences by Maximizing Mutual Information.
Sensors (Basel). 2018 Dec 20;19(1):4. doi: 10.3390/s19010004.
9
Weakly Supervised Object Detection via Object-Specific Pixel Gradient.
IEEE Trans Neural Netw Learn Syst. 2018 Dec;29(12):5960-5970. doi: 10.1109/TNNLS.2018.2816021. Epub 2018 Apr 9.
10
Weakly-Supervised Salient Object Detection With Saliency Bounding Boxes.
IEEE Trans Image Process. 2021;30:4423-4435. doi: 10.1109/TIP.2021.3071691. Epub 2021 Apr 21.

Cited by

1
Recognition of human action for scene understanding using world cup optimization and transfer learning approach.
PeerJ Comput Sci. 2023 May 23;9:e1396. doi: 10.7717/peerj-cs.1396. eCollection 2023.