Suppr
超能文献

Anticipating Human Activities Using Object Affordances for Reactive Robotic Response.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2016 Jan;38(1):14-29. doi: 10.1109/TPAMI.2015.2430335.

Abstract

An important aspect of human perception is anticipation, which we use extensively in our day-to-day activities when interacting with other humans as well as with our surroundings. Anticipating which activities a human will do next (and how) can enable an assistive robot to plan ahead for reactive responses. Furthermore, anticipation can even improve the detection accuracy of past activities. The challenge, however, is two-fold: we need to capture the rich context for modeling the activities and object affordances, and we need to anticipate the distribution over a large space of future human activities. In this work, we represent each possible future using an anticipatory temporal conditional random field (ATCRF) that models the rich spatial-temporal relations through object affordances. We then consider each ATCRF as a particle and represent the distribution over the potential futures using a set of particles. In an extensive evaluation on the CAD-120 human activity RGB-D dataset, we first show that anticipation improves the state-of-the-art detection results. We then show that for new subjects (not seen in the training set), we obtain an activity anticipation accuracy (defined as whether one of the top three predictions actually happened) of 84.1, 74.4 and 62.2 percent for anticipation times of 1, 3 and 10 seconds, respectively. Finally, we also show a robot using our algorithm to perform a few reactive responses.
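The abstract describes two ideas that a small example may make more concrete: a distribution over futures represented by a set of particles, and an accuracy metric that counts a prediction as correct when one of the top three anticipated activities actually happened. The sketch below is only an illustration under simplifying assumptions, not the authors' implementation: each particle is reduced to a hypothetical `(label, weight)` pair instead of a full ATCRF, and the function names and activity labels are invented for this example.

```python
from collections import defaultdict


def rank_anticipated_activities(particles):
    """Aggregate weighted particles into a ranked list of activity labels.

    Assumption: each particle is a (label, weight) pair standing in for one
    sampled future. In the paper a particle is a whole ATCRF; that structure
    is deliberately omitted here.
    """
    totals = defaultdict(float)
    for label, weight in particles:
        totals[label] += weight
    # Highest total weight first; ties broken alphabetically for determinism.
    ordered = sorted(totals.items(), key=lambda item: (-item[1], item[0]))
    return [label for label, _ in ordered]


def top_k_accuracy(ranked_predictions, ground_truth, k=3):
    """Fraction of cases whose true activity is among the top-k predictions."""
    if len(ranked_predictions) != len(ground_truth):
        raise ValueError("predictions and ground truth must have equal length")
    if not ground_truth:
        raise ValueError("at least one test case is required")
    hits = sum(
        1
        for ranked, actual in zip(ranked_predictions, ground_truth)
        if actual in ranked[:k]
    )
    return hits / len(ground_truth)


# Hypothetical particle set for one observed moment (weights sum to 1).
particles = [
    ("reaching", 0.40),
    ("moving", 0.30),
    ("reaching", 0.10),
    ("drinking", 0.15),
    ("placing", 0.05),
]
ranked = rank_anticipated_activities(particles)

# Two hypothetical test cases sharing the same ranking: the first true
# activity is inside the top three, the second is not.
accuracy = top_k_accuracy([ranked, ranked], ["drinking", "placing"], k=3)
print(ranked, accuracy)
```

With `k=3`, this matches the definition given in the abstract; the reported 84.1, 74.4 and 62.2 percent figures come from the paper's evaluation, not from this toy data.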

Similar articles

Perception through visuomotor anticipation in a mobile robot.

Neural Netw. 2007 Jan;20(1):22-33. doi: 10.1016/j.neunet.2006.07.003. Epub 2006 Sep 28.

Modeling 3D Environments through Hidden Human Context.

IEEE Trans Pattern Anal Mach Intell. 2016 Oct;38(10):2040-53. doi: 10.1109/TPAMI.2015.2501811. Epub 2015 Dec 3.

Explicit modeling of human-object interactions in realistic videos.

IEEE Trans Pattern Anal Mach Intell. 2013 Apr;35(4):835-48. doi: 10.1109/TPAMI.2012.175.

Real-time multiple human perception with color-depth cameras on a mobile robot.

IEEE Trans Cybern. 2013 Oct;43(5):1429-41. doi: 10.1109/TCYB.2013.2275291. Epub 2013 Aug 21.

Quantitative analysis of human-model agreement in visual saliency modeling: a comparative study.

IEEE Trans Image Process. 2013 Jan;22(1):55-69. doi: 10.1109/TIP.2012.2210727. Epub 2012 Jul 30.

PADS: a probabilistic activity detection framework for video data.

IEEE Trans Pattern Anal Mach Intell. 2010 Dec;32(12):2246-61. doi: 10.1109/TPAMI.2010.33.

Action Anticipation Using Pairwise Human-Object Interactions and Transformers.

IEEE Trans Image Process. 2021;30:8116-8129. doi: 10.1109/TIP.2021.3113114. Epub 2021 Sep 27.

Situated anticipation.

Synthese. 2021;198(1):349-371. doi: 10.1007/s11229-018-02013-8. Epub 2018 Nov 20.

Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments.

IEEE Trans Pattern Anal Mach Intell. 2014 Jul;36(7):1325-39. doi: 10.1109/TPAMI.2013.248.

Cited by

A simulated dataset for proactive robot task inference from streaming natural language dialogues.

Sci Data. 2025 Aug 11;12(1):1405. doi: 10.1038/s41597-025-05727-w.

Recognition of Grasping Patterns Using Deep Learning for Human-Robot Collaboration.

Sensors (Basel). 2023 Nov 5;23(21):8989. doi: 10.3390/s23218989.

Human Motion Prediction via Dual-Attention and Multi-Granularity Temporal Convolutional Networks.

Sensors (Basel). 2023 Jun 16;23(12):5653. doi: 10.3390/s23125653.

Recognition of human action for scene understanding using world cup optimization and transfer learning approach.

PeerJ Comput Sci. 2023 May 23;9:e1396. doi: 10.7717/peerj-cs.1396. eCollection 2023.

Intelligent Video Analytics for Human Action Recognition: The State of Knowledge.

Sensors (Basel). 2023 Apr 25;23(9):4258. doi: 10.3390/s23094258.

Skeleton-based motion prediction: A survey.

Front Comput Neurosci. 2022 Oct 28;16:1051222. doi: 10.3389/fncom.2022.1051222. eCollection 2022.

Generative model-enhanced human motion prediction.

Appl AI Lett. 2022 Apr;3(2):e63. doi: 10.1002/ail2.63. Epub 2022 Mar 23.

Human activity recognition in artificial intelligence framework: a narrative review.

Artif Intell Rev. 2022;55(6):4755-4808. doi: 10.1007/s10462-021-10116-x. Epub 2022 Jan 18.

Human-in-the-Loop Robot Control for Human-Robot Collaboration: Human Intention Estimation and Safe Trajectory Tracking Control for Collaborative Tasks.

IEEE Control Syst. 2020 Dec;40(6):29-56. Epub 2020 Nov 16.

A Review on Computer Vision-Based Methods for Human Action Recognition.

J Imaging. 2020 Jun 10;6(6):46. doi: 10.3390/jimaging6060046.
