Van-Horenbeke Franz A, Peer Angelika
Human-Centered Technologies and Machine Intelligence Lab, Faculty of Science and Technology, Free University of Bozen-Bolzano, Bolzano, Italy.
Front Robot AI. 2021 May 10;8:643010. doi: 10.3389/frobt.2021.643010. eCollection 2021.
Recognizing the actions, plans, and goals of a person in an unconstrained environment is a key feature that future robotic systems will need in order to achieve a natural human-machine interaction. Indeed, we humans are constantly understanding and predicting the actions and goals of others, which allows us to interact in intuitive and safe ways. While action and plan recognition are tasks that humans perform naturally and with little effort, they are still an unresolved problem from the point of view of artificial intelligence. The immense variety of possible actions and plans that may be encountered in an unconstrained environment makes current approaches be far from human-like performance. In addition, while very different types of algorithms have been proposed to tackle the problem of activity, plan, and goal (intention) recognition, these tend to focus in only one part of the problem (e.g., action recognition), and techniques that address the problem as a whole have been not so thoroughly explored. This review is meant to provide a general view of the problem of activity, plan, and goal recognition as a whole. It presents a description of the problem, both from the human perspective and from the computational perspective, and proposes a classification of the main types of approaches that have been proposed to address it (logic-based, classical machine learning, deep learning, and brain-inspired), together with a description and comparison of the classes. This general view of the problem can help on the identification of research gaps, and may also provide inspiration for the development of new approaches that address the problem in a unified way.
识别处于无约束环境中的人的行为、计划和目标,是未来机器人系统实现自然人机交互所需的一项关键特性。事实上,我们人类一直在理解和预测他人的行为和目标,这使我们能够以直观且安全的方式进行互动。虽然行为和计划识别是人类自然而然就能轻松完成的任务,但从人工智能的角度来看,它们仍是尚未解决的问题。在无约束环境中可能遇到的行为和计划种类繁多,这使得当前的方法远未达到类似人类的性能。此外,虽然已经提出了非常不同类型的算法来解决活动、计划和目标(意图)识别问题,但这些算法往往只关注问题的一个方面(例如,行为识别),而将该问题作为一个整体来解决的技术尚未得到充分探索。这篇综述旨在从整体上提供对活动、计划和目标识别问题的总体看法。它从人类视角和计算视角对该问题进行了描述,并对为解决该问题而提出的主要方法类型(基于逻辑的、经典机器学习、深度学习和受大脑启发的方法)进行了分类,同时对各类方法进行了描述和比较。对该问题的这种总体看法有助于识别研究差距,也可能为以统一方式解决该问题的新方法的开发提供灵感。