Department of Applied Informatics, Comenius University in Bratislava Bratislava, Slovakia.
Front Neurorobot. 2012 Feb 29;6:1. doi: 10.3389/fnbot.2012.00001. eCollection 2012.
The recent outburst of interest in cognitive developmental robotics is fueled by the ambition to propose ecologically plausible mechanisms of how, among other things, a learning agent/robot could ground linguistic meanings in its sensorimotor behavior. Along this stream, we propose a model that allows the simulated iCub robot to learn the meanings of actions (point, touch, and push) oriented toward objects in robot's peripersonal space. In our experiments, the iCub learns to execute motor actions and comment on them. Architecturally, the model is composed of three neural-network-based modules that are trained in different ways. The first module, a two-layer perceptron, is trained by back-propagation to attend to the target position in the visual scene, given the low-level visual information and the feature-based target information. The second module, having the form of an actor-critic architecture, is the most distinguishing part of our model, and is trained by a continuous version of reinforcement learning to execute actions as sequences, based on a linguistic command. The third module, an echo-state network, is trained to provide the linguistic description of the executed actions. The trained model generalizes well in case of novel action-target combinations with randomized initial arm positions. It can also promptly adapt its behavior if the action/target suddenly changes during motor execution.
最近,人们对认知发展机器人学的兴趣大增,其动机是提出一个在生态上合理的机制,说明学习代理/机器人如何将语言意义建立在其感觉运动行为基础上。沿着这条思路,我们提出了一个模型,使模拟的 iCub 机器人能够学习针对机器人近体空间中的物体的动作(指向、触摸和推动)的意义。在我们的实验中,iCub 学会了执行运动动作并对其进行评论。从架构上讲,该模型由三个基于神经网络的模块组成,这些模块以不同的方式进行训练。第一个模块是一个两层感知机,通过反向传播进行训练,以便在给定低水平视觉信息和基于特征的目标信息的情况下,关注视觉场景中的目标位置。第二个模块采用演员-批评家架构的形式,是我们模型的最具特色部分,通过连续强化学习进行训练,以便根据语言命令执行动作序列。第三个模块是回声状态网络,用于提供执行动作的语言描述。训练有素的模型在具有随机初始手臂位置的新动作-目标组合的情况下能够很好地泛化。如果在运动执行过程中动作/目标突然发生变化,它也可以迅速调整其行为。