Haruno M, Wolpert D M, Kawato M
ATR Human Information Processing Research Laboratories, Seika-cho, Soraku-gun, Kyoto 619-02, Japan.
Neural Comput. 2001 Oct;13(10):2201-20. doi: 10.1162/089976601750541778.
Humans demonstrate a remarkable ability to generate accurate and appropriate motor behavior under many different and often uncertain environmental conditions. We previously proposed a new modular architecture, the modular selection and identification for control (MOSAIC) model, for motor learning and control based on multiple pairs of forward (predictor) and inverse (controller) models. The architecture simultaneously learns the multiple inverse models necessary for control as well as how to select the set of inverse models appropriate for a given environment. It combines both feedforward and feedback sensorimotor information so that the controllers can be selected both prior to movement and subsequently during movement. This article extends and evaluates the MOSAIC architecture in the following respects. The learning in the architecture was implemented by both the original gradient-descent method and the expectation-maximization (EM) algorithm. Unlike gradient descent, the newly derived EM algorithm is robust to the initial starting conditions and learning parameters. Second, simulations of an object manipulation task prove that the architecture can learn to manipulate multiple objects and switch between them appropriately. Moreover, after learning, the model shows generalization to novel objects whose dynamics lie within the polyhedra of already learned dynamics. Finally, when each of the dynamics is associated with a particular object shape, the model is able to select the appropriate controller before movement execution. When presented with a novel shape-dynamic pairing, inappropriate activation of modules is observed followed by on-line correction.
人类展现出了非凡的能力,能够在许多不同且往往不确定的环境条件下产生准确且恰当的运动行为。我们之前提出了一种新的模块化架构,即控制的模块化选择与识别(MOSAIC)模型,用于基于多对前向(预测器)和反向(控制器)模型的运动学习与控制。该架构同时学习控制所需的多个反向模型,以及如何选择适合给定环境的反向模型集。它结合了前馈和反馈的感觉运动信息,以便在运动之前和运动过程中都能选择控制器。本文在以下方面扩展并评估了MOSAIC架构。架构中的学习通过原始的梯度下降法和期望最大化(EM)算法来实现。与梯度下降不同,新推导的EM算法对初始起始条件和学习参数具有鲁棒性。其次,一个物体操纵任务的模拟证明,该架构能够学习操纵多个物体并在它们之间适当地切换。此外,学习之后,该模型对动力学位于已学习动力学多面体内的新物体具有泛化能力。最后,当每个动力学与特定的物体形状相关联时,该模型能够在运动执行之前选择合适的控制器。当呈现新的形状 - 动力学配对时,会观察到模块的不适当激活,随后进行在线校正。