Barto A G, Anderson C W, Sutton R S
Biol Cybern. 1982;43(3):175-85. doi: 10.1007/BF00319977.
An approach to solving nonlinear control problems is illustrated by means of a layered associative network composed of adaptive elements capable of reinforcement learning. The first layer adaptively develops a representation in terms of which the second layer can solve the problem linearly. The adaptive elements comprising the network employ a novel type of learning rule whose properties, we argue, are essential to the adaptive behavior of the layered network. The behavior of the network is illustrated by means of a spatial learning problem that requires the formation of nonlinear associations. We argue that this approach to nonlinearity can be extended to a large class of nonlinear control problems.
一种解决非线性控制问题的方法通过由能够进行强化学习的自适应元件组成的分层联想网络来说明。第一层自适应地开发一种表示,第二层可以根据该表示线性地解决问题。构成网络的自适应元件采用一种新型的学习规则,我们认为其特性对于分层网络的自适应行为至关重要。通过一个需要形成非线性关联的空间学习问题来说明网络的行为。我们认为这种处理非线性的方法可以扩展到一大类非线性控制问题。