Galicki M, Leistritz L, Witte H
Institute of Medical Statistics, Computer Science and Documentation, Friedrich Schiller University, D-07740 Jena, Germany.
IEEE Trans Neural Netw. 1999;10(4):741-56. doi: 10.1109/72.774210.
This paper is concerned with a general learning (with arbitrary criterion and state-dependent constraints) of continuous trajectories by means of recurrent neural networks with time-varying weights. The learning process is transformed into an optimal control framework, where the weights to be found are treated as controls. A new learning algorithm based on a variational formulation of Pontryagin's maximum principle is proposed. This algorithm is shown to converge, under reasonable conditions, to an optimal solution. The neural networks with time-dependent weights make it possible to efficiently find an admissible solution (i.e., initial weights satisfying state constraints) which then serves as an initial guess to carry out a proper minimization of a given criterion. The proposed methodology may be directly applicable to both classification of temporal sequences and to optimal tracking of nonlinear dynamic systems. Numerical examples are also given which demonstrate the efficiency of the approach presented.
本文关注的是通过具有时变权重的递归神经网络对连续轨迹进行一般学习(采用任意准则和状态相关约束)。学习过程被转化为一个最优控制框架,其中待求的权重被视为控制量。提出了一种基于庞特里亚金极大值原理变分形式的新学习算法。在合理条件下,该算法被证明能收敛到最优解。具有时间相关权重的神经网络使得能够有效地找到一个可行解(即满足状态约束的初始权重),然后将其作为初始猜测来对给定准则进行适当的最小化。所提出的方法可直接应用于时间序列分类和非线性动态系统的最优跟踪。还给出了数值例子,展示了所提出方法的有效性。