Padhi Radhakant, Unnikrishnan Nishant, Wang Xiaohua, Balakrishnan S N
Department of Aerospace Engineering, Indian Institute of Science, Bangalore, India.
Neural Netw. 2006 Dec;19(10):1648-60. doi: 10.1016/j.neunet.2006.08.010. Epub 2006 Oct 11.
Even though dynamic programming offers an optimal control solution in a state feedback form, the method is overwhelmed by computational and storage requirements. Approximate dynamic programming implemented with an Adaptive Critic (AC) neural network structure has evolved as a powerful alternative technique that obviates the need for excessive computations and storage requirements in solving optimal control problems. In this paper, an improvement to the AC architecture, called the "Single Network Adaptive Critic (SNAC)" is presented. This approach is applicable to a wide class of nonlinear systems where the optimal control (stationary) equation can be explicitly expressed in terms of the state and costate variables. The selection of this terminology is guided by the fact that it eliminates the use of one neural network (namely the action network) that is part of a typical dual network AC setup. As a consequence, the SNAC architecture offers three potential advantages: a simpler architecture, lesser computational load and elimination of the approximation error associated with the eliminated network. In order to demonstrate these benefits and the control synthesis technique using SNAC, two problems have been solved with the AC and SNAC approaches and their computational performances are compared. One of these problems is a real-life Micro-Electro-Mechanical-system (MEMS) problem, which demonstrates that the SNAC technique is applicable to complex engineering systems.
尽管动态规划以状态反馈形式提供了最优控制解决方案,但该方法受到计算和存储需求的困扰。采用自适应评判器(AC)神经网络结构实现的近似动态规划已发展成为一种强大的替代技术,它在解决最优控制问题时无需过多的计算和存储需求。本文提出了对AC架构的一种改进,称为“单网络自适应评判器(SNAC)”。这种方法适用于广泛的一类非线性系统,在这类系统中,最优控制(稳态)方程可以根据状态和协态变量明确表示。选择这个术语的依据是它省去了典型双网络AC设置中作为一部分的一个神经网络(即动作网络)的使用。因此,SNAC架构具有三个潜在优势:架构更简单、计算负载更小以及消除了与被省去网络相关的近似误差。为了证明这些优势以及使用SNAC的控制综合技术,我们用AC和SNAC方法解决了两个问题,并比较了它们的计算性能。其中一个问题是一个实际的微机电系统(MEMS)问题,这表明SNAC技术适用于复杂的工程系统。