IEEE Trans Neural Netw Learn Syst. 2016 Dec;27(12):2564-2576. doi: 10.1109/TNNLS.2015.2496622. Epub 2015 Nov 17.
This paper focuses on dynamic learning from neural control for a class of nonlinear strict-feedback systems with predefined tracking performance attributes. To reduce the number of neural network (NN) approximators used and make the convergence of neural weights verified easily, state variables are introduced to transform the state-feedback control of the original strict-feedback systems into the output-feedback control of the system in the normal form. Then, using the output error transformation based on performance functions, the constrained tracking control problem of the normal systems is transformed into the stabilization problem of an equivalent unconstrained one. By combining the backstepping method, a high-gain observer with radial basis function (RBF) NNs, a novel adaptive neural control (ANC) scheme is proposed to guarantee the predefined tracking error performance as well as the ultimate boundedness of all other closed-loop signals. In particular, only one NN is employed to approximate the lumped unknown system dynamics during the controller design. Under the satisfaction of the partial persistent excitation condition for RBF NNs, the proposed stable ANC scheme is shown to be capable of achieving knowledge acquisition, expression, and storage of unknown system dynamics. The stored knowledge is reused to develop a neural learning controller for improving the control performance of the closed-loop system. When the initial condition satisfies the predefined performance, the proposed neural learning control can still guarantee the predefined tracking performance. Simulation results on a third-order one-link robot are given to show the effectiveness of the proposed method.
本文针对一类具有预定跟踪性能属性的非线性严格反馈系统,研究了从神经控制中进行动态学习的问题。为了减少神经网络(NN)逼近器的数量,并使神经权重的收敛性易于验证,引入状态变量将原始严格反馈系统的状态反馈控制转换为系统在规范形式下的输出反馈控制。然后,利用基于性能函数的输出误差变换,将规范系统的约束跟踪控制问题转化为等效无约束系统的镇定问题。通过结合反推法、具有径向基函数(RBF)神经网络的高增益观测器,提出了一种新的自适应神经控制(ANC)方案,以保证预定的跟踪误差性能以及所有其他闭环信号的最终有界性。特别地,在控制器设计过程中,仅使用一个 NN 来逼近集中未知系统动态。在 RBF NN 满足部分持续激励条件下,所提出的稳定 ANC 方案能够实现未知系统动态的知识获取、表达和存储。所存储的知识被重新用于开发神经学习控制器,以提高闭环系统的控制性能。当初始条件满足预定性能时,所提出的神经学习控制仍然可以保证预定的跟踪性能。在一个三阶单连杆机器人上的仿真结果表明了所提出方法的有效性。