Wang Tengda, Niu Ben, Xu Ning, Zhang Liang
College of Control Science and Engineering, Bohai University, Jinzhou 121013, Liaoning, China.
Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, Liaoning, 116024, China.
ISA Trans. 2024 Dec;155:69-81. doi: 10.1016/j.isatra.2024.09.011. Epub 2024 Sep 10.
This article investigates an adaptive dynamic programming-based online compensation hierarchical sliding-mode control problem for a class of partially unknown switched nonlinear systems with actuator failures and uncertain perturbations under an identifier-critic neural networks architecture. Firstly, by introducing a cost function related to hierarchical sliding-mode surfaces for the nominal system, the original control problem is equivalently converted into an optimal control problem. To obtain this optimal control policy, the Hamilton-Jacobi-Bellman equation is solved through an adaptive dynamic programming method. Compared with conventional adaptive dynamic programming methods, the identifier-critic network architecture not only overcomes the limitation on the unknown internal dynamic but also eliminates the approximation error arising from the actor network. The weights in the critic network are tuned via the gradient descent approach and the experience replay technology, such that the persistence of excitation condition can be relaxed. Then, a compensation term containing hierarchical sliding-mode surfaces is used to offset uncertain actuator failures without the fault detection and isolation unit. Based on the Lyapunov stability theory, all states of the closed-loop nonlinear system are stable in the sense of uniformly ultimately boundedness. Finally, numerical and practical examples are given to demonstrate the effectiveness of our presented online compensation control strategy.
本文研究了一类具有执行器故障和不确定扰动的部分未知切换非线性系统在标识符-评判神经网络架构下基于自适应动态规划的在线补偿分层滑模控制问题。首先,通过为标称系统引入与分层滑模面相关的代价函数,将原控制问题等效转化为最优控制问题。为获得该最优控制策略,采用自适应动态规划方法求解哈密顿-雅可比-贝尔曼方程。与传统自适应动态规划方法相比,标识符-评判网络架构不仅克服了对未知内部动态的限制,还消除了由执行网络产生的逼近误差。评判网络中的权重通过梯度下降法和经验回放技术进行调整,从而可以放宽持续激励条件。然后,使用一个包含分层滑模面的补偿项来抵消不确定的执行器故障,而无需故障检测与隔离单元。基于李雅普诺夫稳定性理论,闭环非线性系统的所有状态在一致最终有界意义下是稳定的。最后,给出数值和实际例子以证明所提出的在线补偿控制策略的有效性。