School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China.
School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China; Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou 313001, China.
Neural Netw. 2023 Oct;167:588-600. doi: 10.1016/j.neunet.2023.08.044. Epub 2023 Sep 1.
This paper considers an optimal control of an affine nonlinear system with unknown system dynamics. A new identifier-critic framework is proposed to solve the optimal control problem. Firstly, a neural network identifier is built to estimate the unknown system dynamics, and a critic NN is constructed to solve the Hamiltonian-Jacobi-Bellman equation associated with the optimal control problem. A dynamic regressor extension and mixing technique is applied to design the weight update laws with relaxed persistence of excitation conditions for the two classes of neural networks. The parameter estimation of the update laws and the stability of the closed-loop system under the adaptive optimal control are analyzed using a Lyapunov function method. Numerical simulation results are presented to demonstrate the effectiveness of the proposed IC learning based optimal control algorithm for the affine nonlinear system.
本文考虑了具有未知系统动态的仿射非线性系统的最优控制。提出了一种新的识别器-评价器框架来解决最优控制问题。首先,构建了一个神经网络识别器来估计未知的系统动态,然后构建了一个评价器神经网络来求解与最优控制问题相关的哈密顿-雅可比-贝尔曼方程。应用动态回归扩展和混合技术来设计两类神经网络的权值更新律,同时放宽了对激励条件的持续要求。利用李雅普诺夫函数方法分析了更新律的参数估计和自适应最优控制下闭环系统的稳定性。数值仿真结果验证了所提出的基于 IC 学习的仿射非线性系统最优控制算法的有效性。