基于放松的 PE 条件的辨识 - 评论神经网络逼近的仿射非线性系统自适应最优控制。

Adaptive optimal control of affine nonlinear systems via identifier-critic neural network approximation with relaxed PE conditions.

机构信息

School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China.

School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China; Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou 313001, China.

出版信息

Neural Netw. 2023 Oct;167:588-600. doi: 10.1016/j.neunet.2023.08.044. Epub 2023 Sep 1.

DOI:10.1016/j.neunet.2023.08.044

PMID:37703669

Abstract

This paper considers an optimal control of an affine nonlinear system with unknown system dynamics. A new identifier-critic framework is proposed to solve the optimal control problem. Firstly, a neural network identifier is built to estimate the unknown system dynamics, and a critic NN is constructed to solve the Hamiltonian-Jacobi-Bellman equation associated with the optimal control problem. A dynamic regressor extension and mixing technique is applied to design the weight update laws with relaxed persistence of excitation conditions for the two classes of neural networks. The parameter estimation of the update laws and the stability of the closed-loop system under the adaptive optimal control are analyzed using a Lyapunov function method. Numerical simulation results are presented to demonstrate the effectiveness of the proposed IC learning based optimal control algorithm for the affine nonlinear system.

摘要

本文考虑了具有未知系统动态的仿射非线性系统的最优控制。提出了一种新的识别器-评价器框架来解决最优控制问题。首先，构建了一个神经网络识别器来估计未知的系统动态，然后构建了一个评价器神经网络来求解与最优控制问题相关的哈密顿-雅可比-贝尔曼方程。应用动态回归扩展和混合技术来设计两类神经网络的权值更新律，同时放宽了对激励条件的持续要求。利用李雅普诺夫函数方法分析了更新律的参数估计和自适应最优控制下闭环系统的稳定性。数值仿真结果验证了所提出的基于 IC 学习的仿射非线性系统最优控制算法的有效性。

相似文献

Adaptive optimal control of affine nonlinear systems via identifier-critic neural network approximation with relaxed PE conditions.

Neural Netw. 2023 Oct;167:588-600. doi: 10.1016/j.neunet.2023.08.044. Epub 2023 Sep 1.

Optimal Robust Control of Nonlinear Systems with Unknown Dynamics via NN Learning with Relaxed Excitation.

Entropy (Basel). 2024 Jan 14;26(1):0. doi: 10.3390/e26010072.

Self-learning robust optimal control for continuous-time nonlinear systems with mismatched disturbances.

Neural Netw. 2018 Mar;99:19-30. doi: 10.1016/j.neunet.2017.11.022. Epub 2017 Dec 13.

Online adaptive policy learning algorithm for H∞ state feedback control of unknown affine nonlinear discrete-time systems.

IEEE Trans Cybern. 2014 Dec;44(12):2706-18. doi: 10.1109/TCYB.2014.2313915. Epub 2014 Jul 28.

Particle swarm optimized neural networks based local tracking control scheme of unknown nonlinear interconnected systems.

Neural Netw. 2021 Feb;134:54-63. doi: 10.1016/j.neunet.2020.09.020. Epub 2020 Nov 11.

Event-triggered fault-tolerant control for input-constrained nonlinear systems with mismatched disturbances via adaptive dynamic programming.

Neural Netw. 2023 Jul;164:508-520. doi: 10.1016/j.neunet.2023.05.001. Epub 2023 May 6.

Neural network-based finite-horizon optimal control of uncertain affine nonlinear discrete-time systems.

IEEE Trans Neural Netw Learn Syst. 2015 Mar;26(3):486-99. doi: 10.1109/TNNLS.2014.2315646.

Neural network robust tracking control with adaptive critic framework for uncertain nonlinear systems.

Neural Netw. 2018 Jan;97:11-18. doi: 10.1016/j.neunet.2017.09.005. Epub 2017 Sep 21.

Advanced optimal tracking integrating a neural critic technique for asymmetric constrained zero-sum games.

Neural Netw. 2024 Sep;177:106388. doi: 10.1016/j.neunet.2024.106388. Epub 2024 May 15.

Synergetic learning structure-based neuro-optimal fault tolerant control for unknown nonlinear systems.

Neural Netw. 2022 Nov;155:204-214. doi: 10.1016/j.neunet.2022.08.010. Epub 2022 Aug 18.

引用本文的文献

Optimal control under safety constraints and disturbances: a multi-step, off-policy adaptive dynamic programming approach.

Nonlinear Dyn. 2025;113(17):22973-22999. doi: 10.1007/s11071-025-11329-3. Epub 2025 Jun 15.

Advanced holographic convolutional dense networks and Tangent runner optimization for enhanced polycystic ovarian disease classification.

Sci Rep. 2025 May 5;15(1):15719. doi: 10.1038/s41598-025-98873-5.

Investigation of the new optical soliton solutions to the (2+1)-dimensional calogero-bogoyavlenskii schiff model.

Sci Rep. 2024 Dec 30;14(1):32001. doi: 10.1038/s41598-024-83552-8.

Braking failure anti-rollover control and hardware-in-the-loop verification of wire-controlled heavy vehicles.

Sci Rep. 2024 Nov 30;14(1):29802. doi: 10.1038/s41598-024-80854-9.

Assorted optical solitons of the (1+1)- and (2+1)-dimensional Chiral nonlinear Schrödinger equations using modified extended tanh-function technique.

Sci Rep. 2024 Oct 26;14(1):25530. doi: 10.1038/s41598-024-74050-y.

Research on Move-to-Escape Enhanced Dung Beetle Optimization and Its Applications.

Biomimetics (Basel). 2024 Aug 29;9(9):517. doi: 10.3390/biomimetics9090517.

Lyapunov-based neural network model predictive control using metaheuristic optimization approach.

Sci Rep. 2024 Aug 13;14(1):18760. doi: 10.1038/s41598-024-69365-9.

A novel stabilized artificial neural network model enhanced by variational mode decomposing.

Heliyon. 2024 Jul 4;10(13):e34142. doi: 10.1016/j.heliyon.2024.e34142. eCollection 2024 Jul 15.

Novel embedding model predicting the credit card's default using neural network optimized by harmony search algorithm and vortex search algorithm.

Heliyon. 2024 Apr 23;10(9):e30134. doi: 10.1016/j.heliyon.2024.e30134. eCollection 2024 May 15.

FUZ-SMO: A fuzzy slime mould optimizer for mitigating false alarm rates in the classification of underwater datasets using deep convolutional neural networks.

Heliyon. 2024 Mar 28;10(7):e28681. doi: 10.1016/j.heliyon.2024.e28681. eCollection 2024 Apr 15.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于放松的 PE 条件的辨识 - 评论神经网络逼近的仿射非线性系统自适应最优控制。

Adaptive optimal control of affine nonlinear systems via identifier-critic neural network approximation with relaxed PE conditions.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献