Reinforcement learning based adaptive optimal control for constrained nonlinear system via a novel state-dependent transformation.

Authors

Yan Lei, Liu Zhi, Chen C L Philip, Zhang Yun, Wu Zongze

Affiliations

School of Automation, Guangdong University of Technology, Guangzhou, Guangdong, 510006, China; School of Intelligent Manufacturing, Nanyang Institute of Technology, Nanyang, Henan, 473004, China.

School of Automation, Guangdong University of Technology, Guangzhou, Guangdong, 510006, China.

Publication information

ISA Trans. 2023 Feb;133:29-41. doi: 10.1016/j.isatra.2022.07.006. Epub 2022 Jul 12.

DOI:10.1016/j.isatra.2022.07.006

PMID:35940933

Abstract

Existing schemes for state-constrained systems either impose feasibility conditions or ignore optimality. This article proposes an adaptive optimal control scheme for strict-feedback nonlinear systems, built in two design steps. First, a novel nonlinear state-dependent function (NSDF) is formulated that equivalently transforms the system into an unconstrained one, so that state constraints are handled without requiring feasibility conditions. Second, an adaptive optimal control scheme is designed for the unconstrained system, in which reinforcement learning (RL) is used to obtain the optimal controller at each design step. The actor and critic neural networks, which approximate the optimal virtual and actual controllers, are updated by modified adaptive laws. It is proved that all signals in the closed-loop system are bounded and that the output tracking error converges to an adjustable neighborhood of the origin that is not affected by the proposed NSDF. Two simulation examples illustrate the effectiveness of the proposed scheme.
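The abstract does not give the form of the NSDF, so the following is only a minimal sketch of the general idea behind a state-dependent transformation, not the authors' function. It assumes an asymmetric constraint -k_l < x < k_u and uses one simple illustrative mapping, zeta = x / ((k_l + x)(k_u - x)), which is strictly increasing on the constraint interval and grows without bound as x approaches either limit. Keeping zeta bounded therefore keeps x inside the constraint. The names `nsdf`, `nsdf_inverse`, `k_l`, and `k_u` are hypothetical.

```python
import math


def nsdf(x, k_l, k_u):
    """Illustrative state-dependent transformation (assumed form, not the paper's).

    Maps a state x in the open interval (-k_l, k_u) to an unconstrained
    variable zeta that tends to +/- infinity at the constraint boundaries.
    """
    if not (-k_l < x < k_u):
        raise ValueError("state violates the constraint -k_l < x < k_u")
    return x / ((k_l + x) * (k_u - x))


def nsdf_inverse(zeta, k_l, k_u):
    """Recover x in (-k_l, k_u) from zeta.

    Solves zeta * (k_l + x) * (k_u - x) = x for the root whose sign
    matches zeta, written in a form that is also valid at zeta = 0.
    """
    b = 1.0 - zeta * (k_u - k_l)
    disc = math.sqrt(b * b + 4.0 * zeta * zeta * k_l * k_u)
    return 2.0 * zeta * k_l * k_u / (b + disc)


if __name__ == "__main__":
    k_l, k_u = 1.0, 2.0  # assumed bounds, chosen only for illustration
    for x in (-0.9, 0.0, 1.0, 1.9):
        zeta = nsdf(x, k_l, k_u)
        print(f"x = {x:+.3f}  zeta = {zeta:+.4f}  "
              f"recovered x = {nsdf_inverse(zeta, k_l, k_u):+.3f}")
```

Because the mapping is a bijection between the constraint interval and the real line, any finite value of the transformed variable corresponds to a state that satisfies the constraint. In this sketch, that is what lets a controller be designed on the unconstrained variable while the original bounds hold.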

Similar articles

Reinforcement learning-based consensus control for MASs with intermittent constraints.

Neural Netw. 2024 Apr;172:106105. doi: 10.1016/j.neunet.2024.106105. Epub 2024 Jan 6.

Prescribed Finite-Time Adaptive Neural Tracking Control for Nonlinear State-Constrained Systems: Barrier Function Approach.

IEEE Trans Neural Netw Learn Syst. 2022 Dec;33(12):7513-7522. doi: 10.1109/TNNLS.2021.3085324. Epub 2022 Nov 30.

Periodic event-triggered adaptive tracking control design for nonlinear discrete-time systems via reinforcement learning.

Neural Netw. 2022 Oct;154:43-55. doi: 10.1016/j.neunet.2022.06.039. Epub 2022 Jun 30.

Dynamic learning from adaptive neural control for full-state constrained strict-feedback nonlinear systems.

Neural Netw. 2024 Feb;170:596-609. doi: 10.1016/j.neunet.2023.11.064. Epub 2023 Nov 30.

Adaptive Full-State-Constrained Control of Nonlinear Systems With Deferred Constraints Based on Nonbarrier Lyapunov Function Method.

IEEE Trans Cybern. 2022 Aug;52(8):7634-7642. doi: 10.1109/TCYB.2020.3036646. Epub 2022 Jul 19.

IBLF-Based Adaptive Neural Control of State-Constrained Uncertain Stochastic Nonlinear Systems.

IEEE Trans Neural Netw Learn Syst. 2022 Dec;33(12):7345-7356. doi: 10.1109/TNNLS.2021.3084820. Epub 2022 Nov 30.

Observer-Based Adaptive Optimized Control for Stochastic Nonlinear Systems With Input and State Constraints.

IEEE Trans Neural Netw Learn Syst. 2022 Dec;33(12):7791-7805. doi: 10.1109/TNNLS.2021.3087796. Epub 2022 Nov 30.

Observer-Based Neuro-Adaptive Optimized Control of Strict-Feedback Nonlinear Systems With State Constraints.

IEEE Trans Neural Netw Learn Syst. 2022 Jul;33(7):3131-3145. doi: 10.1109/TNNLS.2021.3051030. Epub 2022 Jul 6.

Reinforcement-learning-based dual-control methodology for complex nonlinear discrete-time systems with application to spark engine EGR operation.

IEEE Trans Neural Netw. 2008 Aug;19(8):1369-88. doi: 10.1109/TNN.2008.2000452.
