• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于强化学习的约束非线性系统自适应最优控制:一种新型状态依赖变换方法

Reinforcement learning based adaptive optimal control for constrained nonlinear system via a novel state-dependent transformation.

作者信息

Yan Lei, Liu Zhi, Chen C L Philip, Zhang Yun, Wu Zongze

机构信息

School of Automation, Guangdong University of Technology, Guangzhou, Guangdong, 510006, China; School of Intelligent Manufacturing, Nanyang Institute of Technology, Nanyang, Henan, 473004, China.

School of Automation, Guangdong University of Technology, Guangzhou, Guangdong, 510006, China.

出版信息

ISA Trans. 2023 Feb;133:29-41. doi: 10.1016/j.isatra.2022.07.006. Epub 2022 Jul 12.

DOI:10.1016/j.isatra.2022.07.006
PMID:35940933
Abstract

Existing schemes for state-constrained systems either impose feasibility conditions or ignore the optimality. In this article, an adaptive optimal control scheme for the strict-feedback nonlinear system is proposed, which benefits from two design steps. Firstly, a novel nonlinear state-dependent function (NSDF) is formulated to equivalently transform the system into a non-constrained one to deal with state constraints without the requirements on feasibility conditions. Secondly, an adaptive optimal control scheme is designed for the non-constrained system, in which reinforcement learning (RL) is utilized to yield the optimal controller in each designing procedure. Updating rules of the actor and critic neural network are driven by the modified adaptive laws, used to approximate the optimal virtual and actual controllers. It is proved that all the signals in the closed-loop system are bounded and the output tracking error converges to an adjustable neighborhood of the origin not affected by the proposed NSDF. Two simulation examples are presented illustrating the effectiveness of the proposed scheme.

摘要

现有的状态约束系统方案要么施加可行性条件,要么忽略最优性。本文提出了一种用于严格反馈非线性系统的自适应最优控制方案,该方案得益于两个设计步骤。首先,构造了一种新颖的非线性状态依赖函数(NSDF),将系统等效地转化为无约束系统,以处理状态约束,而无需可行性条件。其次,为无约束系统设计了一种自适应最优控制方案,其中利用强化学习(RL)在每个设计过程中产生最优控制器。 actor和critic神经网络的更新规则由修改后的自适应律驱动,用于逼近最优虚拟控制器和实际控制器。证明了闭环系统中的所有信号都是有界的,并且输出跟踪误差收敛到原点的一个可调邻域,该邻域不受所提出的NSDF的影响。给出了两个仿真例子,说明了所提方案的有效性。

相似文献

1
Reinforcement learning based adaptive optimal control for constrained nonlinear system via a novel state-dependent transformation.基于强化学习的约束非线性系统自适应最优控制:一种新型状态依赖变换方法
ISA Trans. 2023 Feb;133:29-41. doi: 10.1016/j.isatra.2022.07.006. Epub 2022 Jul 12.
2
Reinforcement learning-based consensus control for MASs with intermittent constraints.基于强化学习的具有间歇约束的 MASs 一致性控制。
Neural Netw. 2024 Apr;172:106105. doi: 10.1016/j.neunet.2024.106105. Epub 2024 Jan 6.
3
Prescribed Finite-Time Adaptive Neural Tracking Control for Nonlinear State-Constrained Systems: Barrier Function Approach.非线性状态约束系统的规定有限时间自适应神经跟踪控制:障碍函数方法
IEEE Trans Neural Netw Learn Syst. 2022 Dec;33(12):7513-7522. doi: 10.1109/TNNLS.2021.3085324. Epub 2022 Nov 30.
4
Periodic event-triggered adaptive tracking control design for nonlinear discrete-time systems via reinforcement learning.基于强化学习的非线性离散时间系统周期性事件触发自适应跟踪控制设计。
Neural Netw. 2022 Oct;154:43-55. doi: 10.1016/j.neunet.2022.06.039. Epub 2022 Jun 30.
5
Dynamic learning from adaptive neural control for full-state constrained strict-feedback nonlinear systems.自适应神经网络控制的动态学习用于全状态约束严格反馈非线性系统。
Neural Netw. 2024 Feb;170:596-609. doi: 10.1016/j.neunet.2023.11.064. Epub 2023 Nov 30.
6
Adaptive Full-State-Constrained Control of Nonlinear Systems With Deferred Constraints Based on Nonbarrier Lyapunov Function Method.基于非障碍Lyapunov函数方法的具有延迟约束的非线性系统自适应全状态约束控制
IEEE Trans Cybern. 2022 Aug;52(8):7634-7642. doi: 10.1109/TCYB.2020.3036646. Epub 2022 Jul 19.
7
IBLF-Based Adaptive Neural Control of State-Constrained Uncertain Stochastic Nonlinear Systems.基于迭代学习律的状态约束不确定随机非线性系统自适应神经控制
IEEE Trans Neural Netw Learn Syst. 2022 Dec;33(12):7345-7356. doi: 10.1109/TNNLS.2021.3084820. Epub 2022 Nov 30.
8
Observer-Based Adaptive Optimized Control for Stochastic Nonlinear Systems With Input and State Constraints.
IEEE Trans Neural Netw Learn Syst. 2022 Dec;33(12):7791-7805. doi: 10.1109/TNNLS.2021.3087796. Epub 2022 Nov 30.
9
Observer-Based Neuro-Adaptive Optimized Control of Strict-Feedback Nonlinear Systems With State Constraints.具有状态约束的严格反馈非线性系统的基于观测器的神经自适应优化控制
IEEE Trans Neural Netw Learn Syst. 2022 Jul;33(7):3131-3145. doi: 10.1109/TNNLS.2021.3051030. Epub 2022 Jul 6.
10
Reinforcement-learning-based dual-control methodology for complex nonlinear discrete-time systems with application to spark engine EGR operation.基于强化学习的复杂非线性离散时间系统双控制方法及其在火花发动机废气再循环操作中的应用
IEEE Trans Neural Netw. 2008 Aug;19(8):1369-88. doi: 10.1109/TNN.2008.2000452.