• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一类非线性系统的自适应最优控制:在线策略迭代方法。

Adaptive Optimal Control for a Class of Nonlinear Systems: The Online Policy Iteration Approach.

作者信息

He Shuping, Fang Haiyang, Zhang Maoguang, Liu Fei, Ding Zhengtao

出版信息

IEEE Trans Neural Netw Learn Syst. 2020 Feb;31(2):549-558. doi: 10.1109/TNNLS.2019.2905715. Epub 2019 Apr 11.

DOI:10.1109/TNNLS.2019.2905715
PMID:30990199
Abstract

This paper studies the online adaptive optimal controller design for a class of nonlinear systems through a novel policy iteration (PI) algorithm. By using the technique of neural network linear differential inclusion (LDI) to linearize the nonlinear terms in each iteration, the optimal law for controller design can be solved through the relevant algebraic Riccati equation (ARE) without using the system internal parameters. Based on PI approach, the adaptive optimal control algorithm is developed with the online linearization and the two-step iteration, i.e., policy evaluation and policy improvement. The convergence of the proposed PI algorithm is also proved. Finally, two numerical examples are given to illustrate the effectiveness and applicability of the proposed method.

摘要

本文通过一种新颖的策略迭代(PI)算法研究了一类非线性系统的在线自适应最优控制器设计。通过使用神经网络线性微分包含(LDI)技术在每次迭代中对非线性项进行线性化,可以通过相关的代数黎卡提方程(ARE)求解控制器设计的最优律,而无需使用系统内部参数。基于PI方法,通过在线线性化和两步迭代(即策略评估和策略改进)开发了自适应最优控制算法。还证明了所提出的PI算法的收敛性。最后,给出了两个数值例子来说明所提方法的有效性和适用性。

相似文献

1
Adaptive Optimal Control for a Class of Nonlinear Systems: The Online Policy Iteration Approach.一类非线性系统的自适应最优控制:在线策略迭代方法。
IEEE Trans Neural Netw Learn Syst. 2020 Feb;31(2):549-558. doi: 10.1109/TNNLS.2019.2905715. Epub 2019 Apr 11.
2
Model-free optimal controller design for continuous-time nonlinear systems by adaptive dynamic programming based on a precompensator.基于预补偿器的自适应动态规划的连续时间非线性系统无模型最优控制器设计
ISA Trans. 2015 Jul;57:63-70. doi: 10.1016/j.isatra.2014.08.018. Epub 2015 Feb 20.
3
Event-Triggered Adaptive Optimal Control With Output Feedback: An Adaptive Dynamic Programming Approach.具有输出反馈的事件触发自适应最优控制:一种自适应动态规划方法。
IEEE Trans Neural Netw Learn Syst. 2021 Nov;32(11):5208-5221. doi: 10.1109/TNNLS.2020.3027301. Epub 2021 Oct 27.
4
Infinite horizon self-learning optimal control of nonaffine discrete-time nonlinear systems.非仿射离散时间非线性系统的无限时域自学习最优控制。
IEEE Trans Neural Netw Learn Syst. 2015 Apr;26(4):866-79. doi: 10.1109/TNNLS.2015.2401334. Epub 2015 Mar 2.
5
A policy iteration approach to online optimal control of continuous-time constrained-input systems.一种连续时间约束输入系统在线最优控制的策略迭代方法。
ISA Trans. 2013 Sep;52(5):611-21. doi: 10.1016/j.isatra.2013.04.004. Epub 2013 May 24.
6
Adaptive nearly optimal control for a class of continuous-time nonaffine nonlinear systems with inequality constraints.一类具有不等式约束的连续时间非仿射非线性系统的自适应近乎最优控制
ISA Trans. 2017 Jan;66:122-133. doi: 10.1016/j.isatra.2016.10.019. Epub 2016 Nov 9.
7
Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems.策略迭代自适应动态规划算法用于离散时间非线性系统。
IEEE Trans Neural Netw Learn Syst. 2014 Mar;25(3):621-34. doi: 10.1109/TNNLS.2013.2281663.
8
Near-Optimal Controller for Nonlinear Continuous-Time Systems With Unknown Dynamics Using Policy Iteration.基于策略迭代的未知动态非线性连续时间系统的近最优控制器。
IEEE Trans Neural Netw Learn Syst. 2016 Jul;27(7):1537-49. doi: 10.1109/TNNLS.2015.2451535. Epub 2015 Jul 31.
9
Continuous-Time Distributed Policy Iteration for Multicontroller Nonlinear Systems.多控制器非线性系统的连续时间分布式策略迭代
IEEE Trans Cybern. 2021 May;51(5):2372-2383. doi: 10.1109/TCYB.2020.2979614. Epub 2021 Apr 15.
10
Approximate optimal control design for nonlinear one-dimensional parabolic PDE systems using empirical eigenfunctions and neural network.基于经验特征函数和神经网络的非线性一维抛物型偏微分方程系统的近似最优控制设计
IEEE Trans Syst Man Cybern B Cybern. 2012 Dec;42(6):1538-49. doi: 10.1109/TSMCB.2012.2194781. Epub 2012 May 10.

引用本文的文献

1
Distributed Adaptive Optimization Algorithm for High-Order Nonlinear Multi-Agent Stochastic Systems with Lévy Noise.具有列维噪声的高阶非线性多智能体随机系统的分布式自适应优化算法
Entropy (Basel). 2024 Sep 30;26(10):834. doi: 10.3390/e26100834.
2
Robust-optimal control of rotary inverted pendulum control through fuzzy descriptor-based techniques.基于模糊广义系统技术的旋转倒立摆的鲁棒最优控制
Sci Rep. 2024 Mar 7;14(1):5593. doi: 10.1038/s41598-024-56202-2.
3
Adaptive Output Containment Tracking Control for Heterogeneous Wide-Area Networks with Aperiodic Intermittent Communication and Uncertain Leaders.
具有非周期间歇通信和不确定领导者的异构广域网络的自适应输出约束跟踪控制
Sensors (Basel). 2023 Oct 22;23(20):8631. doi: 10.3390/s23208631.
4
A hybrid controller method with genetic algorithm optimization to measure position and angular for mobile robot motion control.一种采用遗传算法优化的混合控制器方法,用于测量移动机器人运动控制中的位置和角度。
Front Robot AI. 2023 Jan 12;9:1087371. doi: 10.3389/frobt.2022.1087371. eCollection 2022.