• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于自适应动态规划方法的未知一般非线性系统的数据驱动鲁棒近似最优跟踪控制

Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method.

作者信息

Zhang Huaguang, Cui Lili, Zhang Xin, Luo Yanhong

机构信息

School of Information Science and Engineering, Northeastern University, Shenyang 110004, China.

出版信息

IEEE Trans Neural Netw. 2011 Dec;22(12):2226-36. doi: 10.1109/TNN.2011.2168538. Epub 2011 Oct 13.

DOI:10.1109/TNN.2011.2168538
PMID:21997259
Abstract

In this paper, a novel data-driven robust approximate optimal tracking control scheme is proposed for unknown general nonlinear systems by using the adaptive dynamic programming (ADP) method. In the design of the controller, only available input-output data is required instead of known system dynamics. A data-driven model is established by a recurrent neural network (NN) to reconstruct the unknown system dynamics using available input-output data. By adding a novel adjustable term related to the modeling error, the resultant modeling error is first guaranteed to converge to zero. Then, based on the obtained data-driven model, the ADP method is utilized to design the approximate optimal tracking controller, which consists of the steady-state controller and the optimal feedback controller. Further, a robustifying term is developed to compensate for the NN approximation errors introduced by implementing the ADP method. Based on Lyapunov approach, stability analysis of the closed-loop system is performed to show that the proposed controller guarantees the system state asymptotically tracking the desired trajectory. Additionally, the obtained control input is proven to be close to the optimal control input within a small bound. Finally, two numerical examples are used to demonstrate the effectiveness of the proposed control scheme.

摘要

本文采用自适应动态规划(ADP)方法,针对未知的一般非线性系统提出了一种新颖的数据驱动鲁棒近似最优跟踪控制方案。在控制器设计中,仅需要可用的输入输出数据,而无需已知的系统动态模型。通过递归神经网络(NN)建立数据驱动模型,利用可用的输入输出数据重构未知的系统动态。通过添加一个与建模误差相关的新颖可调项,首先保证所得建模误差收敛到零。然后,基于所获得的数据驱动模型,利用ADP方法设计近似最优跟踪控制器,该控制器由稳态控制器和最优反馈控制器组成。此外,还设计了一个鲁棒项来补偿实施ADP方法时引入的神经网络近似误差。基于李雅普诺夫方法,对闭环系统进行稳定性分析,结果表明所提出的控制器保证系统状态渐近跟踪期望轨迹。此外,还证明了所获得的控制输入在一个小范围内接近最优控制输入。最后,通过两个数值例子验证了所提控制方案的有效性。

相似文献

1
Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method.基于自适应动态规划方法的未知一般非线性系统的数据驱动鲁棒近似最优跟踪控制
IEEE Trans Neural Netw. 2011 Dec;22(12):2226-36. doi: 10.1109/TNN.2011.2168538. Epub 2011 Oct 13.
2
Approximate dynamic programming for optimal stationary control with control-dependent noise.具有控制相关噪声的最优平稳控制的近似动态规划
IEEE Trans Neural Netw. 2011 Dec;22(12):2392-8. doi: 10.1109/TNN.2011.2165729. Epub 2011 Sep 26.
3
Data-driven model-free adaptive control for a class of MIMO nonlinear discrete-time systems.一类多输入多输出非线性离散时间系统的数据驱动无模型自适应控制
IEEE Trans Neural Netw. 2011 Dec;22(12):2173-88. doi: 10.1109/TNN.2011.2176141. Epub 2011 Nov 30.
4
Data-based virtual unmodeled dynamics driven multivariable nonlinear adaptive switching control.基于数据的虚拟未建模动力学驱动的多变量非线性自适应切换控制
IEEE Trans Neural Netw. 2011 Dec;22(12):2154-72. doi: 10.1109/TNN.2011.2167685. Epub 2011 Nov 16.
5
Control of nonaffine nonlinear discrete-time systems using reinforcement-learning-based linearly parameterized neural networks.基于强化学习的线性参数化神经网络对非仿射非线性离散时间系统的控制
IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):994-1001. doi: 10.1109/TSMCB.2008.926607.
6
Neural networks-based adaptive control for nonlinear time-varying delays systems with unknown control direction.基于神经网络的未知控制方向非线性时变延迟系统自适应控制
IEEE Trans Neural Netw. 2011 Oct;22(10):1599-612. doi: 10.1109/TNN.2011.2165222. Epub 2011 Aug 30.
7
Online adaptive policy learning algorithm for H∞ state feedback control of unknown affine nonlinear discrete-time systems.用于未知仿射非线性离散时间系统 H∞状态反馈控制的在线自适应策略学习算法。
IEEE Trans Cybern. 2014 Dec;44(12):2706-18. doi: 10.1109/TCYB.2014.2313915. Epub 2014 Jul 28.
8
Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints.具有输入约束的非线性离散时间系统的基于强化学习神经网络的控制器
IEEE Trans Syst Man Cybern B Cybern. 2007 Apr;37(2):425-36. doi: 10.1109/tsmcb.2006.883869.
9
Adaptive feedback control by constrained approximate dynamic programming.基于约束近似动态规划的自适应反馈控制。
IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):982-7. doi: 10.1109/TSMCB.2008.924140.
10
Adaptive predictive control using neural network for a class of pure-feedback systems in discrete time.一类离散时间纯反馈系统的基于神经网络的自适应预测控制。
IEEE Trans Neural Netw. 2008 Sep;19(9):1599-614. doi: 10.1109/TNN.2008.2000446.

引用本文的文献

1
Adaptive Optimal Control of Hybrid Electric Vehicle Power Battery via Policy Learning.基于策略学习的混合动力汽车动力蓄电池自适应最优控制
Comput Intell Neurosci. 2023 May 29;2023:8288527. doi: 10.1155/2023/8288527. eCollection 2023.
2
Adaptive Fuzzy Sliding Mode Control of a Pressure-Controlled Artificial Ventilator.压力控制型人工呼吸机的自适应模糊滑模控制
J Healthc Eng. 2021 Jun 23;2021:1926711. doi: 10.1155/2021/1926711. eCollection 2021.
3
Adaptive Dynamic Programming-Based Multi-Sensor Scheduling for Collaborative Target Tracking in Energy Harvesting Wireless Sensor Networks.
基于自适应动态规划的能量收集无线传感器网络中协作目标跟踪的多传感器调度。
Sensors (Basel). 2018 Nov 22;18(12):4090. doi: 10.3390/s18124090.
4
Neural Net Gains Estimation Based on an Equivalent Model.基于等效模型的神经网络增益估计
Comput Intell Neurosci. 2016;2016:1690924. doi: 10.1155/2016/1690924. Epub 2016 Jun 5.