• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有ε误差界的离散时间非线性系统有限时域最优控制的自适应动态规划

Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound.

作者信息

Wang Fei-Yue, Jin Ning, Liu Derong, Wei Qinglai

机构信息

Key Laboratory of Complex Systems and Intelligence Science, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China.

出版信息

IEEE Trans Neural Netw. 2011 Jan;22(1):24-36. doi: 10.1109/TNN.2010.2076370. Epub 2010 Sep 27.

DOI:10.1109/TNN.2010.2076370
PMID:20876014
Abstract

In this paper, we study the finite-horizon optimal control problem for discrete-time nonlinear systems using the adaptive dynamic programming (ADP) approach. The idea is to use an iterative ADP algorithm to obtain the optimal control law which makes the performance index function close to the greatest lower bound of all performance indices within an ε-error bound. The optimal number of control steps can also be obtained by the proposed ADP algorithms. A convergence analysis of the proposed ADP algorithms in terms of performance index function and control policy is made. In order to facilitate the implementation of the iterative ADP algorithms, neural networks are used for approximating the performance index function, computing the optimal control policy, and modeling the nonlinear system. Finally, two simulation examples are employed to illustrate the applicability of the proposed method.

摘要

在本文中,我们使用自适应动态规划(ADP)方法研究离散时间非线性系统的有限时域最优控制问题。其思路是使用一种迭代ADP算法来获得最优控制律,该控制律能使性能指标函数在ε误差范围内接近所有性能指标的最大下界。所提出的ADP算法还能得到最优控制步数。对所提出的ADP算法在性能指标函数和控制策略方面进行了收敛性分析。为便于迭代ADP算法的实现,使用神经网络来逼近性能指标函数、计算最优控制策略以及对非线性系统进行建模。最后,通过两个仿真例子来说明所提方法的适用性。

相似文献

1
Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound.具有ε误差界的离散时间非线性系统有限时域最优控制的自适应动态规划
IEEE Trans Neural Netw. 2011 Jan;22(1):24-36. doi: 10.1109/TNN.2010.2076370. Epub 2010 Sep 27.
2
An iterative ϵ-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state.一类具有非固定初始状态的离散时间非线性系统的迭代 ϵ-最优控制方案。
Neural Netw. 2012 Aug;32:236-44. doi: 10.1016/j.neunet.2012.02.027. Epub 2012 Feb 24.
3
Finite-Approximation-Error-Based Optimal Control Approach for Discrete-Time Nonlinear Systems.基于有限逼近误差的离散时间非线性系统最优控制方法。
IEEE Trans Cybern. 2013 Apr;43(2):779-89. doi: 10.1109/TSMCB.2012.2216523. Epub 2013 Mar 7.
4
Error bounds of adaptive dynamic programming algorithms for solving undiscounted optimal control problems.自适应动态规划算法求解非折扣最优控制问题的误差界。
IEEE Trans Neural Netw Learn Syst. 2015 Jun;26(6):1323-34. doi: 10.1109/TNNLS.2015.2402203. Epub 2015 Mar 3.
5
Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems.策略迭代自适应动态规划算法用于离散时间非线性系统。
IEEE Trans Neural Netw Learn Syst. 2014 Mar;25(3):621-34. doi: 10.1109/TNNLS.2013.2281663.
6
Finite-approximation-error-based discrete-time iterative adaptive dynamic programming.基于有限逼近误差的离散时间迭代自适应动态规划。
IEEE Trans Cybern. 2014 Dec;44(12):2820-33. doi: 10.1109/TCYB.2014.2354377. Epub 2014 Sep 26.
7
Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints.基于神经网络的一类具有控制约束的离散时间仿射非线性系统的近最优控制
IEEE Trans Neural Netw. 2009 Sep;20(9):1490-503. doi: 10.1109/TNN.2009.2027233. Epub 2009 Aug 4.
8
Learning-Based Predictive Control for Discrete-Time Nonlinear Systems With Stochastic Disturbances.基于学习的随机干扰离散时间非线性系统预测控制。
IEEE Trans Neural Netw Learn Syst. 2018 Dec;29(12):6202-6213. doi: 10.1109/TNNLS.2018.2820019. Epub 2018 May 9.
9
Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems.值迭代自适应动态规划在离散时间非线性系统最优控制中的应用。
IEEE Trans Cybern. 2016 Mar;46(3):840-53. doi: 10.1109/TCYB.2015.2492242. Epub 2015 Nov 2.
10
Data-Driven Finite-Horizon Approximate Optimal Control for Discrete-Time Nonlinear Systems Using Iterative HDP Approach.基于迭代 HDP 方法的数据驱动的离散时间非线性系统有限时域近似最优控制。
IEEE Trans Cybern. 2018 Oct;48(10):2948-2961. doi: 10.1109/TCYB.2017.2752845. Epub 2017 Oct 10.

引用本文的文献

1
Adaptive Finite-Time-Based Neural Optimal Control of Time-Delayed Wheeled Mobile Robotics Systems.基于自适应有限时间的轮式移动机器人系统时滞神经最优控制
Sensors (Basel). 2024 Aug 23;24(17):5462. doi: 10.3390/s24175462.
2
A Nonlinear Finite-Time Robust Differential Game Guidance Law.一种非线性有限时间鲁棒微分对策制导律。
Sensors (Basel). 2022 Sep 2;22(17):6650. doi: 10.3390/s22176650.
3
Non-Contact Respiratory Rate Estimation in Real-Time With Modified Joint Unscented Kalman Filter.基于改进联合无迹卡尔曼滤波器的实时非接触式呼吸率估计
IEEE Access. 2020 May 28;8:99445-99457. doi: 10.1109/ACCESS.2020.2998117. eCollection 2020.
4
Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning.具有分层模型学习与规划的高效行动者-评论家算法
Comput Intell Neurosci. 2016;2016:4824072. doi: 10.1155/2016/4824072. Epub 2016 Oct 3.