Decentralized optimal control of a class of interconnected nonlinear discrete-time systems by using online Hamilton-Jacobi-Bellman formulation.

Authors

Mehraeen Shahab, Jagannathan Sarangapani

Affiliation

Department of Electrical and Computer Engineering, Louisiana State University, Baton Rouge, LA 70803, USA.

Publication

IEEE Trans Neural Netw. 2011 Nov;22(11):1757-69. doi: 10.1109/TNN.2011.2160968. Epub 2011 Sep 29.

DOI: 10.1109/TNN.2011.2160968
PMID: 21965197
Abstract

In this paper, the direct neural dynamic programming technique is used to solve the Hamilton-Jacobi-Bellman equation forward in time for the decentralized near-optimal regulation of a class of nonlinear interconnected discrete-time systems with unknown internal subsystem and interconnection dynamics, while the input gain matrix is assumed known. Even though the unknown interconnection terms are considered weak and are functions of the entire state vector, decentralized control is attempted under the assumption that only the local state vector is measurable. The decentralized nearly optimal controller for each subsystem consists of two neural networks (NNs): an action NN that aims to provide a nearly optimal control signal, and a critic NN that evaluates the performance of the overall system. All parameters of both NNs are tuned online. Using Lyapunov techniques, it is shown that all subsystem signals are uniformly ultimately bounded and that the synthesized subsystem inputs approach their corresponding nearly optimal control inputs with bounded error. Simulation results are included to show the effectiveness of the approach.
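
As a rough illustration of the setting described in the abstract, the sketch below works through a scalar linear-quadratic special case. It is not the paper's neural-network scheme. Each subsystem is assumed to follow x_i(k+1) = a*x_i(k) + b*u_i(k) + eps*x_j(k), with a quadratic value function J(x) = p*x^2 standing in for the critic and a local feedback u_i = -k*x_i standing in for the action network. The names `local_lq_gain` and `simulate`, and all numeric values, are hypothetical choices made for this example. The gain is computed from the Bellman equation of the isolated subsystem (the interconnection is ignored in the design), and the simulation then checks that local-only feedback still regulates both states when the coupling `eps` is weak.

```python
def local_lq_gain(a, b, q, r, iters=200):
    """Value iteration on the scalar Bellman equation with J(x) = p * x**2.

    Assumed isolated subsystem: x(k+1) = a*x(k) + b*u(k),
    stage cost q*x**2 + r*u**2 (illustrative, not from the paper).
    """
    p = 0.0
    for _ in range(iters):
        # Bellman backup after minimizing over u:
        # p <- q + a^2 p - (a b p)^2 / (r + b^2 p)
        p = q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)
    # The minimizing control is u = -k * x; note that it needs the input gain b.
    k = a * b * p / (r + b * b * p)
    return p, k


def simulate(a, b, k, eps, x0, steps=50):
    """Two identical subsystems coupled through eps; each controller
    measures only its own local state."""
    x1, x2 = x0
    for _ in range(steps):
        u1, u2 = -k * x1, -k * x2  # local feedback only
        x1, x2 = (a * x1 + b * u1 + eps * x2,
                  a * x2 + b * u2 + eps * x1)
    return x1, x2


p, k = local_lq_gain(a=0.9, b=1.0, q=1.0, r=1.0)
x_final = simulate(a=0.9, b=1.0, k=k, eps=0.05, x0=(1.0, -1.0))
```

In this special case the local control satisfies u(k) = -(b*p/r)*x(k+1), which is the scalar form of the standard optimal-control expression for affine discrete-time systems and shows why the input gain has to be known. The paper instead approximates the value function and the control online with neural networks, without a model of the internal or interconnection dynamics.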


Similar articles

1
Decentralized optimal control of a class of interconnected nonlinear discrete-time systems by using online Hamilton-Jacobi-Bellman formulation.
IEEE Trans Neural Netw. 2011 Nov;22(11):1757-69. doi: 10.1109/TNN.2011.2160968. Epub 2011 Sep 29.
2
Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update.
IEEE Trans Neural Netw Learn Syst. 2012 Jul;23(7):1118-29. doi: 10.1109/TNNLS.2012.2196708.
3
Reinforcement-learning-based dual-control methodology for complex nonlinear discrete-time systems with application to spark engine EGR operation.
IEEE Trans Neural Netw. 2008 Aug;19(8):1369-88. doi: 10.1109/TNN.2008.2000452.
4
Particle swarm optimized neural networks based local tracking control scheme of unknown nonlinear interconnected systems.
Neural Netw. 2021 Feb;134:54-63. doi: 10.1016/j.neunet.2020.09.020. Epub 2020 Nov 11.
5
Control of nonaffine nonlinear discrete-time systems using reinforcement-learning-based linearly parameterized neural networks.
IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):994-1001. doi: 10.1109/TSMCB.2008.926607.
6
Decentralized stabilization for a class of continuous-time nonlinear interconnected systems using online learning optimal control approach.
IEEE Trans Neural Netw Learn Syst. 2014 Feb;25(2):418-28. doi: 10.1109/TNNLS.2013.2280013.
7
Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence.
Neural Netw. 2009 Jul-Aug;22(5-6):851-60. doi: 10.1016/j.neunet.2009.06.014. Epub 2009 Jul 1.
8
Adaptive critic designs for optimal control of uncertain nonlinear systems with unmatched interconnections.
Neural Netw. 2018 Sep;105:142-153. doi: 10.1016/j.neunet.2018.05.005. Epub 2018 May 26.
9
Decentralized dynamic surface control of large-scale interconnected systems in strict-feedback form using neural networks with asymptotic stabilization.
IEEE Trans Neural Netw. 2011 Nov;22(11):1709-22. doi: 10.1109/TNN.2011.2140381. Epub 2011 Sep 8.
10
Finite-Horizon Near-Optimal Output Feedback Neural Network Control of Quantized Nonlinear Discrete-Time Systems With Input Constraint.
IEEE Trans Neural Netw Learn Syst. 2015 Aug;26(8):1776-88. doi: 10.1109/TNNLS.2015.2409301. Epub 2015 Mar 18.