• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有模型误差和外部干扰的非线性系统的鲁棒近似动态规划

Robust Approximate Dynamic Programming for Nonlinear Systems With Both Model Error and External Disturbance.

作者信息

Li Jie, Nagamune Ryozo, Zhang Yuhang, Li Shengbo Eben

出版信息

IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):896-910. doi: 10.1109/TNNLS.2023.3335138. Epub 2025 Jan 7.

DOI:10.1109/TNNLS.2023.3335138
PMID:38015685
Abstract

Model error and external disturbance have been separately addressed by optimizing the definite performance in standard linear control problems. However, the concurrent handling of both introduces uncertainty and nonconvexity into the performance, posing a huge challenge for solving nonlinear problems. This article introduces an additional cost function in the augmented Hamilton-Jacobi-Isaacs (HJI) equation of zero-sum games to simultaneously manage the model error and external disturbance in nonlinear robust performance problems. For satisfying the Hamilton-Jacobi inequality in nonlinear robust control theory under all considered model errors, the relationship between the additional cost function and model uncertainty is revealed. A critic online learning algorithm, applying Lyapunov stabilizing terms and historical states to reinforce training stability and achieve persistent learning, is proposed to approximate the solution of the augmented HJI equation. By constructing a joint Lyapunov candidate about the critic weight and system state, both stability and convergence are proved by the second method of Lyapunov. Theoretical results also show that introducing historical data reduces the ultimate bounds of system state and critic error. Three numerical examples are conducted to demonstrate the effectiveness of the proposed method.

摘要

在标准线性控制问题中,通过优化确定性能已分别解决了模型误差和外部干扰问题。然而,同时处理这两者会给性能引入不确定性和非凸性,这对求解非线性问题构成了巨大挑战。本文在零和博弈的增广哈密顿 - 雅可比 - 伊萨克斯(HJI)方程中引入了一个额外的成本函数,以同时处理非线性鲁棒性能问题中的模型误差和外部干扰。为了在所有考虑的模型误差下满足非线性鲁棒控制理论中的哈密顿 - 雅可比不等式,揭示了额外成本函数与模型不确定性之间的关系。提出了一种批评家在线学习算法,该算法应用李雅普诺夫稳定项和历史状态来增强训练稳定性并实现持续学习,以逼近增广HJI方程的解。通过构造关于批评家权重和系统状态的联合李雅普诺夫候选函数,利用李雅普诺夫第二方法证明了稳定性和收敛性。理论结果还表明,引入历史数据会降低系统状态和批评家误差的最终界限。进行了三个数值例子以证明所提方法的有效性。

相似文献

1
Robust Approximate Dynamic Programming for Nonlinear Systems With Both Model Error and External Disturbance.具有模型误差和外部干扰的非线性系统的鲁棒近似动态规划
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):896-910. doi: 10.1109/TNNLS.2023.3335138. Epub 2025 Jan 7.
2
Asymmetric Constrained Optimal Tracking Control With Critic Learning of Nonlinear Multiplayer Zero-Sum Games.基于非线性多人零和博弈批评学习的非对称约束最优跟踪控制
IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):5671-5683. doi: 10.1109/TNNLS.2022.3208611. Epub 2024 Apr 4.
3
A Nonlinear Finite-Time Robust Differential Game Guidance Law.一种非线性有限时间鲁棒微分对策制导律。
Sensors (Basel). 2022 Sep 2;22(17):6650. doi: 10.3390/s22176650.
4
Finite-Horizon $H_{\infty }$ Tracking Control for Unknown Nonlinear Systems With Saturating Actuators.有限时域 $H_{\infty }$ 跟踪控制用于具有饱和执行器的未知非线性系统。
IEEE Trans Neural Netw Learn Syst. 2018 Apr;29(4):1200-1212. doi: 10.1109/TNNLS.2017.2669099. Epub 2017 Feb 28.
5
Optimal H tracking control of nonlinear systems with zero-equilibrium-free via novel adaptive critic designs.通过新颖的自适应评价设计实现具有零平衡点的非线性系统的最优 H 跟踪控制。
Neural Netw. 2023 Jul;164:105-114. doi: 10.1016/j.neunet.2023.04.021. Epub 2023 Apr 20.
6
Online adaptive policy learning algorithm for H∞ state feedback control of unknown affine nonlinear discrete-time systems.用于未知仿射非线性离散时间系统 H∞状态反馈控制的在线自适应策略学习算法。
IEEE Trans Cybern. 2014 Dec;44(12):2706-18. doi: 10.1109/TCYB.2014.2313915. Epub 2014 Jul 28.
7
Robust Neurooptimal Control for a Robot via Adaptive Dynamic Programming.通过自适应动态规划实现机器人的鲁棒神经最优控制。
IEEE Trans Neural Netw Learn Syst. 2021 Jun;32(6):2584-2594. doi: 10.1109/TNNLS.2020.3006850. Epub 2021 Jun 2.
8
Observer-based event-triggered control for zero-sum games of input constrained multi-player nonlinear systems.基于观测器的事件触发控制用于输入受限多玩家非线性系统的零和博弈。
Neural Netw. 2021 Dec;144:101-112. doi: 10.1016/j.neunet.2021.08.012. Epub 2021 Aug 25.
9
Neural network based online simultaneous policy update algorithm for solving the HJI equation in nonlinear H∞ control.基于神经网络的在线同时策略更新算法,用于解决非线性 H∞ 控制中的 HJI 方程。
IEEE Trans Neural Netw Learn Syst. 2012 Dec;23(12):1884-95. doi: 10.1109/TNNLS.2012.2217349.
10
Neural network robust tracking control with adaptive critic framework for uncertain nonlinear systems.基于自适应 critic 框架的不确定非线性系统神经网络鲁棒跟踪控制。
Neural Netw. 2018 Jan;97:11-18. doi: 10.1016/j.neunet.2017.09.005. Epub 2017 Sep 21.