• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于终身混合学习的部分不确定非线性离散时间系统的最优自适应跟踪控制

Optimal Adaptive Tracking Control of Partially Uncertain Nonlinear Discrete-Time Systems Using Lifelong Hybrid Learning.

作者信息

Farzanegan Behzad, Moghadam Rohollah, Jagannathan Sarangapani, Natarajan Pappa

出版信息

IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17254-17265. doi: 10.1109/TNNLS.2023.3301383. Epub 2024 Dec 2.

DOI:10.1109/TNNLS.2023.3301383
PMID:37610896
Abstract

This article addresses a multilayer neural network (MNN)-based optimal adaptive tracking of partially uncertain nonlinear discrete-time (DT) systems in affine form. By employing an actor-critic neural network (NN) to approximate the value function and optimal control policy, the critic NN is updated via a novel hybrid learning scheme, where its weights are adjusted once at a sampling instant and also in a finite iterative manner within the instants to enhance the convergence rate. Moreover, to deal with the persistency of excitation (PE) condition, a replay buffer is incorporated into the critic update law through concurrent learning. To address the vanishing gradient issue, the actor and critic MNN weights are tuned using control input and temporal difference errors (TDEs), respectively. In addition, a weight consolidation scheme is incorporated into the critic MNN update law to attain lifelong learning and overcome catastrophic forgetting, thus lowering the cumulative cost. The tracking error, and the actor and critic weight estimation errors are shown to be bounded using the Lyapunov analysis. Simulation results using the proposed approach on a two-link robot manipulator show a significant reduction in tracking error by 44% and cumulative cost by 31% in a multitask environment.

摘要

本文研究了基于多层神经网络(MNN)的仿射形式部分不确定非线性离散时间(DT)系统的最优自适应跟踪问题。通过使用演员-评论家神经网络(NN)来逼近值函数和最优控制策略,评论家NN通过一种新颖的混合学习方案进行更新,其权重在每个采样时刻调整一次,并且在时刻内以有限迭代方式调整以提高收敛速度。此外,为了处理持续激励(PE)条件,通过并发学习将重放缓冲区纳入评论家更新律。为了解决梯度消失问题,分别使用控制输入和时间差分误差(TDE)来调整演员和评论家MNN的权重。此外,将权重巩固方案纳入评论家MNN更新律以实现终身学习并克服灾难性遗忘,从而降低累积成本。使用李雅普诺夫分析表明跟踪误差以及演员和评论家权重估计误差是有界的。在两连杆机器人操纵器上使用所提出方法的仿真结果表明,在多任务环境中跟踪误差显著降低了44%,累积成本降低了31%。

相似文献

1
Optimal Adaptive Tracking Control of Partially Uncertain Nonlinear Discrete-Time Systems Using Lifelong Hybrid Learning.基于终身混合学习的部分不确定非线性离散时间系统的最优自适应跟踪控制
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17254-17265. doi: 10.1109/TNNLS.2023.3301383. Epub 2024 Dec 2.
2
Online Optimal Adaptive Control of Partially Uncertain Nonlinear Discrete-Time Systems Using Multilayer Neural Networks.基于多层神经网络的部分不确定非线性离散时间系统在线最优自适应控制。
IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4840-4850. doi: 10.1109/TNNLS.2021.3061414. Epub 2022 Aug 31.
3
Lifelong Learning-Based Optimal Trajectory Tracking Control of Constrained Nonlinear Affine Systems Using Deep Neural Networks.
IEEE Trans Cybern. 2024 Dec;54(12):7133-7146. doi: 10.1109/TCYB.2024.3405354. Epub 2024 Nov 27.
4
Optimal Adaptive Control of Uncertain Nonlinear Continuous-Time Systems With Input and State Delays.具有输入和状态时滞的不确定非线性连续时间系统的最优自适应控制。
IEEE Trans Neural Netw Learn Syst. 2023 Jun;34(6):3195-3204. doi: 10.1109/TNNLS.2021.3112566. Epub 2023 Jun 1.
5
Actor-critic-based optimal tracking for partially unknown nonlinear discrete-time systems.基于 actor-critic 的部分未知非线性离散时间系统最优跟踪。
IEEE Trans Neural Netw Learn Syst. 2015 Jan;26(1):140-51. doi: 10.1109/TNNLS.2014.2358227. Epub 2014 Oct 8.
6
Neural network-based finite-horizon optimal control of uncertain affine nonlinear discrete-time systems.基于神经网络的不确定仿射非线性离散时间系统有限时域最优控制。
IEEE Trans Neural Netw Learn Syst. 2015 Mar;26(3):486-99. doi: 10.1109/TNNLS.2014.2315646.
7
Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update.基于时间的策略更新的未知内部动态仿射非线性离散时间系统的在线最优控制
IEEE Trans Neural Netw Learn Syst. 2012 Jul;23(7):1118-29. doi: 10.1109/TNNLS.2012.2196708.
8
Reinforcement-learning-based dual-control methodology for complex nonlinear discrete-time systems with application to spark engine EGR operation.基于强化学习的复杂非线性离散时间系统双控制方法及其在火花发动机废气再循环操作中的应用
IEEE Trans Neural Netw. 2008 Aug;19(8):1369-88. doi: 10.1109/TNN.2008.2000452.
9
Reinforcement learning design-based adaptive tracking control with less learning parameters for nonlinear discrete-time MIMO systems.基于强化学习设计的具有较少学习参数的非线性离散时间 MIMO 系统的自适应跟踪控制。
IEEE Trans Neural Netw Learn Syst. 2015 Jan;26(1):165-76. doi: 10.1109/TNNLS.2014.2360724. Epub 2014 Nov 25.
10
Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks.基于策略迭代和神经网络的未知约束输入系统自适应最优控制。
IEEE Trans Neural Netw Learn Syst. 2013 Oct;24(10):1513-25. doi: 10.1109/TNNLS.2013.2276571.