• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于神经逼近的具有未知非仿射死区输入的非线性离散时间系统的自适应强化学习控制

Adaptive Reinforcement Learning Control Based on Neural Approximation for Nonlinear Discrete-Time Systems With Unknown Nonaffine Dead-Zone Input.

作者信息

Liu Yan-Jun, Li Shu, Tong Shaocheng, Chen C L Philip

出版信息

IEEE Trans Neural Netw Learn Syst. 2019 Jan;30(1):295-305. doi: 10.1109/TNNLS.2018.2844165. Epub 2018 Jun 28.

DOI:10.1109/TNNLS.2018.2844165
PMID:29994726
Abstract

In this paper, an optimal control algorithm is designed for uncertain nonlinear systems in discrete-time, which are in nonaffine form and with unknown dead-zone. The main contributions of this paper are that an optimal control algorithm is for the first time framed in this paper for nonlinear systems with nonaffine dead-zone, and the adaptive parameter law for dead-zone is calculated by using the gradient rules. The mean value theory is employed to deal with the nonaffine dead-zone input and the implicit function theory based on reinforcement learning is appropriately introduced to find an unknown ideal controller which is approximated by using the action network. Other neural networks are taken as the critic networks to approximate the strategic utility functions. Based on the Lyapunov stability analysis theory, we can prove the stability of systems, i.e., the optimal control laws can guarantee that all the signals in the closed-loop system are bounded and the tracking errors are converged to a small compact set. Finally, two simulation examples demonstrate the effectiveness of the design algorithm.

摘要

本文针对离散时间下的不确定非线性系统设计了一种最优控制算法,该系统为非仿射形式且带有未知死区。本文的主要贡献在于首次针对具有非仿射死区的非线性系统构建了最优控制算法,并且利用梯度规则计算了死区的自适应参数律。采用均值理论处理非仿射死区输入,并适当引入基于强化学习的隐函数理论来寻找未知理想控制器,该控制器通过动作网络进行近似。将其他神经网络作为评判网络来近似策略效用函数。基于李雅普诺夫稳定性分析理论,我们可以证明系统的稳定性,即最优控制律能够保证闭环系统中的所有信号都是有界的,并且跟踪误差收敛到一个小的紧致集。最后,两个仿真例子证明了设计算法的有效性。

相似文献

1
Adaptive Reinforcement Learning Control Based on Neural Approximation for Nonlinear Discrete-Time Systems With Unknown Nonaffine Dead-Zone Input.基于神经逼近的具有未知非仿射死区输入的非线性离散时间系统的自适应强化学习控制
IEEE Trans Neural Netw Learn Syst. 2019 Jan;30(1):295-305. doi: 10.1109/TNNLS.2018.2844165. Epub 2018 Jun 28.
2
Adaptive NN tracking control of uncertain nonlinear discrete-time systems with nonaffine dead-zone input.不确定非线性离散时间系统的非仿射死区输入的自适应 NN 跟踪控制。
IEEE Trans Cybern. 2015 Mar;45(3):497-505. doi: 10.1109/TCYB.2014.2329495.
3
A Unified Approach to Adaptive Neural Control for Nonlinear Discrete-Time Systems With Nonlinear Dead-Zone Input.一种统一的方法,用于具有非线性死区输入的非线性离散时间系统的自适应神经控制。
IEEE Trans Neural Netw Learn Syst. 2016 Jan;27(1):139-50. doi: 10.1109/TNNLS.2015.2471262. Epub 2015 Sep 3.
4
Adaptive Control via Neural Output Feedback for a Class of Nonlinear Discrete-Time Systems in a Nested Interconnected Form.基于嵌套互联形式的一类非线性离散时间系统的神经输出反馈自适应控制。
IEEE Trans Cybern. 2018 Sep;48(9):2633-2642. doi: 10.1109/TCYB.2017.2747628. Epub 2017 Sep 14.
5
Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning.基于强化学习的一类未知非仿射非线性系统的离散时间在线学习控制。
Neural Netw. 2014 Jul;55:30-41. doi: 10.1016/j.neunet.2014.03.008. Epub 2014 Mar 28.
6
Reinforcement learning design-based adaptive tracking control with less learning parameters for nonlinear discrete-time MIMO systems.基于强化学习设计的具有较少学习参数的非线性离散时间 MIMO 系统的自适应跟踪控制。
IEEE Trans Neural Netw Learn Syst. 2015 Jan;26(1):165-76. doi: 10.1109/TNNLS.2014.2360724. Epub 2014 Nov 25.
7
Adaptive Output Neural Network Control for a Class of Stochastic Nonlinear Systems With Dead-Zone Nonlinearities.自适应输出神经网络控制一类具有死区非线性的随机非线性系统。
IEEE Trans Neural Netw Learn Syst. 2017 Mar;28(3):726-739. doi: 10.1109/TNNLS.2015.2503004. Epub 2015 Dec 11.
8
Control of nonaffine nonlinear discrete-time systems using reinforcement-learning-based linearly parameterized neural networks.基于强化学习的线性参数化神经网络对非仿射非线性离散时间系统的控制
IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):994-1001. doi: 10.1109/TSMCB.2008.926607.
9
Adaptive NN controller design for a class of nonlinear MIMO discrete-time systems.自适应神经网络控制器设计的一类非线性多变量离散时间系统。
IEEE Trans Neural Netw Learn Syst. 2015 May;26(5):1007-18. doi: 10.1109/TNNLS.2014.2330336. Epub 2014 Jul 21.
10
Observer-based adaptive fuzzy-neural control for a class of uncertain nonlinear systems with unknown dead-zone input.基于观测器的自适应模糊神经网络控制:一类具有未知死区输入的不确定非线性系统
ISA Trans. 2010 Oct;49(4):462-9. doi: 10.1016/j.isatra.2010.06.002. Epub 2010 Jul 2.