• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有未知类反冲滞后的纯反馈离散时间多输入多输出系统的自适应评判设计

Adaptive Critic Design for Pure-Feedback Discrete-Time MIMO Systems Preceded by Unknown Backlashlike Hysteresis.

作者信息

Tang Li, Liu Yan-Jun, Chen C L Philip

出版信息

IEEE Trans Neural Netw Learn Syst. 2018 Nov;29(11):5681-5690. doi: 10.1109/TNNLS.2018.2805689. Epub 2018 Mar 23.

DOI:10.1109/TNNLS.2018.2805689
PMID:29993785
Abstract

This paper concentrates on the adaptive critic design (ACD) issue for a class of uncertain multi-input multioutput (MIMO) nonlinear discrete-time systems preceded by unknown backlashlike hysteresis. The considered systems are in a block-triangular pure-feedback form, in which there exist nonaffine functions and couplings between states and inputs. This makes that the ACD-based optimal control becomes very difficult and complicated. To this end, the mean value theorem is employed to transform the original systems into input-output models. Based on the reinforcement learning algorithm, the optimal control strategy is established with an actor-critic structure. Not only the stability of the systems is ensured but also the performance index is minimized. In contrast to the previous results, the main contributions are: 1) it is the first time to build an ACD framework for such MIMO systems with unknown hysteresis and 2) an adaptive auxiliary signal is developed to compensate the influence of hysteresis. In the end, a numerical study is provided to demonstrate the effectiveness of the present method.

摘要

本文聚焦于一类具有未知类反冲滞后的不确定多输入多输出(MIMO)非线性离散时间系统的自适应评判设计(ACD)问题。所考虑的系统呈块三角纯反馈形式,其中存在非仿射函数以及状态与输入之间的耦合。这使得基于ACD的最优控制变得非常困难和复杂。为此,利用均值定理将原始系统转换为输入 - 输出模型。基于强化学习算法,建立了具有演员 - 评判结构的最优控制策略。不仅确保了系统的稳定性,还使性能指标最小化。与先前的结果相比,主要贡献在于:1)首次为具有未知滞后的此类MIMO系统构建了ACD框架;2)开发了一种自适应辅助信号来补偿滞后的影响。最后,通过数值研究证明了本方法的有效性。

相似文献

1
Adaptive Critic Design for Pure-Feedback Discrete-Time MIMO Systems Preceded by Unknown Backlashlike Hysteresis.具有未知类反冲滞后的纯反馈离散时间多输入多输出系统的自适应评判设计
IEEE Trans Neural Netw Learn Syst. 2018 Nov;29(11):5681-5690. doi: 10.1109/TNNLS.2018.2805689. Epub 2018 Mar 23.
2
Adaptive neural control of MIMO nonlinear systems with a block-triangular pure-feedback control structure.具有块三角纯反馈控制结构的 MIMO 非线性系统的自适应神经控制。
IEEE Trans Neural Netw Learn Syst. 2014 Nov;25(11):2017-29. doi: 10.1109/TNNLS.2014.2302856.
3
Neural-Network-Based Robust Optimal Tracking Control for MIMO Discrete-Time Systems With Unknown Uncertainty Using Adaptive Critic Design.基于神经网络的多输入多输出离散时间系统未知不确定性自适应 critic 设计鲁棒最优跟踪控制。
IEEE Trans Neural Netw Learn Syst. 2018 Apr;29(4):1239-1251. doi: 10.1109/TNNLS.2017.2660070. Epub 2017 Feb 28.
4
Optimal Control-Based Adaptive NN Design for a Class of Nonlinear Discrete-Time Block-Triangular Systems.基于最优控制的一类非线性离散时间块状三角系统的自适应神经网络设计。
IEEE Trans Cybern. 2016 Nov;46(11):2670-2680. doi: 10.1109/TCYB.2015.2494007. Epub 2016 Feb 23.
5
Adaptive Control via Neural Output Feedback for a Class of Nonlinear Discrete-Time Systems in a Nested Interconnected Form.基于嵌套互联形式的一类非线性离散时间系统的神经输出反馈自适应控制。
IEEE Trans Cybern. 2018 Sep;48(9):2633-2642. doi: 10.1109/TCYB.2017.2747628. Epub 2017 Sep 14.
6
Optimal Fault-Tolerant Control for Discrete-Time Nonlinear Strict-Feedback Systems Based on Adaptive Critic Design.基于自适应 Critic 设计的离散时间非线性严格反馈系统最优容错控制
IEEE Trans Neural Netw Learn Syst. 2018 Jun;29(6):2179-2191. doi: 10.1109/TNNLS.2018.2810138.
7
Reinforcement learning design-based adaptive tracking control with less learning parameters for nonlinear discrete-time MIMO systems.基于强化学习设计的具有较少学习参数的非线性离散时间 MIMO 系统的自适应跟踪控制。
IEEE Trans Neural Netw Learn Syst. 2015 Jan;26(1):165-76. doi: 10.1109/TNNLS.2014.2360724. Epub 2014 Nov 25.
8
Adaptive output feedback NN control of a class of discrete-time MIMO nonlinear systems with unknown control directions.一类控制方向未知的离散时间多输入多输出非线性系统的自适应输出反馈神经网络控制
IEEE Trans Syst Man Cybern B Cybern. 2011 Apr;41(2):507-17. doi: 10.1109/TSMCB.2010.2065223. Epub 2010 Sep 2.
9
Adaptive Reinforcement Learning Control Based on Neural Approximation for Nonlinear Discrete-Time Systems With Unknown Nonaffine Dead-Zone Input.基于神经逼近的具有未知非仿射死区输入的非线性离散时间系统的自适应强化学习控制
IEEE Trans Neural Netw Learn Syst. 2019 Jan;30(1):295-305. doi: 10.1109/TNNLS.2018.2844165. Epub 2018 Jun 28.
10
Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning.基于强化学习的一类未知非仿射非线性系统的离散时间在线学习控制。
Neural Netw. 2014 Jul;55:30-41. doi: 10.1016/j.neunet.2014.03.008. Epub 2014 Mar 28.

引用本文的文献

1
A Model-Based Recurrent Neural Network With Randomness for Efficient Control With Applications.一种基于模型的具有随机性的递归神经网络,用于高效控制及应用。
IEEE Trans Industr Inform. 2019 Apr;15(4):2054-2063. doi: 10.1109/TII.2018.2869588. Epub 2018 Sep 10.