
Finite-Horizon Near-Optimal Output Feedback Neural Network Control of Quantized Nonlinear Discrete-Time Systems With Input Constraint.

Publication information

IEEE Trans Neural Netw Learn Syst. 2015 Aug;26(8):1776-88. doi: 10.1109/TNNLS.2015.2409301. Epub 2015 Mar 18.

DOI: 10.1109/TNNLS.2015.2409301
PMID: 25794403

Abstract

This paper addresses the output-feedback near-optimal regulation of uncertain, quantized nonlinear discrete-time systems in affine form, subject to a control constraint over a finite horizon. First, the effect of the input constraint is handled using a nonquadratic cost functional. Next, a neural network (NN)-based Luenberger observer is proposed to reconstruct both the system states and the control coefficient matrix, so that a separate identifier is not needed. Then, an approximate dynamic programming-based actor-critic framework is used to approximate the time-varying solution of the Hamilton-Jacobi-Bellman (HJB) equation using NNs with constant weights and time-dependent activation functions. A new error term is defined and incorporated into the NN update law so that the terminal constraint error is also minimized over time. Finally, a novel dynamic quantizer for the control inputs, with an adaptive step size, is designed to eliminate the quantization error over time, thus overcoming the drawback of the traditional uniform quantizer. The proposed scheme operates forward in time without an offline training phase. Lyapunov analysis is used to investigate stability. Simulation results are given to show the effectiveness and feasibility of the proposed method.
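
The abstract does not give the quantizer equations, so the following is only a minimal sketch of the general idea behind a dynamic quantizer with a shrinking step, assuming a mid-tread uniform quantizer wrapped in a scaling ("zoom") factor. The function names, the geometric `decay` rule for the scale, and all numeric values are illustrative assumptions, not the adaptation law from the paper.

```python
import math


def uniform_quantize(u, step, max_level):
    """Mid-tread uniform quantizer with a fixed step.

    The output saturates at +/- max_level * step.
    """
    level = math.floor(u / step + 0.5)
    level = max(-max_level, min(max_level, level))
    return level * step


def dynamic_quantize(u, scale, step, max_level):
    """Scale the input down, quantize uniformly, then scale back up.

    The effective step size is scale * step, so shrinking `scale`
    shrinks the worst-case quantization error (while u stays in range).
    """
    return scale * uniform_quantize(u / scale, step, max_level)


def simulate(u0=0.8, decay=0.7, steps=8, step=0.25, max_level=4):
    """Compare both quantizers on a decaying control signal.

    Assumption: the scale shrinks geometrically at the same rate as
    the signal. This is a hypothetical rule chosen for illustration.
    """
    rows = []
    for k in range(steps):
        u = u0 * decay**k
        scale = decay**k
        err_uniform = abs(u - uniform_quantize(u, step, max_level))
        err_dynamic = abs(u - dynamic_quantize(u, scale, step, max_level))
        rows.append((k, scale, u, err_uniform, err_dynamic))
    return rows


if __name__ == "__main__":
    for k, scale, u, err_u, err_d in simulate():
        print(f"k={k} u={u:.4f} uniform_err={err_u:.4f} dynamic_err={err_d:.4f}")
```

In this sketch the fixed-step quantizer outputs zero once the signal falls below half a step, so its error equals the signal itself, whereas the scaled quantizer keeps an error bounded by `scale * step / 2` that shrinks along with the scale.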


Similar articles

1
Finite-Horizon Near-Optimal Output Feedback Neural Network Control of Quantized Nonlinear Discrete-Time Systems With Input Constraint.
IEEE Trans Neural Netw Learn Syst. 2015 Aug;26(8):1776-88. doi: 10.1109/TNNLS.2015.2409301. Epub 2015 Mar 18.
2
Neural network-based finite-horizon optimal control of uncertain affine nonlinear discrete-time systems.
IEEE Trans Neural Netw Learn Syst. 2015 Mar;26(3):486-99. doi: 10.1109/TNNLS.2014.2315646.
3
Neural network-based finite horizon stochastic optimal control design for nonlinear networked control systems.
IEEE Trans Neural Netw Learn Syst. 2015 Mar;26(3):472-85. doi: 10.1109/TNNLS.2014.2315622.
4
Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update.
IEEE Trans Neural Netw Learn Syst. 2012 Jul;23(7):1118-29. doi: 10.1109/TNNLS.2012.2196708.
5
Online adaptive policy learning algorithm for H∞ state feedback control of unknown affine nonlinear discrete-time systems.
IEEE Trans Cybern. 2014 Dec;44(12):2706-18. doi: 10.1109/TCYB.2014.2313915. Epub 2014 Jul 28.
6
Finite-horizon control-constrained nonlinear optimal control using single network adaptive critics.
IEEE Trans Neural Netw Learn Syst. 2013 Jan;24(1):145-57. doi: 10.1109/TNNLS.2012.2227339.
7
Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence.
Neural Netw. 2009 Jul-Aug;22(5-6):851-60. doi: 10.1016/j.neunet.2009.06.014. Epub 2009 Jul 1.
8
Decentralized optimal control of a class of interconnected nonlinear discrete-time systems by using online Hamilton-Jacobi-Bellman formulation.
IEEE Trans Neural Netw. 2011 Nov;22(11):1757-69. doi: 10.1109/TNN.2011.2160968. Epub 2011 Sep 29.
9
Optimal control of nonlinear continuous-time systems in strict-feedback form.
IEEE Trans Neural Netw Learn Syst. 2015 Oct;26(10):2535-49. doi: 10.1109/TNNLS.2015.2441712. Epub 2015 Jun 23.
10
Stochastic optimal controller design for uncertain nonlinear networked control system via neuro dynamic programming.
IEEE Trans Neural Netw Learn Syst. 2013 Mar;24(3):471-84. doi: 10.1109/TNNLS.2012.2234133.