• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多层神经网络的部分不确定非线性离散时间系统在线最优自适应控制。

Online Optimal Adaptive Control of Partially Uncertain Nonlinear Discrete-Time Systems Using Multilayer Neural Networks.

出版信息

IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4840-4850. doi: 10.1109/TNNLS.2021.3061414. Epub 2022 Aug 31.

DOI:10.1109/TNNLS.2021.3061414
PMID:33710960
Abstract

This article intends to address an online optimal adaptive regulation of nonlinear discrete-time systems in affine form and with partially uncertain dynamics using a multilayer neural network (MNN). The actor-critic framework estimates both the optimal control input and value function. Instantaneous control input error and temporal difference are used to tune the weights of the critic and actor networks, respectively. The selection of the basis functions and their derivatives are not required in the proposed approach. The state vector, critic, and actor NN weights are proven to be bounded using the Lyapunov method. Our approach can be extended to neural networks with an arbitrary number of hidden layers. We have demonstrated our approach via a simulation example.

摘要

本文旨在使用多层神经网络 (MNN) 解决仿射形式和部分不确定动态的非线性离散时间系统的在线最优自适应调节问题。演员-评论家框架估计最优控制输入和价值函数。瞬时控制输入误差和时间差分分别用于调整评论家网络和演员网络的权重。在提出的方法中不需要选择基函数及其导数。使用李雅普诺夫方法证明了状态向量、评论家、演员神经网络权重是有界的。我们的方法可以扩展到具有任意数量隐藏层的神经网络。我们通过一个仿真示例展示了我们的方法。

相似文献

1
Online Optimal Adaptive Control of Partially Uncertain Nonlinear Discrete-Time Systems Using Multilayer Neural Networks.基于多层神经网络的部分不确定非线性离散时间系统在线最优自适应控制。
IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4840-4850. doi: 10.1109/TNNLS.2021.3061414. Epub 2022 Aug 31.
2
Optimal Adaptive Tracking Control of Partially Uncertain Nonlinear Discrete-Time Systems Using Lifelong Hybrid Learning.基于终身混合学习的部分不确定非线性离散时间系统的最优自适应跟踪控制
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17254-17265. doi: 10.1109/TNNLS.2023.3301383. Epub 2024 Dec 2.
3
Reinforcement-learning-based dual-control methodology for complex nonlinear discrete-time systems with application to spark engine EGR operation.基于强化学习的复杂非线性离散时间系统双控制方法及其在火花发动机废气再循环操作中的应用
IEEE Trans Neural Netw. 2008 Aug;19(8):1369-88. doi: 10.1109/TNN.2008.2000452.
4
Online adaptive policy learning algorithm for H∞ state feedback control of unknown affine nonlinear discrete-time systems.用于未知仿射非线性离散时间系统 H∞状态反馈控制的在线自适应策略学习算法。
IEEE Trans Cybern. 2014 Dec;44(12):2706-18. doi: 10.1109/TCYB.2014.2313915. Epub 2014 Jul 28.
5
Optimal Adaptive Control of Uncertain Nonlinear Continuous-Time Systems With Input and State Delays.具有输入和状态时滞的不确定非线性连续时间系统的最优自适应控制。
IEEE Trans Neural Netw Learn Syst. 2023 Jun;34(6):3195-3204. doi: 10.1109/TNNLS.2021.3112566. Epub 2023 Jun 1.
6
Adaptive Critic Learning-Based Robust Control of Systems with Uncertain Dynamics.基于自适应 critic 学习的不确定动态系统鲁棒控制。
Comput Intell Neurosci. 2021 Nov 16;2021:2952115. doi: 10.1155/2021/2952115. eCollection 2021.
7
Finite-Horizon Near-Optimal Output Feedback Neural Network Control of Quantized Nonlinear Discrete-Time Systems With Input Constraint.带输入约束的量化非线性离散时间系统的有限时域近最优输出反馈神经网络控制。
IEEE Trans Neural Netw Learn Syst. 2015 Aug;26(8):1776-88. doi: 10.1109/TNNLS.2015.2409301. Epub 2015 Mar 18.
8
Adaptive optimal control of affine nonlinear systems via identifier-critic neural network approximation with relaxed PE conditions.基于放松的 PE 条件的辨识 - 评论神经网络逼近的仿射非线性系统自适应最优控制。
Neural Netw. 2023 Oct;167:588-600. doi: 10.1016/j.neunet.2023.08.044. Epub 2023 Sep 1.
9
Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks.基于策略迭代和神经网络的未知约束输入系统自适应最优控制。
IEEE Trans Neural Netw Learn Syst. 2013 Oct;24(10):1513-25. doi: 10.1109/TNNLS.2013.2276571.
10
Control of nonaffine nonlinear discrete-time systems using reinforcement-learning-based linearly parameterized neural networks.基于强化学习的线性参数化神经网络对非仿射非线性离散时间系统的控制
IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):994-1001. doi: 10.1109/TSMCB.2008.926607.