• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于强化学习的非线性离散时间系统周期性事件触发自适应跟踪控制设计。

Periodic event-triggered adaptive tracking control design for nonlinear discrete-time systems via reinforcement learning.

机构信息

College of Control Science and Engineering, Bohai University, Jinzhou 121013, Liaoning, China.

School of Information Science and Engineering, Shandong Normal University, Jinan 250014, China.

出版信息

Neural Netw. 2022 Oct;154:43-55. doi: 10.1016/j.neunet.2022.06.039. Epub 2022 Jun 30.

DOI:10.1016/j.neunet.2022.06.039
PMID:35853319
Abstract

In this paper, an event-triggered control scheme with periodic characteristic is developed for nonlinear discrete-time systems under an actor-critic architecture of reinforcement learning (RL). The periodic event-triggered mechanism (ETM) is constructed to decide whether the sampling data are delivered to controllers or not. Meanwhile, the controller is updated only when the event-triggered condition deviates from a prescribed threshold. Compared with traditional continuous ETMs, the proposed periodic ETM can guarantee a minimal lower bound of the inter-event intervals and avoid sampling calculation point-to-point, which means that the partial communication resources can be efficiently economized. The critic and actor neural networks (NNs), consisting of radial basis function neural networks (RBFNNs), aim to approximate the unknown long-term performance index function and the ideal event-triggered controller, respectively. A rigorous stability analysis based on the Lyapunov difference method is provided to substantiate that the closed-loop system can be stabilized. All error signals of the closed-loop system are uniformly ultimately bounded (UUB) under the guidance of the proposed control scheme. Finally, two simulation examples are given to validate the effectiveness of the control design.

摘要

在强化学习(RL)的演员-评论家架构下,为非线性离散时间系统开发了具有周期性特征的事件触发控制方案。构建周期性事件触发机制(ETM)以决定是否将采样数据传输到控制器。同时,仅当事件触发条件偏离规定阈值时,才会更新控制器。与传统的连续 ETM 相比,所提出的周期性 ETM 可以保证最小的事件间隔下界,并避免采样计算点对点,这意味着可以有效地节省部分通信资源。由径向基函数神经网络(RBFNN)组成的评论家神经网络和演员神经网络,旨在分别逼近未知的长期性能指标函数和理想的事件触发控制器。基于李雅普诺夫差分方法的严格稳定性分析证明了闭环系统可以稳定。在提出的控制方案的指导下,闭环系统的所有误差信号都是一致有界的(UUB)。最后,给出了两个仿真示例,以验证控制设计的有效性。

相似文献

1
Periodic event-triggered adaptive tracking control design for nonlinear discrete-time systems via reinforcement learning.基于强化学习的非线性离散时间系统周期性事件触发自适应跟踪控制设计。
Neural Netw. 2022 Oct;154:43-55. doi: 10.1016/j.neunet.2022.06.039. Epub 2022 Jun 30.
2
Dynamic event-triggered controller design for nonlinear systems: Reinforcement learning strategy.动态事件触发控制器设计的非线性系统:强化学习策略。
Neural Netw. 2023 Jun;163:341-353. doi: 10.1016/j.neunet.2023.04.008. Epub 2023 Apr 19.
3
Adaptive Neural Event-Triggered Control for Discrete-Time Strict-Feedback Nonlinear Systems.自适应神经网络事件触发控制的离散时间严格反馈非线性系统。
IEEE Trans Cybern. 2020 Jul;50(7):2946-2958. doi: 10.1109/TCYB.2019.2921733. Epub 2019 Jul 18.
4
Reinforcement learning output feedback NN control using deterministic learning technique.使用确定性学习技术的强化学习输出反馈神经网络控制。
IEEE Trans Neural Netw Learn Syst. 2014 Mar;25(3):635-41. doi: 10.1109/TNNLS.2013.2292704.
5
Event-Triggered Reinforcement Learning-Based Adaptive Tracking Control for Completely Unknown Continuous-Time Nonlinear Systems.基于事件触发强化学习的完全未知连续时间非线性系统自适应跟踪控制
IEEE Trans Cybern. 2020 Jul;50(7):3231-3242. doi: 10.1109/TCYB.2019.2903108. Epub 2019 Mar 29.
6
Reinforcement-learning-based dual-control methodology for complex nonlinear discrete-time systems with application to spark engine EGR operation.基于强化学习的复杂非线性离散时间系统双控制方法及其在火花发动机废气再循环操作中的应用
IEEE Trans Neural Netw. 2008 Aug;19(8):1369-88. doi: 10.1109/TNN.2008.2000452.
7
Reinforcement learning based adaptive optimal control for constrained nonlinear system via a novel state-dependent transformation.基于强化学习的约束非线性系统自适应最优控制:一种新型状态依赖变换方法
ISA Trans. 2023 Feb;133:29-41. doi: 10.1016/j.isatra.2022.07.006. Epub 2022 Jul 12.
8
Periodic Event-Triggered Synchronization for Discrete-Time Complex Dynamical Networks.离散时间复杂动态网络的周期性事件触发同步
IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3622-3633. doi: 10.1109/TNNLS.2021.3053652. Epub 2022 Aug 3.
9
Adaptive Reinforcement Learning Neural Network Control for Uncertain Nonlinear System With Input Saturation.具有输入饱和的不确定非线性系统的自适应强化学习神经网络控制。
IEEE Trans Cybern. 2020 Aug;50(8):3433-3443. doi: 10.1109/TCYB.2019.2921057. Epub 2019 Jun 26.
10
Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints.具有输入约束的非线性离散时间系统的基于强化学习神经网络的控制器
IEEE Trans Syst Man Cybern B Cybern. 2007 Apr;37(2):425-36. doi: 10.1109/tsmcb.2006.883869.

引用本文的文献

1
Improving hepatocellular carcinoma diagnosis using an ensemble classification approach based on Harris Hawks Optimization.基于哈里斯鹰优化算法的集成分类方法用于改善肝细胞癌诊断
Heliyon. 2023 Dec 9;10(1):e23497. doi: 10.1016/j.heliyon.2023.e23497. eCollection 2024 Jan 15.
2
Computational-Intelligence-Based Scheduling with Edge Computing in Cyber-Physical Production Systems.基于计算智能的网络物理生产系统中的边缘计算调度
Entropy (Basel). 2023 Dec 9;25(12):1640. doi: 10.3390/e25121640.
3
Toward improving the performance of learning by joining feature selection and ensemble classification techniques: an application for cancer diagnosis.
为了提高学习性能,结合特征选择和集成分类技术:在癌症诊断中的应用。
J Cancer Res Clin Oncol. 2023 Dec;149(19):16993-17006. doi: 10.1007/s00432-023-05422-6. Epub 2023 Sep 23.
4
Reinforcement Learning-Based Decentralized Safety Control for Constrained Interconnected Nonlinear Safety-Critical Systems.基于强化学习的约束互联非线性安全关键系统的分散式安全控制
Entropy (Basel). 2023 Aug 2;25(8):1158. doi: 10.3390/e25081158.
5
A structured combination of ensemble classifier and filter-based feature selection to improve breast cancer diagnosis.基于集成分类器和基于过滤器的特征选择的结构化组合,以提高乳腺癌诊断。
J Cancer Res Clin Oncol. 2023 Nov;149(16):14519-14534. doi: 10.1007/s00432-023-05238-4. Epub 2023 Aug 12.
6
Data mining techniques in breast cancer diagnosis at the cellular-molecular level.细胞分子水平上乳腺癌诊断中的数据挖掘技术
J Cancer Res Clin Oncol. 2023 Nov;149(14):12605-12620. doi: 10.1007/s00432-023-05090-6. Epub 2023 Jul 14.
7
Combining ensemble classification and integrated filter-evolutionary search for breast cancer diagnosis.结合集成分类与集成滤波器-进化搜索用于乳腺癌诊断。
J Cancer Res Clin Oncol. 2023 Sep;149(12):10753-10769. doi: 10.1007/s00432-023-04968-9. Epub 2023 Jun 13.
8
Modeling the CO separation capability of poly(4-methyl-1-pentane) membrane modified with different nanoparticles by artificial neural networks.用人工神经网络模拟不同纳米粒子改性聚(4-甲基-1-戊烯)膜的 CO2 分离性能。
Sci Rep. 2023 May 31;13(1):8812. doi: 10.1038/s41598-023-36071-x.
9
Investigation of kinetic, isotherm and adsorption efficacy of thorium by orange peel immobilized on calcium alginate.用藻酸钙固定化的橙皮对钍的动力学、等温线和吸附效果的研究。
Sci Rep. 2023 May 24;13(1):8393. doi: 10.1038/s41598-023-35629-z.
10
Wavelet-artificial neural network to predict the acetone sensing by indium oxide/iron oxide nanocomposites.基于小波分析和人工神经网络的方法预测氧化铟/氧化铁纳米复合材料的丙酮传感性能
Sci Rep. 2023 Mar 14;13(1):4266. doi: 10.1038/s41598-023-29898-x.