Periodic event-triggered adaptive tracking control design for nonlinear discrete-time systems via reinforcement learning.

Affiliations

College of Control Science and Engineering, Bohai University, Jinzhou 121013, Liaoning, China.

School of Information Science and Engineering, Shandong Normal University, Jinan 250014, China.

Publication information

Neural Netw. 2022 Oct;154:43-55. doi: 10.1016/j.neunet.2022.06.039. Epub 2022 Jun 30.

DOI:10.1016/j.neunet.2022.06.039

PMID:35853319

Abstract

This paper develops a periodic event-triggered control scheme for nonlinear discrete-time systems under an actor-critic reinforcement learning (RL) architecture. A periodic event-triggered mechanism (ETM) decides whether sampled data are delivered to the controller, and the controller is updated only when the event-triggering condition deviates from a prescribed threshold. Compared with traditional continuous ETMs, the proposed periodic ETM guarantees a minimal lower bound on the inter-event intervals and avoids evaluating the triggering condition at every sampling point, which saves communication resources. The critic and actor neural networks (NNs), both built from radial basis function neural networks (RBFNNs), approximate the unknown long-term performance index function and the ideal event-triggered controller, respectively. A rigorous stability analysis based on the Lyapunov difference method shows that the closed-loop system can be stabilized and that all of its error signals are uniformly ultimately bounded (UUB) under the proposed control scheme. Finally, two simulation examples validate the effectiveness of the control design.
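To make the idea of a periodic ETM concrete, the following is a minimal Python sketch, not the authors' algorithm. It assumes a scalar plant, a fixed stand-in controller instead of the actor-critic RBFNNs, regulation to zero instead of tracking, and a simple absolute-gap trigger; the names `simulate_periodic_etm`, `period`, and `threshold`, and the example plant and controller, are all hypothetical. It shows only the structural point from the abstract: the trigger is checked every `period` steps rather than at every step, so consecutive events are at least `period` steps apart.

```python
import math
from typing import Callable, List, Tuple


def simulate_periodic_etm(
    plant: Callable[[float, float], float],
    controller: Callable[[float], float],
    x0: float,
    steps: int,
    period: int,
    threshold: float,
) -> Tuple[List[float], List[int]]:
    """Simulate a scalar closed loop with a periodic event trigger.

    The trigger is evaluated only at steps that are multiples of `period`.
    An event fires when the gap between the current state and the last
    transmitted state exceeds `threshold`; only then is the control
    input recomputed. Between events the last input is held.
    """
    x = x0
    x_sent = x0               # last state delivered to the controller
    u = controller(x_sent)    # control input held between events
    states = [x]
    event_times = [0]         # the initial transmission counts as an event
    for k in range(1, steps + 1):
        x = plant(x, u)       # one step of the discrete-time plant
        states.append(x)
        # Assumed trigger rule: checked periodically, not at every step.
        if k % period == 0 and abs(x - x_sent) > threshold:
            x_sent = x
            u = controller(x_sent)
            event_times.append(k)
    return states, event_times


def example_plant(x: float, u: float) -> float:
    """Hypothetical nonlinear plant: x_{k+1} = 0.9 x_k + 0.1 sin(x_k) + u_k."""
    return 0.9 * x + 0.1 * math.sin(x) + u


def example_controller(x: float) -> float:
    """Hypothetical fixed feedback law standing in for the learned actor."""
    return -0.5 * x


if __name__ == "__main__":
    states, events = simulate_periodic_etm(
        example_plant, example_controller,
        x0=1.0, steps=60, period=3, threshold=0.05,
    )
    print("controller updates:", len(events), "out of", len(states), "samples")
    print("event times:", events)
```

In this sketch the number of controller updates can never exceed one per `period` steps, which is the sense in which a periodic trigger bounds the inter-event interval from below. A learned actor and critic would replace `example_controller`, and a tracking error would replace the raw state in the trigger.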

Similar articles

Periodic event-triggered adaptive tracking control design for nonlinear discrete-time systems via reinforcement learning.

Neural Netw. 2022 Oct;154:43-55. doi: 10.1016/j.neunet.2022.06.039. Epub 2022 Jun 30.

Dynamic event-triggered controller design for nonlinear systems: Reinforcement learning strategy.

Neural Netw. 2023 Jun;163:341-353. doi: 10.1016/j.neunet.2023.04.008. Epub 2023 Apr 19.

Adaptive Neural Event-Triggered Control for Discrete-Time Strict-Feedback Nonlinear Systems.

IEEE Trans Cybern. 2020 Jul;50(7):2946-2958. doi: 10.1109/TCYB.2019.2921733. Epub 2019 Jul 18.

Reinforcement learning output feedback NN control using deterministic learning technique.

IEEE Trans Neural Netw Learn Syst. 2014 Mar;25(3):635-41. doi: 10.1109/TNNLS.2013.2292704.

Event-Triggered Reinforcement Learning-Based Adaptive Tracking Control for Completely Unknown Continuous-Time Nonlinear Systems.

IEEE Trans Cybern. 2020 Jul;50(7):3231-3242. doi: 10.1109/TCYB.2019.2903108. Epub 2019 Mar 29.

Reinforcement-learning-based dual-control methodology for complex nonlinear discrete-time systems with application to spark engine EGR operation.

IEEE Trans Neural Netw. 2008 Aug;19(8):1369-88. doi: 10.1109/TNN.2008.2000452.

Reinforcement learning based adaptive optimal control for constrained nonlinear system via a novel state-dependent transformation.

ISA Trans. 2023 Feb;133:29-41. doi: 10.1016/j.isatra.2022.07.006. Epub 2022 Jul 12.

Periodic Event-Triggered Synchronization for Discrete-Time Complex Dynamical Networks.

IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3622-3633. doi: 10.1109/TNNLS.2021.3053652. Epub 2022 Aug 3.

Adaptive Reinforcement Learning Neural Network Control for Uncertain Nonlinear System With Input Saturation.

IEEE Trans Cybern. 2020 Aug;50(8):3433-3443. doi: 10.1109/TCYB.2019.2921057. Epub 2019 Jun 26.

Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints.

IEEE Trans Syst Man Cybern B Cybern. 2007 Apr;37(2):425-36. doi: 10.1109/tsmcb.2006.883869.

Cited by

Improving hepatocellular carcinoma diagnosis using an ensemble classification approach based on Harris Hawks Optimization.

Heliyon. 2023 Dec 9;10(1):e23497. doi: 10.1016/j.heliyon.2023.e23497. eCollection 2024 Jan 15.

Computational-Intelligence-Based Scheduling with Edge Computing in Cyber-Physical Production Systems.

Entropy (Basel). 2023 Dec 9;25(12):1640. doi: 10.3390/e25121640.

Toward improving the performance of learning by joining feature selection and ensemble classification techniques: an application for cancer diagnosis.

J Cancer Res Clin Oncol. 2023 Dec;149(19):16993-17006. doi: 10.1007/s00432-023-05422-6. Epub 2023 Sep 23.

Reinforcement Learning-Based Decentralized Safety Control for Constrained Interconnected Nonlinear Safety-Critical Systems.

Entropy (Basel). 2023 Aug 2;25(8):1158. doi: 10.3390/e25081158.

A structured combination of ensemble classifier and filter-based feature selection to improve breast cancer diagnosis.

J Cancer Res Clin Oncol. 2023 Nov;149(16):14519-14534. doi: 10.1007/s00432-023-05238-4. Epub 2023 Aug 12.

Data mining techniques in breast cancer diagnosis at the cellular-molecular level.

J Cancer Res Clin Oncol. 2023 Nov;149(14):12605-12620. doi: 10.1007/s00432-023-05090-6. Epub 2023 Jul 14.

Combining ensemble classification and integrated filter-evolutionary search for breast cancer diagnosis.

J Cancer Res Clin Oncol. 2023 Sep;149(12):10753-10769. doi: 10.1007/s00432-023-04968-9. Epub 2023 Jun 13.

Modeling the CO2 separation capability of poly(4-methyl-1-pentane) membrane modified with different nanoparticles by artificial neural networks.

Sci Rep. 2023 May 31;13(1):8812. doi: 10.1038/s41598-023-36071-x.

Investigation of kinetic, isotherm and adsorption efficacy of thorium by orange peel immobilized on calcium alginate.

Sci Rep. 2023 May 24;13(1):8393. doi: 10.1038/s41598-023-35629-z.

Wavelet-artificial neural network to predict the acetone sensing by indium oxide/iron oxide nanocomposites.

Sci Rep. 2023 Mar 14;13(1):4266. doi: 10.1038/s41598-023-29898-x.
