
Adaptive Observation-Based Efficient Reinforcement Learning for Uncertain Systems.

Publication

IEEE Trans Neural Netw Learn Syst. 2022 Oct;33(10):5492-5503. doi: 10.1109/TNNLS.2021.3070852. Epub 2022 Oct 5.

Abstract

This article develops an adaptive observation-based efficient reinforcement learning (RL) approach for systems with uncertain drift dynamics. A novel concurrent learning adaptive extended observer (CL-AEO) is first designed to jointly estimate the system state and parameters. This observer has a two-time-scale structure and does not require any additional numerical techniques to compute state-derivative information. The idea of concurrent learning (CL) is leveraged to use recorded data, which leads to a relaxed, verifiable excitation condition for the convergence of parameter estimation. Based on the estimated state and parameters provided by the CL-AEO, a simulation-of-experience-based RL scheme is developed to approximate the optimal control policy online. Rigorous theoretical analysis shows that practical convergence of the system state to the origin, and of the developed policy to the ideal optimal policy, can be achieved without the persistence of excitation (PE) condition. Finally, the effectiveness and superiority of the developed methodology are demonstrated via comparative simulations.
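The core idea behind concurrent learning, as summarized in the abstract, is that a memory of recorded regressor data can drive parameter convergence even after the online signal stops being exciting. The following is a minimal illustrative sketch of that mechanism only, for a scalar plant x_dot = theta*x + u with unknown theta; the function name, gains, and memory-recording rule are assumptions for illustration, and the paper's full CL-AEO (two-time-scale extended observer, joint state estimation) and the RL policy approximation are not reproduced here.

```python
# Minimal concurrent-learning (CL) sketch: estimate unknown theta in
# x_dot = theta * x + u from recorded data, without persistent excitation.
# All gains and the recording rule are illustrative assumptions.

def cl_estimate(theta_true=2.0, dt=1e-3, T=5.0, gamma=5.0, k_cl=1.0):
    x, theta_hat = 1.0, 0.0
    memory = []  # recorded (phi, y) pairs, where y = x_dot - u = theta * phi
    for _ in range(int(T / dt)):
        u = -3.0 * x                  # stabilizing input (assumed known)
        x_dot = theta_true * x + u    # simulated true plant
        phi, y = x, x_dot - u         # regressor and its measured response
        # Record informative data early; the state decays to 0 afterwards,
        # so the instantaneous regressor alone would not be exciting.
        if len(memory) < 20 and abs(phi) > 1e-3:
            memory.append((phi, y))
        # CL update law: instantaneous prediction error + sum over memory.
        e_inst = phi * (y - theta_hat * phi)
        e_mem = sum(p * (yy - theta_hat * p) for p, yy in memory)
        theta_hat += dt * gamma * (e_inst + k_cl * e_mem)
        x += dt * x_dot               # Euler integration of the plant
    return theta_hat
```

Because the closed-loop state decays to the origin, the instantaneous term vanishes; it is the summed prediction error over the recorded pairs that keeps the estimate converging to theta_true, which is the relaxed excitation condition the abstract refers to.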
