Donge Vrushabh S, Lian Bosen, Lewis Frank L, Davoudi Ali
IEEE Trans Cybern. 2024 Mar;54(3):1391-1402. doi: 10.1109/TCYB.2023.3324601. Epub 2024 Feb 9.
This article proposes a data-efficient, model-free reinforcement learning (RL) algorithm using Koopman operators for complex nonlinear systems. A high-dimensional, data-driven optimal control problem for the nonlinear system is obtained by lifting it into a linear system model. We use a data-driven, model-based RL framework to derive an off-policy Bellman equation. Building on this equation, we derive the data-efficient RL algorithm, which does not need a Koopman-built linear system model. The algorithm preserves dynamic information while reducing the data required to learn the optimal control. Numerical and theoretical analyses of the Koopman eigenfunctions used for dataset truncation in the proposed model-free, data-efficient RL algorithm are discussed. We validate our framework on excitation control of a power system.
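To make the lifting idea concrete, here is a minimal sketch (not the paper's algorithm) of an EDMD-style finite-dimensional Koopman approximation: snapshot pairs from a nonlinear map are lifted through a dictionary of observables, and a linear operator is fit by least squares in the lifted space. The dynamics, dictionary, and dimensions are illustrative assumptions.

```python
import numpy as np

# Hypothetical example dynamics and dictionary (assumptions, not from the paper)
f = lambda x: 0.9 * x + 0.1 * x**2   # nonlinear scalar system x_{k+1} = f(x_k)

def lift(x):
    # Dictionary of observables [x, x^2, x^3]; the choice is an assumption
    return np.array([x, x**2, x**3])

# Collect snapshot pairs (x, f(x)) and lift both sides
rng = np.random.default_rng(0)
xs = rng.uniform(-0.5, 0.5, size=200)
Phi_x = np.stack([lift(x) for x in xs])      # (200, 3) lifted states
Phi_y = np.stack([lift(f(x)) for x in xs])   # (200, 3) lifted next states

# Least-squares fit of the finite-dimensional Koopman matrix K: Phi_y ≈ Phi_x @ K,
# i.e., the dynamics become (approximately) linear in the lifted coordinates
K, *_ = np.linalg.lstsq(Phi_x, Phi_y, rcond=None)

# One-step prediction through the lifted linear model; the first observable is x itself
x0 = 0.3
pred = (lift(x0) @ K)[0]
print(abs(pred - f(x0)))  # small: f is exactly linear in this dictionary's span
```

A linear-quadratic controller designed on the lifted linear model then yields a controller for the original nonlinear system; the paper's contribution is learning such a controller off-policy without explicitly building the Koopman model.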