有限时域最优共识控制的未知多智能体时滞系统。

Finite-Horizon Optimal Consensus Control for Unknown Multiagent State-Delay Systems.

出版信息

IEEE Trans Cybern. 2020 Feb;50(2):402-413. doi: 10.1109/TCYB.2018.2856510. Epub 2018 Sep 10.

DOI:10.1109/TCYB.2018.2856510

PMID:30207970

Abstract

This paper investigates finite-horizon optimal consensus control problem for unknown multiagent systems with state delays. It is well known that optimal consensus control is the solutions to the coupled Hamilton-Jacobi-Bellman (HJB) equations. An off-policy reinforcement learning (RL) algorithm is developed to learn the two-stage optimal consensus solutions to the coupled time-varying HJB equations using the measurable state data instead of the knowledge of the state-delayed system dynamics. Subsequently, for each agent, a single critic neural network (NN) is utilized to approximate the time-varying cost function and help to calculate optimal consensus control policy. Based on the method of weighted residuals, adaptive weight update laws for the critic NNs are proposed. Finally, the simulation results are provided to illustrate the effectiveness of the proposed off-policy RL method.

摘要

本文研究了具有状态时滞的未知多智能体系统的有限时域最优共识控制问题。众所周知，最优共识控制是耦合 Hamilton-Jacobi-Bellman（HJB）方程的解。提出了一种离线强化学习（RL）算法，使用可测量的状态数据而不是状态时滞系统动力学的知识来学习耦合时变 HJB 方程的两阶段最优共识解。随后，对于每个智能体，使用单个评论家神经网络（NN）来近似时变代价函数，并帮助计算最优共识控制策略。基于加权残值法，提出了评论家 NN 的自适应权重更新律。最后，提供了仿真结果以说明所提出的离线 RL 方法的有效性。

相似文献

Finite-Horizon Optimal Consensus Control for Unknown Multiagent State-Delay Systems.

IEEE Trans Cybern. 2020 Feb;50(2):402-413. doi: 10.1109/TCYB.2018.2856510. Epub 2018 Sep 10.

Distributed Optimal Consensus Control for Multiagent Systems With Input Delay.

IEEE Trans Cybern. 2018 Jun;48(6):1747-1759. doi: 10.1109/TCYB.2017.2714173. Epub 2017 Jun 27.

Data-Driven Distributed Optimal Consensus Control for Unknown Multiagent Systems With Input-Delay.

IEEE Trans Cybern. 2019 Jun;49(6):2095-2105. doi: 10.1109/TCYB.2018.2819695. Epub 2018 Apr 9.

Optimal Synchronization Control of Multiagent Systems With Input Saturation via Off-Policy Reinforcement Learning.

IEEE Trans Neural Netw Learn Syst. 2019 Jan;30(1):85-96. doi: 10.1109/TNNLS.2018.2832025. Epub 2018 May 24.

Model-Free Reinforcement Learning for Fully Cooperative Consensus Problem of Nonlinear Multiagent Systems.

IEEE Trans Neural Netw Learn Syst. 2022 Apr;33(4):1482-1491. doi: 10.1109/TNNLS.2020.3042508. Epub 2022 Apr 4.

Neural network-based finite-horizon optimal control of uncertain affine nonlinear discrete-time systems.

IEEE Trans Neural Netw Learn Syst. 2015 Mar;26(3):486-99. doi: 10.1109/TNNLS.2014.2315646.

Reinforcement learning solution for HJB equation arising in constrained optimal control problem.

Neural Netw. 2015 Nov;71:150-8. doi: 10.1016/j.neunet.2015.08.007. Epub 2015 Aug 24.

Dynamic Event-Driven Finite-Horizon Optimal Consensus Control for Constrained Multiagent Systems.

IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):16167-16180. doi: 10.1109/TNNLS.2023.3292154. Epub 2024 Oct 29.

Actor-critic-based optimal tracking for partially unknown nonlinear discrete-time systems.

IEEE Trans Neural Netw Learn Syst. 2015 Jan;26(1):140-51. doi: 10.1109/TNNLS.2014.2358227. Epub 2014 Oct 8.

Off-Policy Reinforcement Learning for Synchronization in Multiagent Graphical Games.

IEEE Trans Neural Netw Learn Syst. 2017 Oct;28(10):2434-2445. doi: 10.1109/TNNLS.2016.2609500. Epub 2017 Apr 17.

引用本文的文献

An Overview of Recent Advances of Resilient Consensus for Multiagent Systems under Attacks.

Comput Intell Neurosci. 2022 Aug 2;2022:6732343. doi: 10.1155/2022/6732343. eCollection 2022.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

有限时域最优共识控制的未知多智能体时滞系统。

Finite-Horizon Optimal Consensus Control for Unknown Multiagent State-Delay Systems.

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献