• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

切换拓扑下多智能体系统输出一致性的双层强化学习

Two-Layer Reinforcement Learning for Output Consensus of Multiagent Systems Under Switching Topology.

作者信息

Wang Zhanshan, Liu Yingying, Zhang Huaguang

出版信息

IEEE Trans Cybern. 2024 Sep;54(9):5463-5472. doi: 10.1109/TCYB.2024.3380001. Epub 2024 Aug 26.

DOI:10.1109/TCYB.2024.3380001
PMID:38598404
Abstract

In this article, the data-based output consensus of discrete-time multiagent systems under switching topology (ST) is studied via reinforcement learning. Due to the existence of ST, the kernel matrix of value function is switching-varying, which cannot be applied to existing algorithms. To overcome the inapplicability of varying kernel matrix, a two-layer reinforcement learning algorithm is proposed in this article. To further implement the proposed algorithm, a data-based distributed control policy is presented, which is applicable to both fixed topology and ST. Besides, the proposed method does not need assumptions on the eigenvalues of leader's dynamic matrix, it avoids the assumptions in the previous method. Subsequently, the convergence of algorithm is analyzed. Finally, three simulation examples are provided to verify the proposed algorithm.

摘要

本文通过强化学习研究切换拓扑(ST)下离散时间多智能体系统基于数据的输出一致性问题。由于ST的存在,值函数的核矩阵是切换变化的,无法应用于现有算法。为克服可变核矩阵的不可用性,本文提出了一种双层强化学习算法。为进一步实现所提算法,给出了一种基于数据的分布式控制策略,该策略适用于固定拓扑和ST。此外,所提方法不需要对领导者动态矩阵的特征值做假设,避免了先前方法中的假设。随后,分析了算法的收敛性。最后,提供了三个仿真实例来验证所提算法。

相似文献

1
Two-Layer Reinforcement Learning for Output Consensus of Multiagent Systems Under Switching Topology.切换拓扑下多智能体系统输出一致性的双层强化学习
IEEE Trans Cybern. 2024 Sep;54(9):5463-5472. doi: 10.1109/TCYB.2024.3380001. Epub 2024 Aug 26.
2
Data-Driven H∞ Output Consensus for Heterogeneous Multiagent Systems Under Switching Topology via Reinforcement Learning.基于强化学习的切换拓扑下异构多智能体系统的数据驱动H∞输出一致性
IEEE Trans Cybern. 2024 Dec;54(12):7865-7876. doi: 10.1109/TCYB.2024.3419056. Epub 2024 Nov 27.
3
Adaptive Bipartite Event-Triggered Output Consensus of Heterogeneous Linear Multiagent Systems Under Fixed and Switching Topologies.固定和切换拓扑下异构线性多智能体系统的自适应二分事件触发输出一致性
IEEE Trans Neural Netw Learn Syst. 2020 Nov;31(11):4816-4830. doi: 10.1109/TNNLS.2019.2958107. Epub 2020 Oct 30.
4
Bipartite Fixed-Time Output Consensus of Heterogeneous Linear Multiagent Systems.异构线性多智能体系统的二分固定时间输出一致性
IEEE Trans Cybern. 2021 Feb;51(2):548-557. doi: 10.1109/TCYB.2019.2936009. Epub 2021 Jan 15.
5
Data-Based Optimal Consensus Control for Multiagent Systems With Policy Gradient Reinforcement Learning.基于数据的多智能体系统最优共识控制与策略梯度强化学习
IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3872-3883. doi: 10.1109/TNNLS.2021.3054685. Epub 2022 Aug 3.
6
Distributed Leader-Following Consensus of Feedforward Nonlinear Delayed Multiagent Systems via General Switched Compensation Control.基于广义切换补偿控制的前馈非线性时滞多智能体系统分布式 leader-following 一致性
IEEE Trans Cybern. 2024 Apr;54(4):2495-2504. doi: 10.1109/TCYB.2023.3245125. Epub 2024 Mar 18.
7
Reinforcement Learning-Based Cooperative Optimal Output Regulation via Distributed Adaptive Internal Model.基于强化学习的分布式自适应内模协同最优输出调节
IEEE Trans Neural Netw Learn Syst. 2022 Oct;33(10):5229-5240. doi: 10.1109/TNNLS.2021.3069728. Epub 2022 Oct 5.
8
Optimal Tracking Control of Heterogeneous MASs Using Event-Driven Adaptive Observer and Reinforcement Learning.基于事件驱动自适应观测器和强化学习的异构多智能体系统最优跟踪控制
IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):5577-5587. doi: 10.1109/TNNLS.2022.3208237. Epub 2024 Apr 4.
9
Finite-Time Event-Triggered Output Consensus of Heterogeneous Fractional-Order Multiagent Systems With Intermittent Communication.有限时间事件触发输出共识的异构分数阶多智能体系统与间歇通信。
IEEE Trans Cybern. 2023 Apr;53(4):2164-2176. doi: 10.1109/TCYB.2021.3110964. Epub 2023 Mar 16.
10
Matrix-Weighted Consensus of Second-Order Discrete-Time Multiagent Systems.二阶离散时间多智能体系统的矩阵加权共识
IEEE Trans Neural Netw Learn Syst. 2024 Mar;35(3):3539-3548. doi: 10.1109/TNNLS.2022.3194010. Epub 2024 Feb 29.