Reinforcement Learning-Based Cooperative Optimal Output Regulation via Distributed Adaptive Internal Model.

Authors

Gao Weinan, Mynuddin Mohammed, Wunsch Donald C, Jiang Zhong-Ping

Publication information

IEEE Trans Neural Netw Learn Syst. 2022 Oct;33(10):5229-5240. doi: 10.1109/TNNLS.2021.3069728. Epub 2022 Oct 5.

DOI: 10.1109/TNNLS.2021.3069728
PMID: 33852393

Abstract

In this article, a data-driven distributed control method is proposed to solve the cooperative optimal output regulation problem of leader-follower multiagent systems. Different from traditional studies on cooperative output regulation, a distributed adaptive internal model is originally developed, which includes a distributed internal model and a distributed observer to estimate the leader's dynamics. Without relying on the dynamics of multiagent systems, we have proposed two reinforcement learning algorithms, policy iteration and value iteration, to learn the optimal controller through online input and state data, and estimated values of the leader's state. By combining these methods, we have established a basis for connecting data-distributed control methods with adaptive dynamic programming approaches in general since these are the theoretical foundation from which they are built.
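The abstract names two reinforcement learning schemes, policy iteration and value iteration, without showing their structure. The sketch below is a minimal, model-based illustration of both ideas on a scalar continuous-time linear-quadratic problem, assuming the plant dx/dt = a*x + b*u and the cost integrand q*x^2 + r*u^2. It is not the article's data-driven, distributed algorithm: the function names, the scalar plant, the fixed step size, and the use of known a and b are all assumptions made for illustration. It only shows the usual contrast between the two schemes: policy iteration starts from a stabilizing gain and alternates evaluation with improvement, while value iteration starts from p = 0 and needs no stabilizing gain.

```python
import math


def policy_iteration(a, b, q, r, k0, iters=20):
    """Kleinman-style policy iteration for dx/dt = a*x + b*u (scalar case).

    k0 is a hypothetical initial gain; it must stabilize the closed loop.
    """
    if a - b * k0 >= 0:
        raise ValueError("k0 must satisfy a - b*k0 < 0")
    k = k0
    p = None
    for _ in range(iters):
        # Policy evaluation: solve 2*(a - b*k)*p + q + r*k**2 = 0 for p.
        p = (q + r * k * k) / (2.0 * (b * k - a))
        # Policy improvement: gain that minimizes the Hamiltonian for this p.
        k = b * p / r
    return p, k


def value_iteration(a, b, q, r, step=0.01, iters=5000):
    """Fixed-step value iteration on the scalar Riccati residual.

    Starts from p = 0, so no stabilizing initial gain is required.
    The constant step size is an assumption chosen for this example.
    """
    p = 0.0
    for _ in range(iters):
        # Move p along the residual 2*a*p + q - (b*p)**2 / r.
        p += step * (2.0 * a * p + q - (b * p) ** 2 / r)
    return p, b * p / r


def riccati_solution(a, b, q, r):
    """Closed-form stabilizing root of 2*a*p + q - (b*p)**2 / r = 0."""
    return r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)


if __name__ == "__main__":
    a, b, q, r = 1.0, 1.0, 1.0, 1.0  # open-loop unstable example plant
    print("policy iteration:", policy_iteration(a, b, q, r, k0=2.0))
    print("value iteration: ", value_iteration(a, b, q, r))
    print("closed form p:   ", riccati_solution(a, b, q, r))
```

Both routines should approach the same cost parameter p and gain k = b*p/r as the closed-form Riccati root. In the data-driven setting the abstract describes, the evaluation step would instead be carried out from online input and state data together with estimates of the leader's state, rather than from known system parameters.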

Similar articles

1. Reinforcement Learning-Based Cooperative Optimal Output Regulation via Distributed Adaptive Internal Model.
   IEEE Trans Neural Netw Learn Syst. 2022 Oct;33(10):5229-5240. doi: 10.1109/TNNLS.2021.3069728. Epub 2022 Oct 5.
2. Optimal Tracking Control of Heterogeneous MASs Using Event-Driven Adaptive Observer and Reinforcement Learning.
   IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):5577-5587. doi: 10.1109/TNNLS.2022.3208237. Epub 2024 Apr 4.
3. Leader-Follower Output Synchronization of Linear Heterogeneous Systems With Active Leader Using Reinforcement Learning.
   IEEE Trans Neural Netw Learn Syst. 2018 Jun;29(6):2139-2153. doi: 10.1109/TNNLS.2018.2803059.
4. Adaptive Dynamic Event-Triggered Distributed Output Observer for Leader-Follower Multiagent Systems Under Directed Graphs.
   IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17440-17449. doi: 10.1109/TNNLS.2023.3303863. Epub 2024 Dec 2.
5. Cooperative Differential Game-Based Distributed Optimal Synchronization Control of Heterogeneous Nonlinear Multiagent Systems.
   IEEE Trans Cybern. 2023 Dec;53(12):7933-7942. doi: 10.1109/TCYB.2023.3240983. Epub 2023 Nov 29.
6. Data-Driven H∞ Output Consensus for Heterogeneous Multiagent Systems Under Switching Topology via Reinforcement Learning.
   IEEE Trans Cybern. 2024 Dec;54(12):7865-7876. doi: 10.1109/TCYB.2024.3419056. Epub 2024 Nov 27.
7. Leader-Follower Bipartite Output Synchronization on Signed Digraphs Under Adversarial Factors via Data-Based Reinforcement Learning.
   IEEE Trans Neural Netw Learn Syst. 2020 Oct;31(10):4185-4195. doi: 10.1109/TNNLS.2019.2952611. Epub 2019 Dec 11.
8. Leader-Following Consensus of Heterogeneous Linear Multiagent Systems With Communication Time-Delays via Adaptive Distributed Observers.
   IEEE Trans Cybern. 2022 Dec;52(12):13336-13349. doi: 10.1109/TCYB.2021.3115124. Epub 2022 Nov 18.
9. A Hierarchical Distributed Data-Driven Adaptive Learning Control for Nonaffine Nonlinear MASs.
   IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4428-4436. doi: 10.1109/TNNLS.2024.3362864. Epub 2025 Feb 28.
10. Data-driven optimal cooperative tracking control for heterogeneous multi-agent systems.
    ISA Trans. 2024 Nov;154:23-31. doi: 10.1016/j.isatra.2024.08.026. Epub 2024 Sep 3.

Cited by

1. Comparative Study of Cooperative Platoon Merging Control Based on Reinforcement Learning.
   Sensors (Basel). 2023 Jan 15;23(2):990. doi: 10.3390/s23020990.