• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于强化学习的异构多智能体系统分布式输出形成跟踪控制

Distributed output formation tracking control of heterogeneous multi-agent systems using reinforcement learning.

机构信息

School of Automation Science and Electronic Engineering, Science and Technology on Aircraft Control Laboratory, Beihang University, Beijing, 100191, PR China.

School of Automation Science and Electronic Engineering, Science and Technology on Aircraft Control Laboratory, Beihang University, Beijing, 100191, PR China; Institute of Artificial Intelligence, Beihang University, Beijing, 100191, PR China.

出版信息

ISA Trans. 2023 Jul;138:318-328. doi: 10.1016/j.isatra.2023.03.003. Epub 2023 Mar 8.

DOI:10.1016/j.isatra.2023.03.003
PMID:36925421
Abstract

This paper studies the distributed time-varying output formation tracking problem for heterogeneous multi-agent systems with both diverse dimensions and parameters. The output of each follower is supposed to track that of the virtual leader while accomplishing a time-varying formation configuration. First, a distributed trajectory generator is proposed based on neighboring interactions to reconstitute the state of virtual leader and provide expected trajectories with the formation incorporated. Second, an optimal tracking controller is designed by the model-free reinforcement learning technique using online off-policy data instead of requiring any knowledge of the followers' dynamics. Stabilities of the learning process and resulting controller are analyzed while solutions to the output regulator equations are equivalently obtained. Third, a compensational input is designed for each follower based on previous learning results and a derived feasibility condition. It is proved that the output formation tracking error converges to zero asymptotically with the biases to cost functions being restricted arbitrarily small. Finally, numerical simulations verify the proposed learning and control scheme.

摘要

本文研究了维度和参数均不同的异构多智能体系统的分布式时变输出编队跟踪问题。每个跟随者的输出都需要跟踪虚拟领导者的输出,同时完成时变编队配置。首先,基于邻居间的相互作用,提出了一种分布式轨迹生成器来重构虚拟领导者的状态,并提供包含编队的期望轨迹。其次,利用无模型强化学习技术设计了最优跟踪控制器,通过在线的非策略数据进行学习,而无需任何关于跟随者动力学的知识。分析了学习过程和所得控制器的稳定性,同时等效地得到了输出调节器方程的解。然后,根据之前的学习结果和导出的可行性条件,为每个跟随者设计了补偿输入。证明了输出编队跟踪误差随代价函数的偏差任意小而渐近收敛到零。最后,数值仿真验证了所提出的学习和控制方案。

相似文献

1
Distributed output formation tracking control of heterogeneous multi-agent systems using reinforcement learning.基于强化学习的异构多智能体系统分布式输出形成跟踪控制
ISA Trans. 2023 Jul;138:318-328. doi: 10.1016/j.isatra.2023.03.003. Epub 2023 Mar 8.
2
Leader-Follower Output Synchronization of Linear Heterogeneous Systems With Active Leader Using Reinforcement Learning.使用强化学习的主动领导者的线性异类系统的领导者-跟随者输出同步。
IEEE Trans Neural Netw Learn Syst. 2018 Jun;29(6):2139-2153. doi: 10.1109/TNNLS.2018.2803059.
3
Data-driven optimal cooperative tracking control for heterogeneous multi-agent systems.异构多智能体系统的数据驱动最优协同跟踪控制
ISA Trans. 2024 Nov;154:23-31. doi: 10.1016/j.isatra.2024.08.026. Epub 2024 Sep 3.
4
Fully Data-Driven Robust Output Formation Tracking Control for Heterogeneous Multiagent System With Multiple Leaders and Actuator Faults.具有多个领导者和执行器故障的异构多智能体系统的全数据驱动鲁棒输出编队跟踪控制
IEEE Trans Cybern. 2024 May;54(5):3183-3196. doi: 10.1109/TCYB.2022.3227313. Epub 2024 Apr 16.
5
Optimal Tracking Control of Heterogeneous MASs Using Event-Driven Adaptive Observer and Reinforcement Learning.基于事件驱动自适应观测器和强化学习的异构多智能体系统最优跟踪控制
IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):5577-5587. doi: 10.1109/TNNLS.2022.3208237. Epub 2024 Apr 4.
6
Leader-Follower Bipartite Output Synchronization on Signed Digraphs Under Adversarial Factors via Data-Based Reinforcement Learning.基于数据强化学习的带对抗因素符号图上的领导者-跟随者二分输出同步
IEEE Trans Neural Netw Learn Syst. 2020 Oct;31(10):4185-4195. doi: 10.1109/TNNLS.2019.2952611. Epub 2019 Dec 11.
7
Data-Efficient Off-Policy Learning for Distributed Optimal Tracking Control of HMAS With Unidentified Exosystem Dynamics.
IEEE Trans Neural Netw Learn Syst. 2024 Mar;35(3):3181-3190. doi: 10.1109/TNNLS.2022.3172130. Epub 2024 Feb 29.
8
Dynamic Event-Triggered Formation Control for Heterogeneous Multiagent Systems With Nonautonomous Leader Agent.具有非自治领导智能体的异构多智能体系统的动态事件触发编队控制
IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):9685-9699. doi: 10.1109/TNNLS.2022.3159669. Epub 2023 Nov 30.
9
Fixed-time time-varying output formation-containment control of heterogeneous general multi-agent systems.时变时滞异构广义多智能体系统的固定时间输出编队-包含控制
ISA Trans. 2023 Jun;137:210-221. doi: 10.1016/j.isatra.2023.01.008. Epub 2023 Jan 9.
10
Leader-Follower Formation Learning Control of Discrete-Time Nonlinear Multiagent Systems.离散时间非线性多智能体系统的领导者-跟随者编队学习控制
IEEE Trans Cybern. 2023 Feb;53(2):1184-1194. doi: 10.1109/TCYB.2021.3110645. Epub 2023 Jan 13.

引用本文的文献

1
Research on Autonomous Vehicle Path Planning Algorithm Based on Improved RRT* Algorithm and Artificial Potential Field Method.基于改进RRT*算法和人工势场法的自动驾驶车辆路径规划算法研究
Sensors (Basel). 2024 Jun 16;24(12):3899. doi: 10.3390/s24123899.