• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于自适应评判学习的多智能体系统具有规定性能的最优二分共识

Adaptive Critic Learning-Based Optimal Bipartite Consensus for Multiagent Systems With Prescribed Performance.

作者信息

Yan Lei, Liu Junhe, Lai Guanyu, Philip Chen C L, Wu Zongze, Liu Zhi

出版信息

IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):5417-5427. doi: 10.1109/TNNLS.2024.3379503. Epub 2025 Feb 28.

DOI:10.1109/TNNLS.2024.3379503
PMID:38709609
Abstract

Developing a distributed bipartite optimal consensus scheme while ensuring user-predefined performance is essential in practical applications. Existing approaches to this problem typically require a complex controller structure due to adopting an identifier-actor-critic framework and prescribed performance cannot be guaranteed. In this work, an adaptive critic learning (ACL)-based optimal bipartite consensus scheme is developed to bridge the gap. A newly designed error scaling function, which defines the user-predefined settling time and steady accuracy without relying on the initial conditions, is then integrated into a cost function. The backstepping framework combines the ACL and integral reinforcement learning (IRL) algorithm to develop the adaptive optimal bipartite consensus scheme, which contributes a critic-only controller structure by removing the identifier and actor networks in the existing methods. The adaptive law of the critic network is derived by the gradient descent algorithm and experience replay to minimize the IRL-based residual error. It is shown that a compute-saving learning mechanism can achieve the optimal consensus, and the error variables of the closed-loop system are uniformly ultimately bounded (UUB). Besides, in any bounded initial condition, the evolution of bipartite consensus is limited to a user-prescribed boundary under bounded initial conditions. The illustrative simulation results validate the efficacy of the approach.

摘要

在实际应用中,开发一种分布式二分最优共识方案并确保用户预定义的性能至关重要。由于采用了标识符-执行器-评论家框架,解决该问题的现有方法通常需要复杂的控制器结构,并且无法保证规定的性能。在这项工作中,开发了一种基于自适应评论家学习(ACL)的最优二分共识方案来弥补这一差距。一种新设计的误差缩放函数被集成到成本函数中,该函数在不依赖初始条件的情况下定义了用户预定义的调节时间和稳态精度。反步框架将ACL和积分强化学习(IRL)算法相结合,开发出自适应最优二分共识方案,该方案通过去除现有方法中的标识符和执行器网络,贡献了一种仅含评论家的控制器结构。评论家网络的自适应律由梯度下降算法和经验回放推导得出,以最小化基于IRL的残差误差。结果表明,一种节省计算的学习机制可以实现最优共识,并且闭环系统的误差变量是一致最终有界的(UUB)。此外,在任何有界初始条件下,二分共识的演化在有界初始条件下被限制在用户规定的边界内。说明性的仿真结果验证了该方法的有效性。

相似文献

1
Adaptive Critic Learning-Based Optimal Bipartite Consensus for Multiagent Systems With Prescribed Performance.基于自适应评判学习的多智能体系统具有规定性能的最优二分共识
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):5417-5427. doi: 10.1109/TNNLS.2024.3379503. Epub 2025 Feb 28.
2
Adaptive fuzzy fixed-time bipartite consensus control for stochastic nonlinear multi-agent systems with performance constraints.
ISA Trans. 2024 Jul 6. doi: 10.1016/j.isatra.2024.07.004.
3
Hierarchical Sliding-Mode Surface-Based Adaptive Actor-Critic Optimal Control for Switched Nonlinear Systems With Unknown Perturbation.基于分层滑模面的未知扰动切换非线性系统自适应Actor-Critic最优控制
IEEE Trans Neural Netw Learn Syst. 2024 Feb;35(2):1559-1571. doi: 10.1109/TNNLS.2022.3183991. Epub 2024 Feb 5.
4
Reinforcement learning-based consensus control for MASs with intermittent constraints.基于强化学习的具有间歇约束的 MASs 一致性控制。
Neural Netw. 2024 Apr;172:106105. doi: 10.1016/j.neunet.2024.106105. Epub 2024 Jan 6.
5
Adaptive NN Optimal Consensus Fault-Tolerant Control for Stochastic Nonlinear Multiagent Systems.自适应神经网络最优共识容错控制的随机非线性多智能体系统。
IEEE Trans Neural Netw Learn Syst. 2023 Feb;34(2):947-957. doi: 10.1109/TNNLS.2021.3104839. Epub 2023 Feb 3.
6
Data-Driven Optimal Bipartite Consensus Control for Second-Order Multiagent Systems via Policy Gradient Reinforcement Learning.基于策略梯度强化学习的二阶多智能体系统数据驱动最优二分共识控制
IEEE Trans Cybern. 2024 Jun;54(6):3468-3478. doi: 10.1109/TCYB.2023.3276797. Epub 2024 May 30.
7
Observer-Based Consensus Control for MASs With Prescribed Constraints via Reinforcement Learning Algorithm.基于观测器的具有规定约束的多智能体系统通过强化学习算法的一致性控制
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17281-17291. doi: 10.1109/TNNLS.2023.3301538. Epub 2024 Dec 2.
8
Event-based adaptive fixed-time optimal control for saturated fault-tolerant nonlinear multiagent systems via reinforcement learning algorithm.
Neural Netw. 2025 Mar;183:106952. doi: 10.1016/j.neunet.2024.106952. Epub 2024 Nov 28.
9
Reinforcement Learning-Based Predefined-Time Tracking Control for Nonlinear Systems Under Identifier-Critic-Actor Structure.基于强化学习的标识符-评论家-行动者结构下非线性系统的预定义时间跟踪控制
IEEE Trans Cybern. 2024 Nov;54(11):6345-6357. doi: 10.1109/TCYB.2024.3431670. Epub 2024 Oct 30.
10
Adaptive Event-Triggered Bipartite Formation for Multiagent Systems via Reinforcement Learning.基于强化学习的多智能体系统自适应事件触发二分编队
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17817-17828. doi: 10.1109/TNNLS.2023.3309326. Epub 2024 Dec 2.