• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于观测器的具有规定约束的多智能体系统通过强化学习算法的一致性控制

Observer-Based Consensus Control for MASs With Prescribed Constraints via Reinforcement Learning Algorithm.

作者信息

Luo Ao, Zhou Qi, Ma Hui, Li Hongyi

出版信息

IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17281-17291. doi: 10.1109/TNNLS.2023.3301538. Epub 2024 Dec 2.

DOI:10.1109/TNNLS.2023.3301538
PMID:37603472
Abstract

In this article, an adaptive optimal consensus control problem is studied for multiagent systems (MASs) with external disturbances, unmeasurable states, and prescribed constraints. First, by using neural networks (NNs), a composite observer is constructed to estimate the unmeasurable states and disturbances simultaneously. Then, the consensus error is guaranteed within a prescribed boundary by presenting an improved prescribed performance control (PPC) technique, and the initial conditions for the error are eliminated. In addition, the updating laws of actor-critic NNs are established by using a simplified reinforcement learning (RL) algorithm based on the uniqueness of optimal solution, and the asymmetric input saturation is resolved by designing auxiliary system instead of using nonquadratic cost functions in other optimal control methods. Finally, the boundedness of all signals in the closed-loop system is proved by using Lyapunov stability theory. The effectiveness of the proposed control method is verified by a simulation example.

摘要

本文研究了具有外部干扰、不可测状态和规定约束的多智能体系统(MASs)的自适应最优一致性控制问题。首先,利用神经网络(NNs)构造了一个复合观测器,用于同时估计不可测状态和干扰。然后,通过提出一种改进的规定性能控制(PPC)技术,将一致性误差保证在规定边界内,并消除误差的初始条件。此外,基于最优解的唯一性,采用简化的强化学习(RL)算法建立了actor-critic神经网络的更新律,通过设计辅助系统解决了非对称输入饱和问题,而不是像其他最优控制方法那样使用非二次代价函数。最后,利用李雅普诺夫稳定性理论证明了闭环系统中所有信号的有界性。通过仿真例子验证了所提控制方法的有效性。

相似文献

1
Observer-Based Consensus Control for MASs With Prescribed Constraints via Reinforcement Learning Algorithm.基于观测器的具有规定约束的多智能体系统通过强化学习算法的一致性控制
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17281-17291. doi: 10.1109/TNNLS.2023.3301538. Epub 2024 Dec 2.
2
Reinforcement learning-based consensus control for MASs with intermittent constraints.基于强化学习的具有间歇约束的 MASs 一致性控制。
Neural Netw. 2024 Apr;172:106105. doi: 10.1016/j.neunet.2024.106105. Epub 2024 Jan 6.
3
Hierarchical Sliding-Mode Surface-Based Adaptive Actor-Critic Optimal Control for Switched Nonlinear Systems With Unknown Perturbation.基于分层滑模面的未知扰动切换非线性系统自适应Actor-Critic最优控制
IEEE Trans Neural Netw Learn Syst. 2024 Feb;35(2):1559-1571. doi: 10.1109/TNNLS.2022.3183991. Epub 2024 Feb 5.
4
Adaptive Fault-Tolerant Tracking Control for Discrete-Time Multiagent Systems via Reinforcement Learning Algorithm.基于强化学习算法的离散时间多智能体系统自适应容错跟踪控制
IEEE Trans Cybern. 2021 Mar;51(3):1163-1174. doi: 10.1109/TCYB.2020.2982168. Epub 2021 Feb 17.
5
Quantization-Based Adaptive Actor-Critic Tracking Control With Tracking Error Constraints.基于量化的具有跟踪误差约束的自适应动作-评论家跟踪控制。
IEEE Trans Neural Netw Learn Syst. 2018 Apr;29(4):970-980. doi: 10.1109/TNNLS.2017.2651104. Epub 2017 Feb 1.
6
Prescribed Performance Path-Following Control for Rotor-Assisted Vehicles via an Improved Reinforcement Learning Mechanism.通过改进的强化学习机制对旋翼辅助飞行器进行规定性能路径跟踪控制
IEEE Trans Neural Netw Learn Syst. 2025 Sep;36(9):17395-17405. doi: 10.1109/TNNLS.2025.3562245.
7
Event-based adaptive fixed-time optimal control for saturated fault-tolerant nonlinear multiagent systems via reinforcement learning algorithm.
Neural Netw. 2025 Mar;183:106952. doi: 10.1016/j.neunet.2024.106952. Epub 2024 Nov 28.
8
Optimized Backstepping-Based Containment Control for Multiagent Systems With Deferred Constraints Using a Universal Nonlinear Transformation.
IEEE Trans Cybern. 2024 Oct;54(10):6058-6068. doi: 10.1109/TCYB.2024.3440004. Epub 2024 Oct 9.
9
Observer-Based NN Control for Nonlinear Systems With Full-State Constraints and External Disturbances.具有全状态约束和外部干扰的非线性系统基于观测器的神经网络控制
IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4322-4331. doi: 10.1109/TNNLS.2021.3056524. Epub 2022 Aug 31.
10
Estimator-Based Reinforcement Learning Consensus Control for Multiagent Systems With Discontinuous Constraints.具有不连续约束的多智能体系统基于估计器的强化学习共识控制
IEEE Trans Neural Netw Learn Syst. 2025 Jun;36(6):11008-11019. doi: 10.1109/TNNLS.2024.3445880.

引用本文的文献

1
Reinforcement-Learning-Based Fixed-Time Prescribed Performance Consensus Control for Stochastic Nonlinear MASs with Sensor Faults.基于强化学习的具有传感器故障的随机非线性多智能体系统的固定时间预设性能一致性控制
Sensors (Basel). 2024 Dec 11;24(24):7906. doi: 10.3390/s24247906.
2
Consensus-Based Power System State Estimation Algorithm Under Collaborative Attack.协同攻击下基于共识的电力系统状态估计算法
Sensors (Basel). 2024 Oct 27;24(21):6886. doi: 10.3390/s24216886.