具有不连续约束的多智能体系统基于估计器的强化学习共识控制

Estimator-Based Reinforcement Learning Consensus Control for Multiagent Systems With Discontinuous Constraints.

作者信息

Luo Ao, Ma Hui, Ren Hongru, Li Hongyi

出版信息

IEEE Trans Neural Netw Learn Syst. 2025 Jun;36(6):11008-11019. doi: 10.1109/TNNLS.2024.3445880.

DOI:10.1109/TNNLS.2024.3445880

Abstract

This article focuses on the optimal consensus control problem for multiagent systems (MASs) with discontinuous constraints. The case of discontinuous constraints is a particular instance of state constraints, which has been studied less but occurs in many practical situations. Due to the discontinuous constraint boundaries, the traditional barrier function-based backstepping methods cannot be used directly. In response to this thorny problem, a novel constraint boundary reconstruction technique is proposed by designing a class of switch-like functions. The technique can convert discontinuous constraint boundaries into continuous ones, and it strictly proves that when the states satisfy the transformed constraint boundaries, the original constraints are also absolutely fulfilled. Meanwhile, with the aid of the barrier function and distributed event-triggered estimator, an improved coordinate transformation is constructed, which can remove the "feasibility condition" and simplify the controller design. In addition, by introducing prediction error and revised term into the learning process of neural networks (NNs), the optimal consensus problem is resolved by constructing a modified reinforcement learning strategy. Finally, the stability of the MASs is testified through the Lyapunov stability theory, and a simulation example verifies the effectiveness of the proposed method.

摘要

本文聚焦于具有不连续约束的多智能体系统（MASs）的最优一致性控制问题。不连续约束的情况是状态约束的一个特殊实例，对此研究较少，但在许多实际情形中都会出现。由于约束边界不连续，传统的基于障碍函数的反步方法无法直接使用。针对这一棘手问题，通过设计一类类似开关的函数，提出了一种新颖的约束边界重构技术。该技术可将不连续的约束边界转换为连续的边界，并严格证明当状态满足变换后的约束边界时，原始约束也能绝对满足。同时，借助障碍函数和分布式事件触发估计器，构建了一种改进的坐标变换，可消除“可行性条件”并简化控制器设计。此外，通过在神经网络（NNs）的学习过程中引入预测误差和修正项，构建一种改进的强化学习策略来解决最优一致性问题。最后，通过李雅普诺夫稳定性理论证明了多智能体系统的稳定性，一个仿真示例验证了所提方法的有效性。

相似文献

Estimator-Based Reinforcement Learning Consensus Control for Multiagent Systems With Discontinuous Constraints.具有不连续约束的多智能体系统基于估计器的强化学习共识控制

IEEE Trans Neural Netw Learn Syst. 2025 Jun;36(6):11008-11019. doi: 10.1109/TNNLS.2024.3445880.

Reinforcement learning-based consensus control for MASs with intermittent constraints.基于强化学习的具有间歇约束的 MASs 一致性控制。

Neural Netw. 2024 Apr;172:106105. doi: 10.1016/j.neunet.2024.106105. Epub 2024 Jan 6.

Observer-Based Consensus Control for MASs With Prescribed Constraints via Reinforcement Learning Algorithm.基于观测器的具有规定约束的多智能体系统通过强化学习算法的一致性控制

IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17281-17291. doi: 10.1109/TNNLS.2023.3301538. Epub 2024 Dec 2.

Event-based distributed cooperative neural learning control for nonlinear multiagent systems with time-varying output constraints.具有时变输出约束的非线性多智能体系统的基于事件的分布式协同神经学习控制

Neural Netw. 2025 Jul;187:107383. doi: 10.1016/j.neunet.2025.107383. Epub 2025 Mar 17.

Cooperative Control for Stochastic Multiagent Systems With Deferred Dynamic Constraints via a Novel Universal Barrier Function Approach.

IEEE Trans Cybern. 2024 Mar;54(3):1806-1815. doi: 10.1109/TCYB.2023.3259967. Epub 2024 Feb 9.

Adaptive Neural Network Control for a Class of Nonlinear Systems With Function Constraints on States.带状态函数约束的一类非线性系统的自适应神经网络控制。

IEEE Trans Neural Netw Learn Syst. 2023 Jun;34(6):2732-2741. doi: 10.1109/TNNLS.2021.3107600. Epub 2023 Jun 1.

Adaptive Neural Consensus Tracking Control for Nonlinear Multiagent Systems Using Integral Barrier Lyapunov Functionals.基于积分型障碍李雅普诺夫泛函的非线性多智能体系统自适应神经一致性跟踪控制

IEEE Trans Neural Netw Learn Syst. 2023 Aug;34(8):4544-4554. doi: 10.1109/TNNLS.2021.3112763. Epub 2023 Aug 4.

Optimized Backstepping-Based Containment Control for Multiagent Systems With Deferred Constraints Using a Universal Nonlinear Transformation.

IEEE Trans Cybern. 2024 Oct;54(10):6058-6068. doi: 10.1109/TCYB.2024.3440004. Epub 2024 Oct 9.

Event-Triggered Approximate Optimal Path-Following Control for Unmanned Surface Vehicles With State Constraints.具有状态约束的无人水面舰艇的事件触发近似最优路径跟踪控制

IEEE Trans Neural Netw Learn Syst. 2023 Jan;34(1):104-118. doi: 10.1109/TNNLS.2021.3090054. Epub 2023 Jan 5.

Adaptive Full-State-Constrained Control of Nonlinear Systems With Deferred Constraints Based on Nonbarrier Lyapunov Function Method.基于非障碍Lyapunov函数方法的具有延迟约束的非线性系统自适应全状态约束控制

IEEE Trans Cybern. 2022 Aug;52(8):7634-7642. doi: 10.1109/TCYB.2020.3036646. Epub 2022 Jul 19.

引用本文的文献

Reinforcement-Learning-Based Fixed-Time Prescribed Performance Consensus Control for Stochastic Nonlinear MASs with Sensor Faults.基于强化学习的具有传感器故障的随机非线性多智能体系统的固定时间预设性能一致性控制

Sensors (Basel). 2024 Dec 11;24(24):7906. doi: 10.3390/s24247906.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验