• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于事件触发强化学习的模块化无人系统非零和博弈坐标控制

Event-Trigger Reinforcement Learning-Based Coordinate Control of Modular Unmanned System via Nonzero-Sum Game.

作者信息

Liu Yebao, An Tianjiao, Chen Jianguo, Zhong Luyang, Qian Yuhan

机构信息

Aerospace Times Feihong Technology Company Limited, Beijing 130012, China.

Department of Control Science and Engineering, Changchun University of Technology, Changchun 130012, China.

出版信息

Sensors (Basel). 2025 Jan 7;25(2):314. doi: 10.3390/s25020314.

DOI:10.3390/s25020314
PMID:39860684
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11768288/
Abstract

Decreasing the position error and control torque is important for the coordinate control of a modular unmanned system with less communication burden between the sensor and the actuator. Therefore, this paper proposes event-trigger reinforcement learning (ETRL)-based coordinate control of a modular unmanned system (MUS) via the nonzero-sum game (NZSG) strategy. The dynamic model of the MUS is established via joint torque feedback (JTF) technology. Based on the NZSG strategy, the existing coordinate control problem is transformed into an RL issue. With the help of the ET mechanism, the periodic communication mechanism of the system is avoided. The ET-critic neural network (NN) is used to approximate the performance index function, thus obtaining the ETRL coordinate control policy. The stability of the closed-loop system is verified via Lyapunov's theorem. Experiment results demonstrate the validity of the proposed method. The experimental results show that the proposed method reduces the position error by 30% and control torque by 10% compared with the existing control methods.

摘要

对于传感器与执行器之间通信负担较小的模块化无人系统的坐标控制而言,降低位置误差和控制转矩至关重要。因此,本文提出了一种基于事件触发强化学习(ETRL)的模块化无人系统(MUS)坐标控制方法,该方法采用非零和博弈(NZSG)策略。通过关节转矩反馈(JTF)技术建立了MUS的动态模型。基于NZSG策略,将现有的坐标控制问题转化为强化学习问题。借助事件触发(ET)机制,避免了系统的周期性通信机制。利用ET-评论家神经网络(NN)逼近性能指标函数,从而获得ETRL坐标控制策略。通过李雅普诺夫定理验证了闭环系统的稳定性。实验结果证明了所提方法的有效性。实验结果表明,与现有控制方法相比,所提方法将位置误差降低了30%,控制转矩降低了10%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/084f/11768288/b778035f1860/sensors-25-00314-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/084f/11768288/b778035f1860/sensors-25-00314-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/084f/11768288/b778035f1860/sensors-25-00314-g002.jpg

相似文献

1
Event-Trigger Reinforcement Learning-Based Coordinate Control of Modular Unmanned System via Nonzero-Sum Game.基于事件触发强化学习的模块化无人系统非零和博弈坐标控制
Sensors (Basel). 2025 Jan 7;25(2):314. doi: 10.3390/s25020314.
2
Barrier-critic-disturbance approximate optimal control of nonzero-sum differential games for modular robot manipulators.模块化机器人操纵器非零和微分对策的障碍-批评家-干扰近似最优控制
Neural Netw. 2025 Jan;181:106880. doi: 10.1016/j.neunet.2024.106880. Epub 2024 Nov 6.
3
Event-triggered control for input-constrained nonzero-sum games through particle swarm optimized neural networks.通过粒子群优化神经网络实现输入受限非零和博弈的事件触发控制。
Neural Netw. 2025 Aug;188:107430. doi: 10.1016/j.neunet.2025.107430. Epub 2025 Apr 9.
4
Dynamic Event-Triggered Strategy-Based Optimal Control of Modular Robot Manipulator: A Multiplayer Nonzero-Sum Game Perspective.基于动态事件触发策略的模块化机器人操纵器最优控制:多人非零和博弈视角
IEEE Trans Cybern. 2024 Dec;54(12):7514-7526. doi: 10.1109/TCYB.2024.3468875. Epub 2024 Nov 27.
5
Event-driven H control with critic learning for nonlinear systems.事件驱动的 H 控制与非线性系统的批评学习。
Neural Netw. 2020 Dec;132:30-42. doi: 10.1016/j.neunet.2020.08.004. Epub 2020 Aug 20.
6
Asynchronous learning for actor-critic neural networks and synchronous triggering for multiplayer system.异步学习的演员-批评神经网络和同步触发的多人系统。
ISA Trans. 2022 Oct;129(Pt B):295-308. doi: 10.1016/j.isatra.2022.02.007. Epub 2022 Feb 10.
7
Event-triggered integral reinforcement learning for nonzero-sum games with asymmetric input saturation.具有非零和博弈的事件触发积分强化学习与非对称输入饱和
Neural Netw. 2022 Aug;152:212-223. doi: 10.1016/j.neunet.2022.04.013. Epub 2022 Apr 21.
8
Approximate Optimal Distributed Control of Nonlinear Interconnected Systems Using Event-Triggered Nonzero-Sum Games.基于事件触发非零和博弈的非线性互联系统近似最优分布式控制
IEEE Trans Neural Netw Learn Syst. 2019 May;30(5):1512-1522. doi: 10.1109/TNNLS.2018.2869896. Epub 2018 Oct 8.
9
Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game.未知非线性多人非零和博弈的无模型自适应最优控制
IEEE Trans Neural Netw Learn Syst. 2022 Feb;33(2):879-892. doi: 10.1109/TNNLS.2020.3030127. Epub 2022 Feb 3.
10
Observer-based event-triggered control for zero-sum games of input constrained multi-player nonlinear systems.基于观测器的事件触发控制用于输入受限多玩家非线性系统的零和博弈。
Neural Netw. 2021 Dec;144:101-112. doi: 10.1016/j.neunet.2021.08.012. Epub 2021 Aug 25.

本文引用的文献

1
Barrier-critic-disturbance approximate optimal control of nonzero-sum differential games for modular robot manipulators.模块化机器人操纵器非零和微分对策的障碍-批评家-干扰近似最优控制
Neural Netw. 2025 Jan;181:106880. doi: 10.1016/j.neunet.2024.106880. Epub 2024 Nov 6.
2
Dynamic Event-Triggered Strategy-Based Optimal Control of Modular Robot Manipulator: A Multiplayer Nonzero-Sum Game Perspective.基于动态事件触发策略的模块化机器人操纵器最优控制:多人非零和博弈视角
IEEE Trans Cybern. 2024 Dec;54(12):7514-7526. doi: 10.1109/TCYB.2024.3468875. Epub 2024 Nov 27.
3
Data-Driven Optimal Tracking Control for Discrete-Time Nonlinear Systems With Unknown Dynamics Using Deterministic ADP.
基于确定性自适应动态规划的未知动力学离散时间非线性系统的数据驱动最优跟踪控制
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):1184-1198. doi: 10.1109/TNNLS.2023.3323142. Epub 2025 Jan 7.
4
Optimum Control for Path Tracking Problem of Vehicle Handling Inverse Dynamics.车辆操纵逆动力学路径跟踪问题的最优控制
Sensors (Basel). 2023 Jul 25;23(15):6673. doi: 10.3390/s23156673.
5
Cooperative Game-Based Approximate Optimal Control of Modular Robot Manipulators for Human-Robot Collaboration.基于协同博弈的模块化机器人操作器近似最优控制用于人机协作。
IEEE Trans Cybern. 2023 Jul;53(7):4691-4703. doi: 10.1109/TCYB.2023.3277558. Epub 2023 Jun 15.
6
Constrained-Cost Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems.用于离散时间非线性系统最优控制的约束成本自适应动态规划
IEEE Trans Neural Netw Learn Syst. 2024 Mar;35(3):3251-3264. doi: 10.1109/TNNLS.2023.3237586. Epub 2024 Feb 29.
7
Convergence Analysis of Value Iteration Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems.连续时间非线性系统的价值迭代自适应动态规划收敛性分析
IEEE Trans Cybern. 2024 Mar;54(3):1639-1649. doi: 10.1109/TCYB.2022.3232599. Epub 2024 Feb 9.
8
Distributed Fault Tolerant Consensus Control of Nonlinear Multiagent Systems via Adaptive Dynamic Programming.基于自适应动态规划的非线性多智能体系统分布式容错一致性控制
IEEE Trans Neural Netw Learn Syst. 2024 Jul;35(7):9041-9053. doi: 10.1109/TNNLS.2022.3217774. Epub 2024 Jul 8.
9
Neural-Network-Based Immune Optimization Regulation Using Adaptive Dynamic Programming.基于神经网络的自适应动态规划免疫优化调节
IEEE Trans Cybern. 2023 Mar;53(3):1944-1953. doi: 10.1109/TCYB.2022.3179302. Epub 2023 Feb 15.
10
Optimal Regulation Strategy for Nonzero-Sum Games of the Immune System Using Adaptive Dynamic Programming.基于自适应动态规划的免疫系统非零和博弈最优调控策略
IEEE Trans Cybern. 2023 Mar;53(3):1475-1484. doi: 10.1109/TCYB.2021.3103820. Epub 2023 Feb 15.