模块化机器人操纵器非零和微分对策的障碍-批评家-干扰近似最优控制

Barrier-critic-disturbance approximate optimal control of nonzero-sum differential games for modular robot manipulators.

作者信息

Dong Bo, Zhu Xinye, An Tianjiao, Jiang Hucheng, Ma Bing

机构信息

Department of Control Science and Engineering, Changchun University of Technology, Changchun, 130012, Jilin, China.

出版信息

Neural Netw. 2025 Jan;181:106880. doi: 10.1016/j.neunet.2024.106880. Epub 2024 Nov 6.

DOI:10.1016/j.neunet.2024.106880

PMID:39546873

Abstract

In this paper, for addressing the safe control problem of modular robot manipulators (MRMs) system with uncertain disturbances, an approximate optimal control scheme of nonzero-sum (NZS) differential games is proposed based on the control barrier function (CBF). The dynamic model of the manipulator system integrates joint subsystems through the utilization of joint torque feedback (JTF) technique, incorporating interconnected dynamic coupling (IDC) effects. By integrating the cost functions relevant to each player with the CBF, the evolution of system states is ensured to remain within the safe region. Subsequently, the optimal tracking control problem for the MRM system is reformulated as an NZS game involving multiple joint subsystems. Based on the adaptive dynamic programming (ADP) algorithm, a cost function approximator for solving Hamilton-Jacobi (HJ) equation using only critic neural networks (NN) is established, which promotes the feasible derivation of the approximate optimal control strategy. The Lyapunov theory is utilized to demonstrate that the tracking error is uniformly ultimately bounded (UUB). Utilizing the CBF's state constraint mechanism prevents the robot from deviating from the safe region, and the application of the NZS game approach ensures that the subsystems of the MRM reach a Nash equilibrium. The proposed control method effectively addresses the problem of safe and approximate optimal control of MRM system under uncertain disturbances. Finally, the effectiveness and superiority of the proposed method are verified through simulations and experiments.

摘要

本文针对具有不确定干扰的模块化机器人操纵器（MRM）系统的安全控制问题，基于控制障碍函数（CBF）提出了一种非零和（NZS）微分博弈的近似最优控制方案。操纵器系统的动态模型通过利用关节转矩反馈（JTF）技术集成关节子系统，纳入了相互连接的动态耦合（IDC）效应。通过将与每个参与者相关的成本函数与CBF相结合，确保系统状态的演变保持在安全区域内。随后，将MRM系统的最优跟踪控制问题重新表述为一个涉及多个关节子系统的NZS博弈。基于自适应动态规划（ADP）算法，建立了仅使用评判神经网络（NN）求解哈密顿-雅可比（HJ）方程的成本函数逼近器，这促进了近似最优控制策略的可行推导。利用李雅普诺夫理论证明跟踪误差是一致最终有界（UUB）的。利用CBF的状态约束机制防止机器人偏离安全区域，NZS博弈方法的应用确保了MRM的子系统达到纳什均衡。所提出的控制方法有效地解决了不确定干扰下MRM系统的安全和近似最优控制问题。最后，通过仿真和实验验证了所提方法的有效性和优越性。

相似文献

Barrier-critic-disturbance approximate optimal control of nonzero-sum differential games for modular robot manipulators.模块化机器人操纵器非零和微分对策的障碍-批评家-干扰近似最优控制

Neural Netw. 2025 Jan;181:106880. doi: 10.1016/j.neunet.2024.106880. Epub 2024 Nov 6.

Cooperative Game-Based Approximate Optimal Control of Modular Robot Manipulators for Human-Robot Collaboration.基于协同博弈的模块化机器人操作器近似最优控制用于人机协作。

IEEE Trans Cybern. 2023 Jul;53(7):4691-4703. doi: 10.1109/TCYB.2023.3277558. Epub 2023 Jun 15.

Dynamic Event-Triggered Strategy-Based Optimal Control of Modular Robot Manipulator: A Multiplayer Nonzero-Sum Game Perspective.基于动态事件触发策略的模块化机器人操纵器最优控制：多人非零和博弈视角

IEEE Trans Cybern. 2024 Dec;54(12):7514-7526. doi: 10.1109/TCYB.2024.3468875. Epub 2024 Nov 27.

Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics.具有未知动态的非零和博弈系统最优控制的经验回放。

IEEE Trans Cybern. 2016 Mar;46(3):854-65. doi: 10.1109/TCYB.2015.2488680. Epub 2015 Oct 26.

Approximate Optimal Distributed Control of Nonlinear Interconnected Systems Using Event-Triggered Nonzero-Sum Games.基于事件触发非零和博弈的非线性互联系统近似最优分布式控制

IEEE Trans Neural Netw Learn Syst. 2019 May;30(5):1512-1522. doi: 10.1109/TNNLS.2018.2869896. Epub 2018 Oct 8.

Robust Trajectory Tracking Control for Continuous-Time Nonlinear Systems with State Constraints and Uncertain Disturbances.具有状态约束和不确定干扰的连续时间非线性系统的鲁棒轨迹跟踪控制

Entropy (Basel). 2022 Jun 11;24(6):816. doi: 10.3390/e24060816.

Advanced optimal tracking integrating a neural critic technique for asymmetric constrained zero-sum games.高级最优跟踪，整合神经批评技术，用于非对称约束零和博弈。

Neural Netw. 2024 Sep;177:106388. doi: 10.1016/j.neunet.2024.106388. Epub 2024 May 15.

Optimal H tracking control of nonlinear systems with zero-equilibrium-free via novel adaptive critic designs.通过新颖的自适应评价设计实现具有零平衡点的非线性系统的最优 H 跟踪控制。

Neural Netw. 2023 Jul;164:105-114. doi: 10.1016/j.neunet.2023.04.021. Epub 2023 Apr 20.

Differential-game for resource aware approximate optimal control of large-scale nonlinear systems with multiple players.具有多玩家的大规模非线性系统资源感知近似最优控制的微分博弈。

Neural Netw. 2020 Apr;124:95-108. doi: 10.1016/j.neunet.2019.12.031. Epub 2020 Jan 14.

Approximate N-Player Nonzero-Sum Game Solution for an Uncertain Continuous Nonlinear System.不确定连续非线性系统的近似 N 人非零和博弈解。

IEEE Trans Neural Netw Learn Syst. 2015 Aug;26(8):1645-58. doi: 10.1109/TNNLS.2014.2350835. Epub 2014 Oct 8.

引用本文的文献

Event-Trigger Reinforcement Learning-Based Coordinate Control of Modular Unmanned System via Nonzero-Sum Game.基于事件触发强化学习的模块化无人系统非零和博弈坐标控制

Sensors (Basel). 2025 Jan 7;25(2):314. doi: 10.3390/s25020314.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

模块化机器人操纵器非零和微分对策的障碍-批评家-干扰近似最优控制

Barrier-critic-disturbance approximate optimal control of nonzero-sum differential games for modular robot manipulators.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献