• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于共识算法的机器人操作分布式神经网络训练

Distributed Neural Networks Training for Robotic Manipulation With Consensus Algorithm.

作者信息

Liu Wenxing, Niu Hanlin, Jang Inmo, Herrmann Guido, Carrasco Joaquin

出版信息

IEEE Trans Neural Netw Learn Syst. 2024 Feb;35(2):2732-2746. doi: 10.1109/TNNLS.2022.3191021. Epub 2024 Feb 5.

DOI:10.1109/TNNLS.2022.3191021
PMID:35853061
Abstract

In this article, we propose an algorithm that combines actor-critic-based off-policy method with consensus-based distributed training to deal with multiagent deep reinforcement learning problems. Specifically, convergence analysis of a consensus algorithm for a type of nonlinear system with a Lyapunov method is developed, and we use this result to analyze the convergence properties of the actor training parameters and the critic training parameters in our algorithm. Through the convergence analysis, it can be verified that all agents will converge to the same optimal model as the training time goes to infinity. To validate the implementation of our algorithm, a multiagent training framework is proposed to train each Universal Robot 5 (UR5) robot arm to reach the random target position. Finally, experiments are provided to demonstrate the effectiveness and feasibility of the proposed algorithm.

摘要

在本文中,我们提出了一种算法,该算法将基于演员-评论家的离策略方法与基于共识的分布式训练相结合,以处理多智能体深度强化学习问题。具体而言,利用李雅普诺夫方法对一类非线性系统的共识算法进行了收敛性分析,并用此结果分析了我们算法中演员训练参数和评论家训练参数的收敛特性。通过收敛性分析,可以验证随着训练时间趋于无穷大,所有智能体将收敛到相同的最优模型。为了验证我们算法的实现,提出了一个多智能体训练框架,用于训练每个通用机器人5(UR5)机器人手臂到达随机目标位置。最后,通过实验证明了所提算法的有效性和可行性。

相似文献

1
Distributed Neural Networks Training for Robotic Manipulation With Consensus Algorithm.基于共识算法的机器人操作分布式神经网络训练
IEEE Trans Neural Netw Learn Syst. 2024 Feb;35(2):2732-2746. doi: 10.1109/TNNLS.2022.3191021. Epub 2024 Feb 5.
2
Data-Based Optimal Consensus Control for Multiagent Systems With Policy Gradient Reinforcement Learning.基于数据的多智能体系统最优共识控制与策略梯度强化学习
IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3872-3883. doi: 10.1109/TNNLS.2021.3054685. Epub 2022 Aug 3.
3
Model-Free Reinforcement Learning for Fully Cooperative Consensus Problem of Nonlinear Multiagent Systems.用于非线性多智能体系统完全协作一致性问题的无模型强化学习
IEEE Trans Neural Netw Learn Syst. 2022 Apr;33(4):1482-1491. doi: 10.1109/TNNLS.2020.3042508. Epub 2022 Apr 4.
4
Data-Driven Optimal Bipartite Consensus Control for Second-Order Multiagent Systems via Policy Gradient Reinforcement Learning.基于策略梯度强化学习的二阶多智能体系统数据驱动最优二分共识控制
IEEE Trans Cybern. 2024 Jun;54(6):3468-3478. doi: 10.1109/TCYB.2023.3276797. Epub 2024 May 30.
5
Graph Soft Actor-Critic Reinforcement Learning for Large-Scale Distributed Multirobot Coordination.用于大规模分布式多机器人协调的图软演员-评论家强化学习
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):665-676. doi: 10.1109/TNNLS.2023.3329530. Epub 2025 Jan 7.
6
Combined control algorithm based on synchronous reinforcement learning for a self-balancing bicycle robot.基于同步强化学习的自平衡自行车机器人组合控制算法
ISA Trans. 2024 Feb;145:479-492. doi: 10.1016/j.isatra.2023.11.032. Epub 2023 Nov 23.
7
Adaptive Event-Triggered Bipartite Formation for Multiagent Systems via Reinforcement Learning.基于强化学习的多智能体系统自适应事件触发二分编队
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17817-17828. doi: 10.1109/TNNLS.2023.3309326. Epub 2024 Dec 2.
8
Target Tracking Control of a Biomimetic Underwater Vehicle Through Deep Reinforcement Learning.通过深度强化学习的仿生水下航行器目标跟踪控制。
IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3741-3752. doi: 10.1109/TNNLS.2021.3054402. Epub 2022 Aug 3.
9
Distributed deep reinforcement learning based on bi-objective framework for multi-robot formation.基于双目标框架的多机器人编队分布式深度强化学习
Neural Netw. 2024 Mar;171:61-72. doi: 10.1016/j.neunet.2023.11.063. Epub 2023 Dec 1.
10
Optimal Tracking Control of Nonlinear Multiagent Systems Using Internal Reinforce Q-Learning.
IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):4043-4055. doi: 10.1109/TNNLS.2021.3055761. Epub 2022 Aug 3.

引用本文的文献

1
A route planning for oil sample transportation based on improved A* algorithm.基于改进A*算法的油样运输路径规划
Sci Rep. 2023 Dec 12;13(1):22041. doi: 10.1038/s41598-023-49266-z.