Multi-agent deep reinforcement learning-based robotic arm assembly research.

Authors

Cao Guohua, Bai Jimeng

Affiliation

School of Mechanical and Electrical Engineering, Changchun University of Science and Technology, Changchun, China.

Publication information

PLoS One. 2025 Feb 18;20(2):e0311550. doi: 10.1371/journal.pone.0311550. eCollection 2025.

DOI: 10.1371/journal.pone.0311550
PMID: 39965012
Abstract

Because application scenarios are complex and variable and assembly demands keep increasing, single-agent algorithms often struggle to converge and perform poorly in robotic arm assembly. To address these issues, this paper proposes a multi-agent reinforcement learning method for the shaft-hole assembly of robotic arms, with a specific focus on square shaft-hole assemblies. First, we analyze the hole-seeking, alignment, and insertion stages of the shaft-hole assembly process, based on a comprehensive study of the interactions between shafts and holes. Next, a reward function is designed by integrating the decoupled multi-agent deep deterministic policy gradient (DMDDPG) algorithm. Finally, a simulation environment is created in Gazebo, using circular and square shaft-holes as experimental subjects to model the robotic arm's shaft-hole assembly. The simulation results indicate that the proposed algorithm, which models the first three joints and the last three joints of the robotic arm as separate agents, demonstrates not only enhanced adaptability but also faster and more stable convergence.
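The abstract does not give the reward function or the network details, so the following is only a minimal sketch of the two structural ideas it names: a reward that depends on the assembly stage (hole-seeking, alignment, insertion) and a six-joint command split between two agents (joints 1-3 and joints 4-6). The `PegState` fields, the tolerance constants, and the reward values are all hypothetical and chosen for illustration; they are not taken from the paper.

```python
from dataclasses import dataclass

# Hypothetical tolerances (not from the paper).
LATERAL_TOL = 0.002   # metres: peg axis close enough to the hole axis
TILT_TOL = 0.02       # radians: peg axis close enough to parallel
TARGET_DEPTH = 0.03   # metres: full insertion depth


@dataclass
class PegState:
    lateral_error: float  # in-plane distance between peg and hole axes (m)
    tilt_error: float     # angle between peg and hole axes (rad)
    depth: float          # current insertion depth (m)


def stage(state: PegState) -> str:
    """Classify the current assembly stage from the geometric errors."""
    if state.lateral_error > LATERAL_TOL:
        return "hole_seeking"
    if state.tilt_error > TILT_TOL:
        return "alignment"
    return "insertion"


def reward(state: PegState) -> float:
    """Stage-dependent reward; for small tilt errors, later stages score higher."""
    current = stage(state)
    if current == "hole_seeking":
        return -state.lateral_error          # penalise distance to the hole
    if current == "alignment":
        return 1.0 - state.tilt_error        # penalise residual tilt
    progress = min(state.depth, TARGET_DEPTH) / TARGET_DEPTH
    return 2.0 + progress                    # reward insertion progress


def split_action(joint_cmd):
    """Split a 6-joint command into the parts owned by each agent."""
    if len(joint_cmd) != 6:
        raise ValueError("expected commands for exactly 6 joints")
    return list(joint_cmd[:3]), list(joint_cmd[3:])
```

In a setup like this, one agent would output commands for the first three joints and the other for the last three, while both are trained against a shared stage-dependent reward. Whether the paper shares the reward in this way is not stated in the abstract.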

