• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度强化学习的改进深度确定性策略梯度算法的机器人抓取方法优化

Robot grasping method optimization using improved deep deterministic policy gradient algorithm of deep reinforcement learning.

作者信息

Zhang Hongxu, Wang Fei, Wang Jianhui, Cui Ben

机构信息

College of Information Science and Engineering, Northeastern University, Shenyang, China.

Faculty of Robot Science and Engineering, Northeastern University, Shenyang, China.

出版信息

Rev Sci Instrum. 2021 Feb 1;92(2):025114. doi: 10.1063/5.0034101.

DOI:10.1063/5.0034101
PMID:33648152
Abstract

Robot grasping has become a very hot research field so that the requirements for robot operation are getting higher and higher. In previous research studies, the use of traditional target detection algorithms for grasping is often very inefficient, and this article is dedicated to improving the deep reinforcement learning algorithm to improve the grasping efficiency and solve the problem of robots dealing with the impact of unknown disturbances on grasping. Using the characteristic that deep reinforcement learning actively explores the unknown environment, a Gaussian parameter Deep Deterministic Policy Gradient (Gaussian-DDPG) algorithm based on the Importance-Weighted Autoencoder (IWAE) is proposed to realize the robot's autonomous learning of the grasping task. Traditional coordinate positioning methods and deep learning methods have poor grasping effects for disturbed situations (such as the movement of the target object). The IWAE algorithm is used to compress the high-dimensional information of the original visual input to the hidden space and pass it to the deep reinforcement learning network as part of the state value. Based on the classic DDPG algorithm, it smoothly adds Gaussian parameters to improve the exploratory nature of the algorithm, dynamically sets the robot grasping space parameters to adapt to the workspace of multiple scales, and finally, realizes the accurate grasping of the robot. Relying on the possible position information deviation of the visual information, the control of the grasping position by the manipulator torque information is further optimized to improve the grasping efficiency of disturbed objects.

摘要

机器人抓取已成为一个非常热门的研究领域,以至于对机器人操作的要求越来越高。在以往的研究中,使用传统目标检测算法进行抓取往往效率很低,本文致力于改进深度强化学习算法,以提高抓取效率,并解决机器人应对未知干扰对抓取影响的问题。利用深度强化学习主动探索未知环境的特性,提出了一种基于重要性加权自编码器(IWAE)的高斯参数深度确定性策略梯度(Gaussian-DDPG)算法,以实现机器人对抓取任务的自主学习。传统的坐标定位方法和深度学习方法在受干扰情况(如目标物体移动)下的抓取效果较差。IWAE算法用于将原始视觉输入的高维信息压缩到隐藏空间,并作为状态值的一部分传递给深度强化学习网络。基于经典的DDPG算法,它平滑地添加高斯参数以提高算法的探索性,动态设置机器人抓取空间参数以适应多尺度工作空间,最终实现机器人的精确抓取。依靠视觉信息可能存在的位置信息偏差,进一步优化利用机械手扭矩信息对抓取位置的控制,以提高对受干扰物体的抓取效率。

相似文献

1
Robot grasping method optimization using improved deep deterministic policy gradient algorithm of deep reinforcement learning.基于深度强化学习的改进深度确定性策略梯度算法的机器人抓取方法优化
Rev Sci Instrum. 2021 Feb 1;92(2):025114. doi: 10.1063/5.0034101.
2
SLP-Improved DDPG Path-Planning Algorithm for Mobile Robot in Large-Scale Dynamic Environment.基于 SLP 的改进 DDPG 路径规划算法在大规模动态环境下的移动机器人应用
Sensors (Basel). 2023 Mar 28;23(7):3521. doi: 10.3390/s23073521.
3
The Path Planning of Mobile Robot by Neural Networks and Hierarchical Reinforcement Learning.基于神经网络和分层强化学习的移动机器人路径规划
Front Neurorobot. 2020 Oct 2;14:63. doi: 10.3389/fnbot.2020.00063. eCollection 2020.
4
Efficient Path Planning for Mobile Robot Based on Deep Deterministic Policy Gradient.基于深度确定性策略梯度的移动机器人高效路径规划。
Sensors (Basel). 2022 May 8;22(9):3579. doi: 10.3390/s22093579.
5
A parallel heterogeneous policy deep reinforcement learning algorithm for bipedal walking motion design.一种用于双足步行运动设计的并行异构策略深度强化学习算法。
Front Neurorobot. 2023 Aug 8;17:1205775. doi: 10.3389/fnbot.2023.1205775. eCollection 2023.
6
A method for robotic grasping based on improved Gaussian mixture model.基于改进高斯混合模型的机器人抓取方法。
Math Biosci Eng. 2019 Dec 2;17(2):1495-1510. doi: 10.3934/mbe.2020077.
7
Model predictive control for constrained robot manipulator visual servoing tuned by reinforcement learning.基于强化学习整定的约束机器人视觉伺服模型预测控制。
Math Biosci Eng. 2023 Apr 10;20(6):10495-10513. doi: 10.3934/mbe.2023463.
8
Multi-robot task allocation in e-commerce RMFS based on deep reinforcement learning.基于深度强化学习的电子商务 RMFS 中的多机器人任务分配。
Math Biosci Eng. 2023 Jan;20(2):1903-1918. doi: 10.3934/mbe.2023087. Epub 2022 Nov 8.
9
An enhanced deep deterministic policy gradient algorithm for intelligent control of robotic arms.一种用于机器人手臂智能控制的增强型深度确定性策略梯度算法。
Front Neuroinform. 2023 Jan 23;17:1096053. doi: 10.3389/fninf.2023.1096053. eCollection 2023.
10
A Multitasking-Oriented Robot Arm Motion Planning Scheme Based on Deep Reinforcement Learning and Twin Synchro-Control.基于深度强化学习和双同步控制的面向多任务的机械臂运动规划方案。
Sensors (Basel). 2020 Jun 21;20(12):3515. doi: 10.3390/s20123515.

引用本文的文献

1
Deep deterministic policy gradient algorithm: A systematic review.深度确定性策略梯度算法:一项系统综述。
Heliyon. 2024 May 7;10(9):e30697. doi: 10.1016/j.heliyon.2024.e30697. eCollection 2024 May 15.