• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过基于图的事后诸葛亮目标生成实现复杂机器人操作

Complex Robotic Manipulation via Graph-Based Hindsight Goal Generation.

作者信息

Bing Zhenshan, Brucker Matthias, Morin Fabrice O, Li Rui, Su Xiaojie, Huang Kai, Knoll Alois

出版信息

IEEE Trans Neural Netw Learn Syst. 2022 Dec;33(12):7863-7876. doi: 10.1109/TNNLS.2021.3088947. Epub 2022 Nov 30.

DOI:10.1109/TNNLS.2021.3088947
PMID:34181552
Abstract

Reinforcement learning algorithms, such as hindsight experience replay (HER) and hindsight goal generation (HGG), have been able to solve challenging robotic manipulation tasks in multigoal settings with sparse rewards. HER achieves its training success through hindsight replays of past experience with heuristic goals but underperforms in challenging tasks in which goals are difficult to explore. HGG enhances HER by selecting intermediate goals that are easy to achieve in the short term and promising to lead to target goals in the long term. This guided exploration makes HGG applicable to tasks in which target goals are far away from the object's initial position. However, the vanilla HGG is not applicable to manipulation tasks with obstacles because the Euclidean metric used for HGG is not an accurate distance metric in such an environment. Although, with the guidance of a handcrafted distance grid, grid-based HGG can solve manipulation tasks with obstacles, a more feasible method that can solve such tasks automatically is still in demand. In this article, we propose graph-based hindsight goal generation (G-HGG), an extension of HGG selecting hindsight goals based on shortest distances in an obstacle-avoiding graph, which is a discrete representation of the environment. We evaluated G-HGG on four challenging manipulation tasks with obstacles, where significant enhancements in both sample efficiency and overall success rate are shown over HGG and HER. Videos can be viewed at https://videoviewsite.wixsite.com/ghgg.

摘要

强化学习算法,如indsight经验回放(HER)和indsight目标生成(HGG),已经能够在稀疏奖励的多目标设置中解决具有挑战性的机器人操作任务。HER通过对具有启发式目标的过去经验进行indsight回放来实现训练成功,但在目标难以探索的具有挑战性的任务中表现不佳。HGG通过选择在短期内易于实现且有望在长期内导向目标目标的中间目标来增强HER。这种有指导的探索使得HGG适用于目标目标远离物体初始位置的任务。然而,普通的HGG不适用于有障碍物的操作任务,因为用于HGG的欧几里得度量在这样的环境中不是准确的距离度量。虽然在手工制作的距离网格的指导下,基于网格的HGG可以解决有障碍物的操作任务,但仍然需要一种更可行的能够自动解决此类任务的方法。在本文中,我们提出了基于图的indsight目标生成(G-HGG),它是HGG的扩展,基于避障图中的最短距离选择indsight目标,避障图是环境的离散表示。我们在四个具有挑战性的有障碍物的操作任务上评估了G-HGG,结果表明它在样本效率和总体成功率方面都比HGG和HER有显著提高。视频可在https://videoviewsite.wixsite.com/ghgg观看。

相似文献

1
Complex Robotic Manipulation via Graph-Based Hindsight Goal Generation.通过基于图的事后诸葛亮目标生成实现复杂机器人操作
IEEE Trans Neural Netw Learn Syst. 2022 Dec;33(12):7863-7876. doi: 10.1109/TNNLS.2021.3088947. Epub 2022 Nov 30.
2
Robotic Manipulation in Dynamic Scenarios via Bounding-Box-Based Hindsight Goal Generation.通过基于边界框的事后目标生成实现动态场景中的机器人操作。
IEEE Trans Neural Netw Learn Syst. 2023 Aug;34(8):5037-5050. doi: 10.1109/TNNLS.2021.3124366. Epub 2023 Aug 4.
3
Addressing Hindsight Bias in Multigoal Reinforcement Learning.解决多目标强化学习中的事后诸葛亮偏差
IEEE Trans Cybern. 2023 Jan;53(1):392-405. doi: 10.1109/TCYB.2021.3107202. Epub 2022 Dec 23.
4
Curriculum learning with Hindsight Experience Replay for sequential object manipulation tasks.基于后见之明经验回放的序贯物体操作任务课程学习。
Neural Netw. 2022 Jan;145:260-270. doi: 10.1016/j.neunet.2021.10.011. Epub 2021 Oct 21.
5
Sampling Rate Decay in Hindsight Experience Replay for Robot Control.事后经验回放中机器人控制的采样率衰减。
IEEE Trans Cybern. 2022 Mar;52(3):1515-1526. doi: 10.1109/TCYB.2020.2990722. Epub 2022 Mar 11.
6
Progressively Learning to Reach Remote Goals by Continuously Updating Boundary Goals.通过不断更新边界目标逐步学习以实现远程目标。
IEEE Trans Neural Netw Learn Syst. 2025 May;36(5):9238-9252. doi: 10.1109/TNNLS.2024.3428323. Epub 2025 May 2.
7
AHEGC: Adaptive Hindsight Experience Replay With Goal-Amended Curiosity Module for Robot Control.AHEGC:用于机器人控制的带目标修正好奇心模块的自适应后见经验回放
IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):16602-16615. doi: 10.1109/TNNLS.2023.3296765. Epub 2024 Oct 29.
8
Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor-Critic with Hindsight Experience Replay.使用深度强化学习的多臂机械臂路径规划:带有后见之明经验回放的软动作-评论家。
Sensors (Basel). 2020 Oct 19;20(20):5911. doi: 10.3390/s20205911.
9
Learning robotic manipulation skills with multiple semantic goals by conservative curiosity-motivated exploration.通过保守的好奇心驱动探索学习具有多个语义目标的机器人操作技能。
Front Neurorobot. 2023 Mar 7;17:1089270. doi: 10.3389/fnbot.2023.1089270. eCollection 2023.
10
Representation Learning and Reinforcement Learning for Dynamic Complex Motion Planning System.用于动态复杂运动规划系统的表征学习与强化学习
IEEE Trans Neural Netw Learn Syst. 2024 Aug;35(8):11049-11063. doi: 10.1109/TNNLS.2023.3247160. Epub 2024 Aug 5.

引用本文的文献

1
A Gaussian convolutional optimization algorithm with tent chaotic mapping.一种带有帐篷混沌映射的高斯卷积优化算法。
Sci Rep. 2024 Dec 28;14(1):31027. doi: 10.1038/s41598-024-82277-y.
2
A reinforcement learning enhanced pseudo-inverse approach to self-collision avoidance of redundant robots.一种用于冗余机器人自碰撞避免的强化学习增强伪逆方法。
Front Neurorobot. 2024 Mar 28;18:1375309. doi: 10.3389/fnbot.2024.1375309. eCollection 2024.
3
Human skill knowledge guided global trajectory policy reinforcement learning method.人类技能知识引导的全局轨迹策略强化学习方法。
Front Neurorobot. 2024 Mar 15;18:1368243. doi: 10.3389/fnbot.2024.1368243. eCollection 2024.
4
Motion planning framework based on dual-agent DDPG method for dual-arm robots guided by human joint angle constraints.基于双智能体深度确定性策略梯度(DDPG)方法的双臂机器人运动规划框架,由人体关节角度约束引导。
Front Neurorobot. 2024 Feb 22;18:1362359. doi: 10.3389/fnbot.2024.1362359. eCollection 2024.
5
Peg-in-hole assembly skill imitation learning method based on ProMPs under task geometric representation.基于任务几何表示的ProMPs的插销入孔装配技能模仿学习方法
Front Neurorobot. 2023 Nov 9;17:1320251. doi: 10.3389/fnbot.2023.1320251. eCollection 2023.
6
A perspective on large-scale simulation as an enabler for novel biorobotics applications.关于大规模模拟作为新型生物机器人应用推动者的观点。
Front Robot AI. 2023 Aug 25;10:1102286. doi: 10.3389/frobt.2023.1102286. eCollection 2023.
7
American Sign Language Translation Using Wearable Inertial and Electromyography Sensors for Tracking Hand Movements and Facial Expressions.使用可穿戴惯性和肌电图传感器进行美国手语翻译,以跟踪手部动作和面部表情。
Front Neurosci. 2022 Jul 19;16:962141. doi: 10.3389/fnins.2022.962141. eCollection 2022.