• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于两个空间交替学习的机器人运动技能迁移

Robot Motor Skill Transfer With Alternate Learning in Two Spaces.

作者信息

Fu Jian, Teng Xiang, Cao Ce, Ju Zhaojie, Lou Ping

出版信息

IEEE Trans Neural Netw Learn Syst. 2021 Oct;32(10):4553-4564. doi: 10.1109/TNNLS.2020.3021530. Epub 2021 Oct 5.

DOI:10.1109/TNNLS.2020.3021530
PMID:32970599
Abstract

Recent research achievements in learning from demonstration (LfD) illustrate that the reinforcement learning is effective for the robots to improve their movement skills. The current challenge mainly remains in how to generate new robot motions automatically to perform new tasks, which have a similar preassigned performance indicator but are different from the demonstration tasks. To deal with the abovementioned issue, this article proposes a framework to represent the policy and conduct imitation learning and optimization for robot intelligent trajectory planning, based on the improved locally weighted regression (iLWR) and policy improvement with path integral by dual perturbation (PI-DP). Besides, the reward-guided weight searching and basis function's adaptive evolving are performed alternately in two spaces, i.e., the basis function space and the weight space, to deal with the abovementioned problem. The alternate learning process constructs a sequence of two-tuples that join the demonstration task and new one together for motor skill transfer, so that the robot gradually acquires motor skill, from the task similar to demonstration to dissimilar tasks with different performance metrics. Classical via-points trajectory planning experiments are performed with the SCARA manipulator, a 10-degree of freedom (DOF) planar, and the UR robot. These results show that the proposed method is not only feasible but also effective.

摘要

近期从示范中学习(LfD)的研究成果表明,强化学习对于机器人提高其运动技能是有效的。当前的挑战主要仍在于如何自动生成新的机器人运动以执行新任务,这些新任务具有类似的预先指定的性能指标,但与示范任务不同。为解决上述问题,本文提出了一个框架,用于基于改进的局部加权回归(iLWR)和通过双重扰动的路径积分进行策略改进(PI-DP)来表示策略并对机器人智能轨迹规划进行模仿学习和优化。此外,奖励引导的权重搜索和基函数的自适应演化在两个空间中交替进行,即基函数空间和权重空间,以处理上述问题。交替学习过程构建了一系列二元组,将示范任务和新任务连接在一起以进行运动技能转移,从而使机器人逐渐获得运动技能,从类似于示范的任务到具有不同性能指标的不相似任务。使用SCARA机械手、10自由度(DOF)平面机器人和UR机器人进行了经典的通过点轨迹规划实验。这些结果表明所提出的方法不仅可行而且有效。

相似文献

1
Robot Motor Skill Transfer With Alternate Learning in Two Spaces.基于两个空间交替学习的机器人运动技能迁移
IEEE Trans Neural Netw Learn Syst. 2021 Oct;32(10):4553-4564. doi: 10.1109/TNNLS.2020.3021530. Epub 2021 Oct 5.
2
Guided Stochastic Optimization for Motion Planning.用于运动规划的引导式随机优化
Front Robot AI. 2019 Nov 12;6:105. doi: 10.3389/frobt.2019.00105. eCollection 2019.
3
Human-robot skills transfer interfaces for a flexible surgical robot.用于灵活手术机器人的人机技能转移接口。
Comput Methods Programs Biomed. 2014 Sep;116(2):81-96. doi: 10.1016/j.cmpb.2013.12.015. Epub 2014 Jan 8.
4
A Framework for Composite Layup Skill Learning and Generalizing Through Teleoperation.一种通过遥操作进行复合材料铺层技能学习与泛化的框架。
Front Neurorobot. 2022 Feb 11;16:840240. doi: 10.3389/fnbot.2022.840240. eCollection 2022.
5
Generalize Robot Learning From Demonstration to Variant Scenarios With Evolutionary Policy Gradient.通过进化策略梯度将机器人从示范学习推广到不同场景。
Front Neurorobot. 2020 Apr 21;14:21. doi: 10.3389/fnbot.2020.00021. eCollection 2020.
6
Human skill knowledge guided global trajectory policy reinforcement learning method.人类技能知识引导的全局轨迹策略强化学习方法。
Front Neurorobot. 2024 Mar 15;18:1368243. doi: 10.3389/fnbot.2024.1368243. eCollection 2024.
7
Vision-Based Intelligent Perceiving and Planning System of a 7-DoF Collaborative Robot.基于视觉的 7 自由度协作机器人智能感知与规划系统。
Comput Intell Neurosci. 2021 Sep 14;2021:5810371. doi: 10.1155/2021/5810371. eCollection 2021.
8
Learning for a Robot: Deep Reinforcement Learning, Imitation Learning, Transfer Learning.机器人学习:深度强化学习、模仿学习、迁移学习。
Sensors (Basel). 2021 Feb 11;21(4):1278. doi: 10.3390/s21041278.
9
Peg-in-hole assembly skill imitation learning method based on ProMPs under task geometric representation.基于任务几何表示的ProMPs的插销入孔装配技能模仿学习方法
Front Neurorobot. 2023 Nov 9;17:1320251. doi: 10.3389/fnbot.2023.1320251. eCollection 2023.
10
Human-robot skill transmission for mobile robot via learning by demonstration.通过示范学习实现移动机器人的人机技能传递。
Neural Comput Appl. 2021 Sep 22:1-11. doi: 10.1007/s00521-021-06449-x.