• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过强化学习利用可重构机器人肢体实现运动与操作的衔接

Bridging Locomotion and Manipulation Using Reconfigurable Robotic Limbs via Reinforcement Learning.

作者信息

Sun Haoran, Yang Linhan, Gu Yuping, Pan Jia, Wan Fang, Song Chaoyang

机构信息

Department of Mechanical and Energy Engineering, Southern University of Science and Technology, Shenzhen 518055, China.

Department of Computer Science, The University of Hong Kong, Hong Kong SAR, China.

出版信息

Biomimetics (Basel). 2023 Aug 14;8(4):364. doi: 10.3390/biomimetics8040364.

DOI:10.3390/biomimetics8040364
PMID:37622969
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10452096/
Abstract

Locomotion and manipulation are two essential skills in robotics but are often divided or decoupled into two separate problems. It is widely accepted that the topological duality between multi-legged locomotion and multi-fingered manipulation shares an intrinsic model. However, a lack of research remains to identify the data-driven evidence for further research. This paper explores a unified formulation of the loco-manipulation problem using reinforcement learning (RL) by reconfiguring robotic limbs with an overconstrained design into multi-legged and multi-fingered robots. Such design reconfiguration allows for adopting a co-training architecture for reinforcement learning towards a unified loco-manipulation policy. As a result, we find data-driven evidence to support the transferability between locomotion and manipulation skills using a single RL policy with a multilayer perceptron or graph neural network. We also demonstrate the Sim2Real transfer of the learned loco-manipulation skills in a robotic prototype. This work expands the knowledge frontiers on loco-manipulation transferability with learning-based evidence applied in a novel platform with overconstrained robotic limbs.

摘要

运动和操作是机器人技术中的两项基本技能,但它们常常被分开或解耦为两个独立的问题。人们普遍认为,多足运动和多指操作之间的拓扑对偶性共享一个内在模型。然而,仍缺乏研究来确定进一步研究的数据驱动证据。本文通过将具有过约束设计的机器人肢体重新配置为多足和多指机器人,探索了一种使用强化学习(RL)的运动-操作问题的统一表述。这种设计重新配置允许采用一种协同训练架构进行强化学习,以实现统一的运动-操作策略。结果,我们找到了数据驱动的证据,支持使用具有多层感知器或图神经网络的单一强化学习策略在运动和操作技能之间进行可迁移性。我们还在一个机器人原型中展示了所学运动-操作技能从模拟到现实的迁移。这项工作利用应用于具有过约束机器人肢体的新型平台上的基于学习的证据,扩展了运动-操作可迁移性的知识前沿。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c318/10452096/565d1ded9d66/biomimetics-08-00364-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c318/10452096/d5dffaee28be/biomimetics-08-00364-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c318/10452096/3cd95922ef61/biomimetics-08-00364-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c318/10452096/6b5344aecbf2/biomimetics-08-00364-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c318/10452096/a04f2fcecf5e/biomimetics-08-00364-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c318/10452096/565d1ded9d66/biomimetics-08-00364-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c318/10452096/d5dffaee28be/biomimetics-08-00364-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c318/10452096/3cd95922ef61/biomimetics-08-00364-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c318/10452096/6b5344aecbf2/biomimetics-08-00364-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c318/10452096/a04f2fcecf5e/biomimetics-08-00364-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c318/10452096/565d1ded9d66/biomimetics-08-00364-g005.jpg

相似文献

1
Bridging Locomotion and Manipulation Using Reconfigurable Robotic Limbs via Reinforcement Learning.通过强化学习利用可重构机器人肢体实现运动与操作的衔接
Biomimetics (Basel). 2023 Aug 14;8(4):364. doi: 10.3390/biomimetics8040364.
2
Dexterous Manipulation for Multi-Fingered Robotic Hands With Reinforcement Learning: A Review.基于强化学习的多指机器人手灵巧操作综述
Front Neurorobot. 2022 Apr 25;16:861825. doi: 10.3389/fnbot.2022.861825. eCollection 2022.
3
Multi-expert synthesis for versatile locomotion and manipulation skills.用于通用运动和操作技能的多专家合成
Front Robot AI. 2022 Sep 28;9:970890. doi: 10.3389/frobt.2022.970890. eCollection 2022.
4
Variational Information Bottleneck Regularized Deep Reinforcement Learning for Efficient Robotic Skill Adaptation.变分信息瓶颈正则化深度强化学习在机器人高效技能自适应中的应用。
Sensors (Basel). 2023 Jan 9;23(2):762. doi: 10.3390/s23020762.
5
Modelling and Control of a Reconfigurable Robot for Achieving Reconfiguration and Locomotion with Different Shapes.具有不同形状的可重构机器人的建模与控制,以实现重构和运动。
Sensors (Basel). 2021 Aug 9;21(16):5362. doi: 10.3390/s21165362.
6
Scaling Up Soft Robotics: A Meter-Scale, Modular, and Reconfigurable Soft Robotic System.扩大软机器人技术规模:一种米级、模块化且可重构的软机器人系统。
Soft Robot. 2022 Apr;9(2):324-336. doi: 10.1089/soro.2020.0123. Epub 2021 Mar 25.
7
Vision-Based Efficient Robotic Manipulation with a Dual-Streaming Compact Convolutional Transformer.基于双通道紧凑型卷积 Transformer 的基于视觉的高效机器人操作
Sensors (Basel). 2023 Jan 3;23(1):515. doi: 10.3390/s23010515.
8
A Survey on Deep Reinforcement Learning Algorithms for Robotic Manipulation.机器人操作的深度强化学习算法研究综述。
Sensors (Basel). 2023 Apr 5;23(7):3762. doi: 10.3390/s23073762.
9
Review of Learning-Based Robotic Manipulation in Cluttered Environments.基于学习的杂乱环境机器人操作综述。
Sensors (Basel). 2022 Oct 18;22(20):7938. doi: 10.3390/s22207938.
10
Coverage Path Planning Using Reinforcement Learning-Based TSP for hTetran-A Polyabolo-Inspired Self-Reconfigurable Tiling Robot.基于强化学习的旅行商问题算法在hTetran - A多角形启发的自重构平铺机器人覆盖路径规划中的应用
Sensors (Basel). 2021 Apr 7;21(8):2577. doi: 10.3390/s21082577.

引用本文的文献

1
Leveraging imitation learning in agricultural robotics: a comprehensive survey and comparative analysis.农业机器人中模仿学习的应用:全面综述与比较分析
Front Robot AI. 2024 Oct 17;11:1441312. doi: 10.3389/frobt.2024.1441312. eCollection 2024.
2
Enhancing the Efficiency of a Cybersecurity Operations Center Using Biomimetic Algorithms Empowered by Deep Q-Learning.利用深度Q学习赋能的仿生算法提高网络安全运营中心的效率
Biomimetics (Basel). 2024 May 21;9(6):307. doi: 10.3390/biomimetics9060307.

本文引用的文献

1
Learning agile and dynamic motor skills for legged robots.学习用于腿部机器人的敏捷和动态运动技能。
Sci Robot. 2019 Jan 16;4(26). doi: 10.1126/scirobotics.aau5872.
2
Learning quadrupedal locomotion over challenging terrain.学习在具有挑战性的地形上进行四足运动。
Sci Robot. 2020 Oct 21;5(47). doi: 10.1126/scirobotics.abc5986.