• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用于生物启发机器人强化学习的仿真到真实迁移技术综述。

A Survey of Sim-to-Real Transfer Techniques Applied to Reinforcement Learning for Bioinspired Robots.

出版信息

IEEE Trans Neural Netw Learn Syst. 2023 Jul;34(7):3444-3459. doi: 10.1109/TNNLS.2021.3112718. Epub 2023 Jul 6.

DOI:10.1109/TNNLS.2021.3112718
PMID:34587101
Abstract

The state-of-the-art reinforcement learning (RL) techniques have made innumerable advancements in robot control, especially in combination with deep neural networks (DNNs), known as deep reinforcement learning (DRL). In this article, instead of reviewing the theoretical studies on RL, which were almost fully completed several decades ago, we summarize some state-of-the-art techniques added to commonly used RL frameworks for robot control. We mainly review bioinspired robots (BIRs) because they can learn to locomote or produce natural behaviors similar to animals and humans. With the ultimate goal of practical applications in real world, we further narrow our review scope to techniques that could aid in sim-to-real transfer. We categorized these techniques into four groups: 1) use of accurate simulators; 2) use of kinematic and dynamic models; 3) use of hierarchical and distributed controllers; and 4) use of demonstrations. The purposes of these four groups of techniques are to supply general and accurate environments for RL training, improve sampling efficiency, divide and conquer complex motion tasks and redundant robot structures, and acquire natural skills. We found that, by synthetically using these techniques, it is possible to deploy RL on physical BIRs in actuality.

摘要

最先进的强化学习 (RL) 技术在机器人控制方面取得了无数的进展,特别是与深度神经网络 (DNN) 结合使用时,称为深度强化学习 (DRL)。在本文中,我们没有回顾几十年前几乎已经完成的 RL 的理论研究,而是总结了一些添加到常用的机器人控制 RL 框架中的最先进技术。我们主要回顾生物启发机器人 (BIR),因为它们可以学习类似于动物和人类的运动或产生自然行为。考虑到实际应用的最终目标,我们进一步缩小了评论范围,重点介绍了有助于模拟到现实转移的技术。我们将这些技术分为四组:1)使用精确的模拟器;2)使用运动学和动力学模型;3)使用分层和分布式控制器;4)使用演示。这四组技术的目的是为 RL 培训提供通用和准确的环境,提高采样效率,分解和征服复杂的运动任务和冗余的机器人结构,并获得自然技能。我们发现,通过综合使用这些技术,有可能在实际的物理 BIR 上部署 RL。

相似文献

1
A Survey of Sim-to-Real Transfer Techniques Applied to Reinforcement Learning for Bioinspired Robots.应用于生物启发机器人强化学习的仿真到真实迁移技术综述。
IEEE Trans Neural Netw Learn Syst. 2023 Jul;34(7):3444-3459. doi: 10.1109/TNNLS.2021.3112718. Epub 2023 Jul 6.
2
RL-DOVS: Reinforcement Learning for Autonomous Robot Navigation in Dynamic Environments.RL-DOVS:动态环境下自主机器人导航的强化学习。
Sensors (Basel). 2022 May 19;22(10):3847. doi: 10.3390/s22103847.
3
Variational Information Bottleneck Regularized Deep Reinforcement Learning for Efficient Robotic Skill Adaptation.变分信息瓶颈正则化深度强化学习在机器人高效技能自适应中的应用。
Sensors (Basel). 2023 Jan 9;23(2):762. doi: 10.3390/s23020762.
4
Deep Q-network for social robotics using emotional social signals.利用情感社交信号的社交机器人深度Q网络。
Front Robot AI. 2022 Sep 26;9:880547. doi: 10.3389/frobt.2022.880547. eCollection 2022.
5
Learning-based control approaches for service robots on cloth manipulation and dressing assistance: a comprehensive review.基于学习的服务机器人布料操作和穿衣辅助控制方法:全面综述。
J Neuroeng Rehabil. 2022 Nov 3;19(1):117. doi: 10.1186/s12984-022-01078-4.
6
Energy-efficient and damage-recovery slithering gait design for a snake-like robot based on reinforcement learning and inverse reinforcement learning.基于强化学习和逆强化学习的蛇形机器人节能与损伤恢复蠕动步态设计。
Neural Netw. 2020 Sep;129:323-333. doi: 10.1016/j.neunet.2020.05.029. Epub 2020 Jun 16.
7
Human-Guided Reinforcement Learning With Sim-to-Real Transfer for Autonomous Navigation.用于自主导航的基于人引导强化学习的模拟到现实迁移
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14745-14759. doi: 10.1109/TPAMI.2023.3314762. Epub 2023 Nov 3.
8
Emergence of integrated behaviors through direct optimization for homeostasis.通过直接优化实现体内平衡来产生综合行为。
Neural Netw. 2024 Sep;177:106379. doi: 10.1016/j.neunet.2024.106379. Epub 2024 May 8.
9
Control of Magnetic Surgical Robots With Model-Based Simulators and Reinforcement Learning.基于模型的模拟器和强化学习对磁性手术机器人的控制
IEEE Trans Med Robot Bionics. 2022 Nov;4(4):945-956. doi: 10.1109/tmrb.2022.3214426. Epub 2022 Oct 12.
10
Mobile Robot Application with Hierarchical Start Position DQN.分层起始位置 DQN 的移动机器人应用。
Comput Intell Neurosci. 2022 Sep 5;2022:4115767. doi: 10.1155/2022/4115767. eCollection 2022.

引用本文的文献

1
Bridging the Gap to Bionic Motion: Challenges in Legged Robot Limb Unit Design, Modeling, and Control.弥合与仿生运动的差距:有腿机器人肢体单元设计、建模与控制中的挑战
Cyborg Bionic Syst. 2025 Aug 19;6:0365. doi: 10.34133/cbsystems.0365. eCollection 2025.
2
Motor synergy and energy efficiency emerge in whole-body locomotion learning.运动协同和能量效率在全身运动学习中显现出来。
Sci Rep. 2025 Jan 3;15(1):712. doi: 10.1038/s41598-024-82472-x.
3
Swimtrans Net: a multimodal robotic system for swimming action recognition driven via Swin-Transformer.
Swimtrans网络:一种通过Swin Transformer驱动的用于游泳动作识别的多模态机器人系统。
Front Neurorobot. 2024 Sep 24;18:1452019. doi: 10.3389/fnbot.2024.1452019. eCollection 2024.
4
Stable Jumping Control Based on Deep Reinforcement Learning for a Locust-Inspired Robot.基于深度强化学习的仿蝗虫机器人稳定跳跃控制
Biomimetics (Basel). 2024 Sep 11;9(9):548. doi: 10.3390/biomimetics9090548.
5
Dexterous Manipulation for Multi-Fingered Robotic Hands With Reinforcement Learning: A Review.基于强化学习的多指机器人手灵巧操作综述
Front Neurorobot. 2022 Apr 25;16:861825. doi: 10.3389/fnbot.2022.861825. eCollection 2022.