• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

学习用于腿部机器人的敏捷和动态运动技能。

Learning agile and dynamic motor skills for legged robots.

机构信息

Robotic Systems Lab, ETH Zurich, Zurich, Switzerland.

Intelligent Systems Lab, Intel, Munich, Germany.

出版信息

Sci Robot. 2019 Jan 16;4(26). doi: 10.1126/scirobotics.aau5872.

DOI:10.1126/scirobotics.aau5872
PMID:33137755
Abstract

Legged robots pose one of the greatest challenges in robotics. Dynamic and agile maneuvers of animals cannot be imitated by existing methods that are crafted by humans. A compelling alternative is reinforcement learning, which requires minimal craftsmanship and promotes the natural evolution of a control policy. However, so far, reinforcement learning research for legged robots is mainly limited to simulation, and only few and comparably simple examples have been deployed on real systems. The primary reason is that training with real robots, particularly with dynamically balancing systems, is complicated and expensive. In the present work, we introduce a method for training a neural network policy in simulation and transferring it to a state-of-the-art legged system, thereby leveraging fast, automated, and cost-effective data generation schemes. The approach is applied to the ANYmal robot, a sophisticated medium-dog-sized quadrupedal system. Using policies trained in simulation, the quadrupedal machine achieves locomotion skills that go beyond what had been achieved with prior methods: ANYmal is capable of precisely and energy-efficiently following high-level body velocity commands, running faster than before, and recovering from falling even in complex configurations.

摘要

腿式机器人是机器人领域面临的最大挑战之一。现有的机器人技术无法模仿动物的动态和敏捷动作,这些动作是由人类精心设计的。一种引人注目的替代方法是强化学习,它需要最少的人工干预,并促进控制策略的自然进化。然而,到目前为止,腿式机器人的强化学习研究主要局限于模拟,只有少数相对简单的例子被部署在真实系统上。主要原因是使用真实机器人进行训练,特别是使用动态平衡系统进行训练,既复杂又昂贵。在本工作中,我们提出了一种在模拟中训练神经网络策略并将其转移到最先进的腿式系统的方法,从而利用快速、自动化和具有成本效益的数据生成方案。该方法应用于 ANYmal 机器人,这是一种复杂的中犬型四足系统。使用在模拟中训练的策略,四足机器人实现了超越以前方法的运动技能:ANYmal 能够精确、节能地跟踪高级身体速度指令,比以前跑得更快,即使在复杂的配置中也能从跌倒中恢复。

相似文献

1
Learning agile and dynamic motor skills for legged robots.学习用于腿部机器人的敏捷和动态运动技能。
Sci Robot. 2019 Jan 16;4(26). doi: 10.1126/scirobotics.aau5872.
2
Dynamic Fall Recovery Control for Legged Robots via Reinforcement Learning.基于强化学习的有腿机器人动态跌倒恢复控制
Biomimetics (Basel). 2024 Mar 22;9(4):193. doi: 10.3390/biomimetics9040193.
3
ANYmal parkour: Learning agile navigation for quadrupedal robots.ANYmal 跑酷:学习四足机器人的敏捷导航。
Sci Robot. 2024 Mar 13;9(88):eadi7566. doi: 10.1126/scirobotics.adi7566.
4
Multi-expert synthesis for versatile locomotion and manipulation skills.用于通用运动和操作技能的多专家合成
Front Robot AI. 2022 Sep 28;9:970890. doi: 10.3389/frobt.2022.970890. eCollection 2022.
5
Generic Neural Locomotion Control Framework for Legged Robots.腿足机器人通用神经运动控制框架。
IEEE Trans Neural Netw Learn Syst. 2021 Sep;32(9):4013-4025. doi: 10.1109/TNNLS.2020.3016523. Epub 2021 Aug 31.
6
Learning quadrupedal locomotion over challenging terrain.学习在具有挑战性的地形上进行四足运动。
Sci Robot. 2020 Oct 21;5(47). doi: 10.1126/scirobotics.abc5986.
7
Hybrid learning mechanisms under a neural control network for various walking speed generation of a quadruped robot.用于四足机器人各种行走速度生成的神经控制网络下的混合学习机制。
Neural Netw. 2023 Oct;167:292-308. doi: 10.1016/j.neunet.2023.08.030. Epub 2023 Aug 26.
8
Hybrid Bipedal Locomotion Based on Reinforcement Learning and Heuristics.基于强化学习和启发式算法的混合双足运动
Micromachines (Basel). 2022 Oct 7;13(10):1688. doi: 10.3390/mi13101688.
9
On the biomimetic design of agile-robot legs.仿生设计的敏捷机器人腿。
Sensors (Basel). 2011;11(12):11305-34. doi: 10.3390/s111211305. Epub 2011 Nov 28.
10
Convergent evolution and locomotion through complex terrain by insects, vertebrates and robots.昆虫、脊椎动物和机器人在复杂地形中的趋同进化与运动
Arthropod Struct Dev. 2004 Jul;33(3):361-79. doi: 10.1016/j.asd.2004.05.001.

引用本文的文献

1
Bridging the Gap to Bionic Motion: Challenges in Legged Robot Limb Unit Design, Modeling, and Control.弥合与仿生运动的差距:有腿机器人肢体单元设计、建模与控制中的挑战
Cyborg Bionic Syst. 2025 Aug 19;6:0365. doi: 10.34133/cbsystems.0365. eCollection 2025.
2
Joints with angle dependent damping can help to reduce impact forces in robots.具有角度依赖性阻尼的关节有助于减少机器人中的冲击力。
Sci Rep. 2025 Aug 5;15(1):28578. doi: 10.1038/s41598-025-13055-7.
3
Robust Visuomotor Control for Humanoid Loco-Manipulation Using Hybrid Reinforcement Learning.
基于混合强化学习的仿人机器人运动操纵的稳健视觉运动控制
Biomimetics (Basel). 2025 Jul 17;10(7):469. doi: 10.3390/biomimetics10070469.
4
An Efficient Closed-Loop Adaptive Controller for a Small-Sized Quadruped Robotic Rat.一种用于小型四足机器人鼠的高效闭环自适应控制器。
Cyborg Bionic Syst. 2024 May 27;5:0096. doi: 10.34133/cbsystems.0096. eCollection 2024.
5
Hierarchical reinforcement learning with central pattern generator for enabling a quadruped robot simulator to walk on a variety of terrains.用于使四足机器人模拟器能够在各种地形上行走的基于中枢模式发生器的分层强化学习。
Sci Rep. 2025 Apr 2;15(1):11262. doi: 10.1038/s41598-025-94163-2.
6
Motion Hologram: Jointly optimized hologram generation and motion planning for photorealistic 3D displays via reinforcement learning.动态全息图:通过强化学习实现用于逼真3D显示的联合优化全息图生成与运动规划。
Sci Adv. 2025 Jan 31;11(5):eads9876. doi: 10.1126/sciadv.ads9876. Epub 2025 Jan 29.
7
Foot trajectory as a key factor for diverse gait patterns in quadruped robot locomotion.足部轨迹作为四足机器人运动中多种步态模式的关键因素。
Sci Rep. 2025 Jan 13;15(1):1861. doi: 10.1038/s41598-024-84060-5.
8
Standing balance of single-legged hopping robot model using reinforcement learning approach in the presence of external disturbances.基于强化学习方法的单腿跳跃机器人模型在存在外部干扰情况下的站立平衡。
Sci Rep. 2024 Dec 30;14(1):32036. doi: 10.1038/s41598-024-83749-x.
9
Exploring spiking neural networks for deep reinforcement learning in robotic tasks.探索用于机器人任务中深度强化学习的脉冲神经网络。
Sci Rep. 2024 Dec 28;14(1):30648. doi: 10.1038/s41598-024-77779-8.
10
ARS: AI-Driven Recovery Controller for Quadruped Robot Using Single-Network Model.ARS:基于单网络模型的四足机器人人工智能驱动恢复控制器
Biomimetics (Basel). 2024 Dec 10;9(12):749. doi: 10.3390/biomimetics9120749.