• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用深度强化学习控制Solo12四足机器人。

Controlling the Solo12 quadruped robot with deep reinforcement learning.

作者信息

Aractingi Michel, Léziart Pierre-Alexandre, Flayols Thomas, Perez Julien, Silander Tomi, Souères Philippe

机构信息

LAAS-CNRS, Université de Toulouse, 31400, Toulouse, France.

NAVER LABS Europe, 38240, Meylan, France.

出版信息

Sci Rep. 2023 Jul 24;13(1):11945. doi: 10.1038/s41598-023-38259-7.

DOI:10.1038/s41598-023-38259-7
PMID:37488193
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10366154/
Abstract

Quadruped robots require robust and general locomotion skills to exploit their mobility potential in complex and challenging environments. In this work, we present an implementation of a robust end-to-end learning-based controller on the Solo12 quadruped. Our method is based on deep reinforcement learning of joint impedance references. The resulting control policies follow a commanded velocity reference while being efficient in its energy consumption and easy to deploy. We detail the learning procedure and method for transfer on the real robot. We show elaborate experiments. Finally, we present experimental results of the learned locomotion on various grounds indoors and outdoors. These results show that the Solo12 robot is a suitable open-source platform for research combining learning and control because of the easiness in transferring and deploying learned controllers.

摘要

四足机器人需要强大且通用的运动技能,以便在复杂且具有挑战性的环境中充分发挥其移动潜力。在这项工作中,我们展示了在Solo12四足机器人上基于端到端学习的稳健控制器的实现。我们的方法基于关节阻抗参考的深度强化学习。由此产生的控制策略遵循指令速度参考,同时能耗高效且易于部署。我们详细介绍了在真实机器人上的学习过程和迁移方法。我们展示了详尽的实验。最后,我们展示了在室内和室外各种地面上学习到的运动的实验结果。这些结果表明,由于在迁移和部署学习到的控制器方面很容易,Solo12机器人是一个适合结合学习与控制进行研究的开源平台。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/fb351cdd6fa4/41598_2023_38259_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/597e463c6d9e/41598_2023_38259_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/2178afab3dcb/41598_2023_38259_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/fd44766a387b/41598_2023_38259_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/0cbef9d60cd7/41598_2023_38259_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/634fca74acb9/41598_2023_38259_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/b0586e2657f8/41598_2023_38259_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/fb351cdd6fa4/41598_2023_38259_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/597e463c6d9e/41598_2023_38259_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/2178afab3dcb/41598_2023_38259_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/fd44766a387b/41598_2023_38259_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/0cbef9d60cd7/41598_2023_38259_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/634fca74acb9/41598_2023_38259_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/b0586e2657f8/41598_2023_38259_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd03/10366154/fb351cdd6fa4/41598_2023_38259_Fig7_HTML.jpg

相似文献

1
Controlling the Solo12 quadruped robot with deep reinforcement learning.利用深度强化学习控制Solo12四足机器人。
Sci Rep. 2023 Jul 24;13(1):11945. doi: 10.1038/s41598-023-38259-7.
2
Oncilla Robot: A Versatile Open-Source Quadruped Research Robot With Compliant Pantograph Legs.虎猫机器人:一种具有柔顺缩放机构腿部的多功能开源四足研究机器人。
Front Robot AI. 2018 Jun 19;5:67. doi: 10.3389/frobt.2018.00067. eCollection 2018.
3
Dynamic Fall Recovery Control for Legged Robots via Reinforcement Learning.基于强化学习的有腿机器人动态跌倒恢复控制
Biomimetics (Basel). 2024 Mar 22;9(4):193. doi: 10.3390/biomimetics9040193.
4
An Optimization-Based Locomotion Controller for Quadruped Robots Leveraging Cartesian Impedance Control.一种基于优化的四足机器人运动控制器,利用笛卡尔阻抗控制
Front Robot AI. 2020 Apr 24;7:48. doi: 10.3389/frobt.2020.00048. eCollection 2020.
5
A Hierarchical Framework for Quadruped Robots Gait Planning Based on DDPG.一种基于深度确定性策略梯度(DDPG)的四足机器人步态规划分层框架。
Biomimetics (Basel). 2023 Aug 22;8(5):382. doi: 10.3390/biomimetics8050382.
6
Hybrid learning mechanisms under a neural control network for various walking speed generation of a quadruped robot.用于四足机器人各种行走速度生成的神经控制网络下的混合学习机制。
Neural Netw. 2023 Oct;167:292-308. doi: 10.1016/j.neunet.2023.08.030. Epub 2023 Aug 26.
7
Joint elasticity produces energy efficiency in underwater locomotion: Verification with deep reinforcement learning.关节弹性在水下运动中产生能量效率:通过深度强化学习进行验证。
Front Robot AI. 2022 Sep 8;9:957931. doi: 10.3389/frobt.2022.957931. eCollection 2022.
8
Hybrid Bipedal Locomotion Based on Reinforcement Learning and Heuristics.基于强化学习和启发式算法的混合双足运动
Micromachines (Basel). 2022 Oct 7;13(10):1688. doi: 10.3390/mi13101688.
9
Fall prediction, control, and recovery of quadruped robots.四足机器人的跌倒预测、控制与恢复
ISA Trans. 2024 Aug;151:86-102. doi: 10.1016/j.isatra.2024.05.039. Epub 2024 May 25.
10
A sensory-driven controller for quadruped locomotion.一种用于四足动物运动的感官驱动控制器。
Biol Cybern. 2017 Feb;111(1):49-67. doi: 10.1007/s00422-016-0708-4. Epub 2017 Jan 7.

引用本文的文献

1
Foot trajectory as a key factor for diverse gait patterns in quadruped robot locomotion.足部轨迹作为四足机器人运动中多种步态模式的关键因素。
Sci Rep. 2025 Jan 13;15(1):1861. doi: 10.1038/s41598-024-84060-5.
2
Efficient Reinforcement Learning for 3D Jumping Monopods.用于3D跳跃单足机器人的高效强化学习
Sensors (Basel). 2024 Aug 1;24(15):4981. doi: 10.3390/s24154981.

本文引用的文献

1
Learning robust perceptive locomotion for quadrupedal robots in the wild.在野外环境中学习四足机器人的鲁棒感知运动。
Sci Robot. 2022 Jan 19;7(62):eabk2822. doi: 10.1126/scirobotics.abk2822.
2
Learning agile and dynamic motor skills for legged robots.学习用于腿部机器人的敏捷和动态运动技能。
Sci Robot. 2019 Jan 16;4(26). doi: 10.1126/scirobotics.aau5872.
3
Learning quadrupedal locomotion over challenging terrain.学习在具有挑战性的地形上进行四足运动。
Sci Robot. 2020 Oct 21;5(47). doi: 10.1126/scirobotics.abc5986.
4
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.
5
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.
6
Reinforcement learning of motor skills with policy gradients.基于策略梯度的运动技能强化学习。
Neural Netw. 2008 May;21(4):682-97. doi: 10.1016/j.neunet.2008.02.003. Epub 2008 Apr 26.
7
Catastrophic forgetting in connectionist networks.联结主义网络中的灾难性遗忘。
Trends Cogn Sci. 1999 Apr;3(4):128-135. doi: 10.1016/s1364-6613(99)01294-2.