• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于强化学习方法的单腿跳跃机器人模型在存在外部干扰情况下的站立平衡。

Standing balance of single-legged hopping robot model using reinforcement learning approach in the presence of external disturbances.

作者信息

Hoseinifard S Mohamad, Sadedel Majid

机构信息

Department of Mechanical Engineering, Tarbiat Modares University, Tehran, Iran.

出版信息

Sci Rep. 2024 Dec 30;14(1):32036. doi: 10.1038/s41598-024-83749-x.

DOI:10.1038/s41598-024-83749-x
PMID:39738451
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11686392/
Abstract

In this scholarly investigation, the study focuses on scrutinizing the locomotion and control mechanisms governing a single-legged robot. The analysis encompasses the robot's movement dynamics pertaining to two primary objectives: executing jumps and sustaining equilibrium throughout successive jump sequences. Diverse concepts of this robot model have been scrutinized, leading to the introduction of a distinctive semi-active model devised for maintaining the robot's balance. The research involves an initial design for the robot model followed by the introduction of a multi-phase composite control system. As per the proposed model, the jumping action is facilitated through a four-link mechanism augmented by a spring, while balance preservation is achieved through the independent operation of two arms connected to the upper body. To address the successive jumps within the four-link mechanism, a multi-phase feedback controller is engineered. Additionally, a hybrid control strategy, incorporating the Deep Deterministic Policy Gradient algorithm (DDPG) along with a feedback controller, is proposed to sustain balance throughout the robot's contact and flight phases. The research outcomes, acquired through a series of comprehensive tests conducted within the Simulink simulator environment, demonstrate the robot's capacity to maintain balance over 80 consecutive jumps. The evaluations encompassed various simulated external disturbances, including 1- horizontal impacts on the upper body, 2- disparities in ground height, and 3- alterations in ground angle between consecutive steps. Notably, the findings showcase the robot's adeptness in maintaining balance despite an impact with an amplitude of 25 N for a duration of 0.1 seconds, as well as its resilience in managing ground height disparities up to 3 cm and ground angle variations of up to 3°.

摘要

在这项学术研究中,该研究聚焦于审视单腿机器人的运动及控制机制。分析涵盖了与两个主要目标相关的机器人运动动力学:执行跳跃动作以及在连续跳跃序列中保持平衡。对该机器人模型的多种概念进行了审视,进而引入了一种为维持机器人平衡而设计的独特半主动模型。研究包括对机器人模型的初步设计,随后引入了一种多阶段复合控制系统。根据所提出的模型,跳跃动作通过一个由弹簧增强的四连杆机构来实现,而平衡的保持则通过连接到上半身的两条手臂的独立运作来达成。为解决四连杆机构中的连续跳跃问题,设计了一种多阶段反馈控制器。此外,还提出了一种混合控制策略,将深度确定性策略梯度算法(DDPG)与反馈控制器相结合,以在机器人的接触阶段和飞行阶段维持平衡。通过在Simulink模拟器环境中进行的一系列全面测试所获得的研究成果表明,该机器人能够在连续80次跳跃中保持平衡。评估涵盖了各种模拟的外部干扰,包括:1 - 对上半身的水平冲击,2 - 地面高度差异,以及3 - 连续步之间地面角度的变化。值得注意的是,研究结果显示,尽管受到幅度为25 N、持续时间为0.1秒的冲击,该机器人仍能熟练地保持平衡,并且在处理高达3 cm的地面高度差异以及高达3°的地面角度变化时也具有很强的适应性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/848126d576f6/41598_2024_83749_Fig23_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/6fdd88114156/41598_2024_83749_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/4a2ca4d92250/41598_2024_83749_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/5fc4f5a7d96d/41598_2024_83749_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/13184b8f45dd/41598_2024_83749_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/562688a2b8ab/41598_2024_83749_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/e7d08a79feb9/41598_2024_83749_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/c17c5921c1fc/41598_2024_83749_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/61df97594beb/41598_2024_83749_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/b50e0a7882b1/41598_2024_83749_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/7fb828807fbb/41598_2024_83749_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/0d241b489bf5/41598_2024_83749_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/5424f543bfe2/41598_2024_83749_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/c9356c64a0b6/41598_2024_83749_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/55e132001876/41598_2024_83749_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/00966399dacd/41598_2024_83749_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/56cddbdac208/41598_2024_83749_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/51292e8935fc/41598_2024_83749_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/5ddb9317bb1f/41598_2024_83749_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/61def11dd039/41598_2024_83749_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/93e9dbbe70c7/41598_2024_83749_Fig20_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/ab9478e8887e/41598_2024_83749_Fig21_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/7dc515012ac8/41598_2024_83749_Fig22_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/848126d576f6/41598_2024_83749_Fig23_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/6fdd88114156/41598_2024_83749_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/4a2ca4d92250/41598_2024_83749_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/5fc4f5a7d96d/41598_2024_83749_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/13184b8f45dd/41598_2024_83749_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/562688a2b8ab/41598_2024_83749_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/e7d08a79feb9/41598_2024_83749_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/c17c5921c1fc/41598_2024_83749_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/61df97594beb/41598_2024_83749_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/b50e0a7882b1/41598_2024_83749_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/7fb828807fbb/41598_2024_83749_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/0d241b489bf5/41598_2024_83749_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/5424f543bfe2/41598_2024_83749_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/c9356c64a0b6/41598_2024_83749_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/55e132001876/41598_2024_83749_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/00966399dacd/41598_2024_83749_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/56cddbdac208/41598_2024_83749_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/51292e8935fc/41598_2024_83749_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/5ddb9317bb1f/41598_2024_83749_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/61def11dd039/41598_2024_83749_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/93e9dbbe70c7/41598_2024_83749_Fig20_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/ab9478e8887e/41598_2024_83749_Fig21_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/7dc515012ac8/41598_2024_83749_Fig22_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e16a/11686392/848126d576f6/41598_2024_83749_Fig23_HTML.jpg

相似文献

1
Standing balance of single-legged hopping robot model using reinforcement learning approach in the presence of external disturbances.基于强化学习方法的单腿跳跃机器人模型在存在外部干扰情况下的站立平衡。
Sci Rep. 2024 Dec 30;14(1):32036. doi: 10.1038/s41598-024-83749-x.
2
Oncilla Robot: A Versatile Open-Source Quadruped Research Robot With Compliant Pantograph Legs.虎猫机器人:一种具有柔顺缩放机构腿部的多功能开源四足研究机器人。
Front Robot AI. 2018 Jun 19;5:67. doi: 10.3389/frobt.2018.00067. eCollection 2018.
3
Virtual Energy Management for Physical Energy Savings in a Legged Robot Hopping on Granular Media.用于在粒状介质上跳跃的有腿机器人中实现物理节能的虚拟能量管理
Front Robot AI. 2021 Dec 21;8:740927. doi: 10.3389/frobt.2021.740927. eCollection 2021.
4
Vertical Jumping for Legged Robot Based on Quadratic Programming.基于二次规划的足式机器人垂直跳跃。
Sensors (Basel). 2021 May 25;21(11):3679. doi: 10.3390/s21113679.
5
Design and dynamic analysis of jumping wheel-legged robot in complex terrain environment.复杂地形环境下跳跃轮腿机器人的设计与动力学分析
Front Neurorobot. 2022 Dec 2;16:1066714. doi: 10.3389/fnbot.2022.1066714. eCollection 2022.
6
Gait and locomotion analysis of a soft-hybrid multi-legged modular miniature robot.软杂交多足模块化微型机器人的步态和运动分析。
Bioinspir Biomim. 2021 Sep 28;16(6). doi: 10.1088/1748-3190/ac245e.
7
SLP-Improved DDPG Path-Planning Algorithm for Mobile Robot in Large-Scale Dynamic Environment.基于 SLP 的改进 DDPG 路径规划算法在大规模动态环境下的移动机器人应用
Sensors (Basel). 2023 Mar 28;23(7):3521. doi: 10.3390/s23073521.
8
Research on wheel-legged robot based on LQR and ADRC.基于线性二次型调节器(LQR)和自抗扰控制器(ADRC)的轮腿式机器人研究
Sci Rep. 2023 Sep 13;13(1):15122. doi: 10.1038/s41598-023-41462-1.
9
Robot grasping method optimization using improved deep deterministic policy gradient algorithm of deep reinforcement learning.基于深度强化学习的改进深度确定性策略梯度算法的机器人抓取方法优化
Rev Sci Instrum. 2021 Feb 1;92(2):025114. doi: 10.1063/5.0034101.
10
Stable Jumping Control Based on Deep Reinforcement Learning for a Locust-Inspired Robot.基于深度强化学习的仿蝗虫机器人稳定跳跃控制
Biomimetics (Basel). 2024 Sep 11;9(9):548. doi: 10.3390/biomimetics9090548.

引用本文的文献

1
Bridging the Gap to Bionic Motion: Challenges in Legged Robot Limb Unit Design, Modeling, and Control.弥合与仿生运动的差距:有腿机器人肢体单元设计、建模与控制中的挑战
Cyborg Bionic Syst. 2025 Aug 19;6:0365. doi: 10.34133/cbsystems.0365. eCollection 2025.
2
Near real-time online reinforcement learning with synchronous or asynchronous updates.具有同步或异步更新的近实时在线强化学习。
Sci Rep. 2025 May 17;15(1):17158. doi: 10.1038/s41598-025-00492-7.

本文引用的文献

1
3D-SLIP model based dynamic stability strategy for legged robots with impact disturbance rejection.基于 3D-SLIP 模型的具有冲击干扰抑制的腿足机器人动态稳定性策略。
Sci Rep. 2022 Apr 7;12(1):5892. doi: 10.1038/s41598-022-09937-9.
2
Learning agile and dynamic motor skills for legged robots.学习用于腿部机器人的敏捷和动态运动技能。
Sci Robot. 2019 Jan 16;4(26). doi: 10.1126/scirobotics.aau5872.
3
A flight-phase terrain following control strategy for stable and robust hopping of a one-legged robot under large terrain variations.
一种用于单腿机器人在大地形变化下稳定和鲁棒跳跃的飞行阶段地形跟随控制策略。
Bioinspir Biomim. 2017 Aug 4;12(4):046011. doi: 10.1088/1748-3190/aa741f.