• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于中央模式发生器的控制器和强化学习探索毛毛虫状软体机器人的行为。

Exploring Behaviors of Caterpillar-Like Soft Robots with a Central Pattern Generator-Based Controller and Reinforcement Learning.

机构信息

Graduate School of Information Science and Technology, The University of Tokyo, Bunkyo-ku, Tokyo, Japan.

Department of Information Science and Engineering, Ritsumeikan University, Kusatsu, Shiga, Japan.

出版信息

Soft Robot. 2019 Oct;6(5):579-594. doi: 10.1089/soro.2018.0126. Epub 2019 May 20.

DOI:10.1089/soro.2018.0126
PMID:31107172
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6786347/
Abstract

Due to their flexibility, soft-bodied robots can potentially achieve rich and various behaviors within a single body. However, to date, no methodology has effectively harnessed these robots to achieve such diverse desired functionalities. Controllers that accomplish only a limited range of behaviors in such robots have been handcrafted. Moreover, the behaviors of these robots should be determined through body-environment interactions because an appropriate behavior may not always be manifested even if the body dynamics are given. Therefore, we have proposed SenseCPG-PGPE, a method for automatically designing behaviors for caterpillar-like soft-bodied robots. This method optimizes mechanosensory feedback to a central pattern generator (CPG)-based controller, which controls actuators in a robot, using policy gradients with parameter-based exploration (PGPE). In this article, we deeply investigated this method. We found that PGPE can optimize a CPG-based controller for soft-bodied robots that exhibit viscoelasticity and large deformation, whereas other popular policy gradient methods, such as trust region policy optimization and proximal policy optimization, cannot. Scalability of the method was confirmed using simulation as well. Although SenseCPG-PGPE uses a CPG-based controller, it can achieve nonsteady motion such as climbing a step in a simulated robot. The approach also resulted in distinctive behaviors depending on different body-environment conditions. These results demonstrate that the proposed method enables soft robots to explore a variety of behaviors automatically.

摘要

由于其灵活性,软体机器人可以在单个机器人中实现丰富多样的行为。然而,迄今为止,还没有有效的方法能够有效地利用这些机器人来实现如此多样化的期望功能。在这样的机器人中完成有限行为范围的控制器是手工制作的。此外,这些机器人的行为应该通过体-环境相互作用来确定,因为即使给出了身体动力学,也不一定始终表现出适当的行为。因此,我们提出了 SenseCPG-PGPE,这是一种为毛毛虫状软体机器人自动设计行为的方法。该方法使用基于策略梯度的参数探索(PGPE)优化基于中央模式发生器(CPG)的控制器的机械感觉反馈,该控制器控制机器人中的致动器。在本文中,我们深入研究了这种方法。我们发现,PGPE 可以优化具有粘弹性和大变形的软体机器人的基于 CPG 的控制器,而其他流行的策略梯度方法,如信任区域策略优化和近端策略优化,则不能。我们还通过仿真验证了该方法的可扩展性。尽管 SenseCPG-PGPE 使用基于 CPG 的控制器,但它可以在模拟机器人中实现非稳态运动,例如爬楼梯。该方法还根据不同的体-环境条件产生了不同的行为。这些结果表明,所提出的方法使软体机器人能够自动探索各种行为。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d068/6786347/e9869a1bb9d5/fig-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d068/6786347/e0c2d7537e4b/fig-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d068/6786347/38da7b9c6c70/fig-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d068/6786347/e9869a1bb9d5/fig-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d068/6786347/e0c2d7537e4b/fig-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d068/6786347/38da7b9c6c70/fig-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d068/6786347/e9869a1bb9d5/fig-5.jpg

相似文献

1
Exploring Behaviors of Caterpillar-Like Soft Robots with a Central Pattern Generator-Based Controller and Reinforcement Learning.基于中央模式发生器的控制器和强化学习探索毛毛虫状软体机器人的行为。
Soft Robot. 2019 Oct;6(5):579-594. doi: 10.1089/soro.2018.0126. Epub 2019 May 20.
2
Energy-efficient and damage-recovery slithering gait design for a snake-like robot based on reinforcement learning and inverse reinforcement learning.基于强化学习和逆强化学习的蛇形机器人节能与损伤恢复蠕动步态设计。
Neural Netw. 2020 Sep;129:323-333. doi: 10.1016/j.neunet.2020.05.029. Epub 2020 Jun 16.
3
A Hierarchical Framework for Quadruped Robots Gait Planning Based on DDPG.一种基于深度确定性策略梯度(DDPG)的四足机器人步态规划分层框架。
Biomimetics (Basel). 2023 Aug 22;8(5):382. doi: 10.3390/biomimetics8050382.
4
Generic Neural Locomotion Control Framework for Legged Robots.腿足机器人通用神经运动控制框架。
IEEE Trans Neural Netw Learn Syst. 2021 Sep;32(9):4013-4025. doi: 10.1109/TNNLS.2020.3016523. Epub 2021 Aug 31.
5
Robust control of CPG-based 3D neuromusculoskeletal walking model.基于中枢模式发生器的三维神经肌肉骨骼行走模型的鲁棒控制
Biol Cybern. 2011 Oct;105(3-4):269-82. doi: 10.1007/s00422-011-0464-4. Epub 2011 Dec 3.
6
Discrete-Time Impedance Control for Dynamic Response Regulation of Parallel Soft Robots.用于并联柔性机器人动态响应调节的离散时间阻抗控制
Biomimetics (Basel). 2024 May 28;9(6):323. doi: 10.3390/biomimetics9060323.
7
Enhancing postural stability in musculoskeletal quadrupedal locomotion through tension feedback for CPG-based controller.通过张力反馈增强基于 CPG 的控制器在肌肉骨骼四足运动中的姿势稳定性。
Bioinspir Biomim. 2024 Nov 7;20(1). doi: 10.1088/1748-3190/ad839e.
8
Soft-body dynamics induces energy efficiency in undulatory swimming: A deep learning study.软体动力学在波动游泳中诱导能量效率:一项深度学习研究。
Front Robot AI. 2023 Feb 9;10:1102854. doi: 10.3389/frobt.2023.1102854. eCollection 2023.
9
Task space adaptation via the learning of gait controllers of magnetic soft millirobots.通过学习磁性软微型机器人的步态控制器实现任务空间自适应
Int J Rob Res. 2021 Dec 1;40(12-14):1331-1351. doi: 10.1177/02783649211021869. Epub 2021 Jun 16.
10
Lamarckian Evolution of Simulated Modular Robots.模拟模块化机器人的拉马克进化
Front Robot AI. 2019 Feb 18;6:9. doi: 10.3389/frobt.2019.00009. eCollection 2019.

引用本文的文献

1
Research on control strategy of pneumatic soft bionic robot based on improved CPG.基于改进的 CPG 的气动软体仿生机器人控制策略研究。
PLoS One. 2024 Jul 5;19(7):e0306320. doi: 10.1371/journal.pone.0306320. eCollection 2024.
2
A Survey on Reinforcement Learning Methods in Bionic Underwater Robots.仿生水下机器人中强化学习方法的综述
Biomimetics (Basel). 2023 Apr 20;8(2):168. doi: 10.3390/biomimetics8020168.
3
Robust and reusable self-organized locomotion of legged robots under adaptive physical and neural communications.

本文引用的文献

1
Evolving Soft Locomotion in Aquatic and Terrestrial Environments: Effects of Material Properties and Environmental Transitions.水生和陆地环境中不断演变的软运动:材料特性和环境转变的影响。
Soft Robot. 2018 Aug;5(4):475-495. doi: 10.1089/soro.2017.0055. Epub 2018 Jul 9.
2
Robotics-inspired biology.受机器人技术启发的生物学。
J Exp Biol. 2018 Mar 29;221(Pt 7):jeb138438. doi: 10.1242/jeb.138438.
3
A Quadruped Robot Exhibiting Spontaneous Gait Transitions from Walking to Trotting to Galloping.一种能够自主实现从行走、小跑至奔腾步态转换的四足机器人
自适应物理和神经通信下的腿足机器人强健且可重复的自组织运动。
Front Neural Circuits. 2023 Mar 31;17:1111285. doi: 10.3389/fncir.2023.1111285. eCollection 2023.
4
Characterization of continuum robot arms under reinforcement learning and derived improvements.强化学习下连续体机器人手臂的特性及衍生改进
Front Robot AI. 2022 Sep 1;9:895388. doi: 10.3389/frobt.2022.895388. eCollection 2022.
5
Insect wing 3D printing.昆虫翅膀 3D 打印。
Sci Rep. 2021 Oct 14;11(1):18631. doi: 10.1038/s41598-021-98242-y.
6
Biomimetic Prosthetic Hand Enabled by Liquid Crystal Elastomer Tendons.由液晶弹性体肌腱驱动的仿生假手。
Micromachines (Basel). 2021 Jun 23;12(7):736. doi: 10.3390/mi12070736.
7
General Distributed Neural Control and Sensory Adaptation for Self-Organized Locomotion and Fast Adaptation to Damage of Walking Robots.通用分布式神经控制和感觉适应,用于自主运动和快速适应步行机器人的损伤。
Front Neural Circuits. 2020 Aug 17;14:46. doi: 10.3389/fncir.2020.00046. eCollection 2020.
Sci Rep. 2017 Mar 21;7(1):277. doi: 10.1038/s41598-017-00348-9.
4
Gait control in a soft robot by sensing interactions with the environment using self-deformation.通过利用自身变形感知与环境的相互作用来实现软机器人的步态控制。
R Soc Open Sci. 2016 Dec 7;3(12):160766. doi: 10.1098/rsos.160766. eCollection 2016 Dec.
5
Soft robotics: a bioinspired evolution in robotics.软机器人:机器人学的仿生进化。
Trends Biotechnol. 2013 May;31(5):287-94. doi: 10.1016/j.tibtech.2013.03.002. Epub 2013 Apr 12.
6
The role of feedback in morphological computation with compliant bodies.反馈在柔性物体形态计算中的作用。
Biol Cybern. 2012 Nov;106(10):595-613. doi: 10.1007/s00422-012-0516-4. Epub 2012 Sep 6.
7
CPG-inspired workspace trajectory generation and adaptive locomotion control for quadruped robots.受CPG启发的四足机器人工作空间轨迹生成与自适应运动控制
IEEE Trans Syst Man Cybern B Cybern. 2011 Jun;41(3):867-80. doi: 10.1109/TSMCB.2010.2097589. Epub 2011 Jan 6.
8
Parameter-exploring policy gradients.参数探索策略梯度。
Neural Netw. 2010 May;23(4):551-9. doi: 10.1016/j.neunet.2009.12.004. Epub 2009 Dec 16.
9
Emergence of functional hierarchy in a multiple timescale neural network model: a humanoid robot experiment.多时间尺度神经网络模型中功能层次结构的出现:一项仿人机器人实验。
PLoS Comput Biol. 2008 Nov;4(11):e1000220. doi: 10.1371/journal.pcbi.1000220. Epub 2008 Nov 7.
10
Central pattern generators for locomotion control in animals and robots: a review.动物和机器人运动控制中的中枢模式发生器:综述
Neural Netw. 2008 May;21(4):642-53. doi: 10.1016/j.neunet.2008.03.014. Epub 2008 May 14.