• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于PGRL、LPI和模糊逻辑的双足机器人步行运动生成、合成与控制

Walking motion generation, synthesis, and control for biped robot by using PGRL, LPI, and fuzzy logic.

作者信息

Li Tzuu-Hseng S, Su Yu-Te, Lai Shao-Wei, Hu Jhen-Jia

机构信息

Department of Electrical Engineering, National Cheng Kung University, Tainan, Taiwan.

出版信息

IEEE Trans Syst Man Cybern B Cybern. 2011 Jun;41(3):736-48. doi: 10.1109/TSMCB.2010.2089978. Epub 2010 Nov 18.

DOI:10.1109/TSMCB.2010.2089978
PMID:21095871
Abstract

This paper proposes the implementation of fuzzy motion control based on reinforcement learning (RL) and Lagrange polynomial interpolation (LPI) for gait synthesis of biped robots. First, the procedure of a walking gait is redefined into three states, and the parameters of this designed walking gait are determined. Then, the machine learning approach applied to adjusting the walking parameters is policy gradient RL (PGRL), which can execute real-time performance and directly modify the policy without calculating the dynamic function. Given a parameterized walking motion designed for biped robots, the PGRL algorithm automatically searches the set of possible parameters and finds the fastest possible walking motion. The reward function mainly considered is first the walking speed, which can be estimated from the vision system. However, the experiment illustrates that there are some stability problems in this kind of learning process. To solve these problems, the desired zero moment point trajectory is added to the reward function. The results show that the robot not only has more stable walking but also increases its walking speed after learning. This is more effective and attractive than manual trial-and-error tuning. LPI, moreover, is employed to transform the existing motions to the motion which has a revised angle determined by the fuzzy motion controller. Then, the biped robot can continuously walk in any desired direction through this fuzzy motion control. Finally, the fuzzy-based gait synthesis control is demonstrated by tasks and point- and line-target tracking. The experiments show the feasibility and effectiveness of gait learning with PGRL and the practicability of the proposed fuzzy motion control scheme.

摘要

本文提出了一种基于强化学习(RL)和拉格朗日多项式插值(LPI)的模糊运动控制方法,用于两足机器人的步态合成。首先,将行走步态的过程重新定义为三种状态,并确定所设计行走步态的参数。然后,应用于调整行走参数的机器学习方法是策略梯度强化学习(PGRL),它可以执行实时性能并直接修改策略,而无需计算动态函数。给定为两足机器人设计的参数化行走运动,PGRL算法会自动搜索可能的参数集,并找到最快的行走运动。主要考虑的奖励函数首先是行走速度,它可以从视觉系统中估计出来。然而,实验表明这种学习过程存在一些稳定性问题。为了解决这些问题,将期望的零力矩点轨迹添加到奖励函数中。结果表明,机器人在学习后不仅行走更稳定,而且行走速度也有所提高。这比手动试错调整更有效且更具吸引力。此外,采用LPI将现有运动转换为由模糊运动控制器确定的具有修正角度的运动。然后,两足机器人可以通过这种模糊运动控制在任何期望的方向上连续行走。最后,通过任务以及点和线目标跟踪演示了基于模糊的步态合成控制。实验表明了使用PGRL进行步态学习的可行性和有效性,以及所提出的模糊运动控制方案的实用性。

相似文献

1
Walking motion generation, synthesis, and control for biped robot by using PGRL, LPI, and fuzzy logic.基于PGRL、LPI和模糊逻辑的双足机器人步行运动生成、合成与控制
IEEE Trans Syst Man Cybern B Cybern. 2011 Jun;41(3):736-48. doi: 10.1109/TSMCB.2010.2089978. Epub 2010 Nov 18.
2
Adaptive fuzzy neural network control design via a T-S fuzzy model for a robot manipulator including actuator dynamics.基于T-S模糊模型的机器人机械手自适应模糊神经网络控制设计,包括执行器动力学。
IEEE Trans Syst Man Cybern B Cybern. 2008 Oct;38(5):1326-46. doi: 10.1109/TSMCB.2008.925749.
3
A reflexive neural network for dynamic biped walking control.一种用于动态双足步行控制的自反神经网络。
Neural Comput. 2006 May;18(5):1156-96. doi: 10.1162/089976606776241057.
4
Fractional fuzzy adaptive sliding-mode control of a 2-DOF direct-drive robot arm.二自由度直接驱动机器人手臂的分数阶模糊自适应滑模控制
IEEE Trans Syst Man Cybern B Cybern. 2008 Dec;38(6):1561-70. doi: 10.1109/TSMCB.2008.928227.
5
Fuzzy integral-based gaze control architecture incorporated with modified-univector field-based navigation for humanoid robots.基于模糊积分的注视控制架构与基于改进单矢量场的类人机器人导航相结合。
IEEE Trans Syst Man Cybern B Cybern. 2012 Feb;42(1):125-39. doi: 10.1109/TSMCB.2011.2162234. Epub 2011 Aug 30.
6
A hybrid CPG-ZMP control system for stable walking of a simulated flexible spine humanoid robot.一种用于模拟柔性脊柱人形机器人稳定行走的混合 CPG-ZMP 控制系统。
Neural Netw. 2010 Apr;23(3):452-60. doi: 10.1016/j.neunet.2009.11.003. Epub 2009 Dec 3.
7
Fuzzy auto-tuning PID control of multiple joint robot driven by ultrasonic motors.基于超声电机驱动的多关节机器人模糊自整定PID控制
Ultrasonics. 2007 Nov;46(4):303-12. doi: 10.1016/j.ultras.2007.04.001. Epub 2007 Apr 21.
8
Motion control of planar parallel robot using the fuzzy descriptor system approach.基于模糊描述符系统方法的平面并联机器人运动控制。
ISA Trans. 2012 Sep;51(5):596-608. doi: 10.1016/j.isatra.2012.04.001. Epub 2012 May 25.
9
New hybrid adaptive neuro-fuzzy algorithms for manipulator control with uncertainties- comparative study.用于具有不确定性的机械手控制的新型混合自适应神经模糊算法——比较研究
ISA Trans. 2009 Oct;48(4):497-502. doi: 10.1016/j.isatra.2009.05.003. Epub 2009 Jun 11.
10
Intelligent robust tracking control for a class of uncertain strict-feedback nonlinear systems.一类不确定严格反馈非线性系统的智能鲁棒跟踪控制
IEEE Trans Syst Man Cybern B Cybern. 2009 Feb;39(1):142-55. doi: 10.1109/TSMCB.2008.2002854.

引用本文的文献

1
Flexible Insole Sensors with Stably Connected Electrodes for Gait Phase Detection.具有稳定连接电极的灵活鞋垫传感器,用于步态相位检测。
Sensors (Basel). 2019 Nov 27;19(23):5197. doi: 10.3390/s19235197.