Li Tzuu-Hseng S, Su Yu-Te, Lai Shao-Wei, Hu Jhen-Jia
Department of Electrical Engineering, National Cheng Kung University, Tainan, Taiwan.
IEEE Trans Syst Man Cybern B Cybern. 2011 Jun;41(3):736-48. doi: 10.1109/TSMCB.2010.2089978. Epub 2010 Nov 18.
This paper proposes the implementation of fuzzy motion control based on reinforcement learning (RL) and Lagrange polynomial interpolation (LPI) for gait synthesis of biped robots. First, the procedure of a walking gait is redefined into three states, and the parameters of this designed walking gait are determined. Then, policy gradient RL (PGRL) is applied to adjust the walking parameters; it can run in real time and modify the policy directly without computing the system dynamics. Given a parameterized walking motion designed for biped robots, the PGRL algorithm automatically searches the space of possible parameters and finds the fastest possible walking motion. The reward function is based primarily on walking speed, which is estimated from the vision system. However, experiments show that this kind of learning process suffers from stability problems. To solve them, the desired zero-moment-point trajectory is added to the reward function. The results show that, after learning, the robot not only walks more stably but also walks faster. This is more effective and attractive than manual trial-and-error tuning. LPI is then employed to transform the existing motions into a motion whose revised angle is determined by the fuzzy motion controller, so the biped robot can walk continuously in any desired direction under this fuzzy motion control. Finally, the fuzzy-based gait synthesis control is demonstrated by point- and line-target tracking tasks. The experiments show the feasibility and effectiveness of gait learning with PGRL and the practicability of the proposed fuzzy motion control scheme.
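The PGRL gait tuning described in the abstract can be illustrated with a finite-difference policy gradient search, a common PGRL variant for parameterized gaits. This is a minimal sketch, not the paper's exact algorithm: the perturbation count, step sizes, and the `reward_fn` (which in the paper combines walking speed with the desired zero-moment-point trajectory) are illustrative assumptions.

```python
import random

def policy_gradient_search(params, reward_fn, step=0.05, epsilon=0.02, iters=50):
    """Finite-difference policy gradient search over gait parameters.

    A sketch of one common PGRL variant; the paper's exact update rule,
    evaluation procedure, and reward weighting are not reproduced here.
    """
    params = list(params)
    for _ in range(iters):
        # Evaluate several randomly perturbed copies of the parameter set.
        perturbations = [[random.choice((-epsilon, 0.0, epsilon))
                          for _ in params] for _ in range(8)]
        rewards = [reward_fn([p + d for p, d in zip(params, pert)])
                   for pert in perturbations]
        # Per-parameter gradient estimate: compare average reward of
        # positively vs. negatively perturbed trials, then step uphill.
        for i in range(len(params)):
            plus = [r for r, pert in zip(rewards, perturbations) if pert[i] > 0]
            minus = [r for r, pert in zip(rewards, perturbations) if pert[i] < 0]
            if plus and minus:
                grad = sum(plus) / len(plus) - sum(minus) / len(minus)
                if grad > 0:
                    params[i] += step
                elif grad < 0:
                    params[i] -= step
    return params
```

In the paper's setting, `reward_fn` would run the robot with the candidate gait parameters and return something like (estimated walking speed) minus a weighted zero-moment-point tracking error; here any callable mapping parameters to a scalar reward works.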
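The LPI step, transforming existing motions into one with a revised angle, rests on standard Lagrange polynomial interpolation. A minimal sketch follows; treating the sample points as (phase, joint-angle) pairs is an illustrative assumption, not the paper's exact formulation.

```python
def lagrange_interpolate(points, x):
    """Evaluate the Lagrange interpolating polynomial through `points`
    (a list of (x_i, y_i) pairs with distinct x_i) at position x."""
    total = 0.0
    for i, (xi, yi) in enumerate(points):
        # Basis polynomial L_i(x): 1 at x_i, 0 at every other x_j.
        term = yi
        for j, (xj, _) in enumerate(points):
            if j != i:
                term *= (x - xj) / (xi - xj)
        total += term
    return total
```

For gait synthesis, interpolating between keyframe joint angles in this way yields a smooth intermediate motion; the fuzzy motion controller would then supply the revised angle at which the interpolated trajectory is evaluated.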