基于自然进化策略的稳定非线性动力系统的机器人策略改进

Robot Policy Improvement With Natural Evolution Strategies for Stable Nonlinear Dynamical System.

作者信息

Hu Yingbai, Chen Guang, Li Zhijun, Knoll Alois

出版信息

IEEE Trans Cybern. 2023 Jun;53(6):4002-4014. doi: 10.1109/TCYB.2022.3192049. Epub 2023 May 17.

DOI:10.1109/TCYB.2022.3192049

Abstract

Robot learning through kinesthetic teaching is a promising way of cloning human behaviors, but it has its limits in the performance of complex tasks with small amounts of data, due to compounding errors. In order to improve the robustness and adaptability of imitation learning, a hierarchical learning strategy is proposed: low-level learning comprises only behavioral cloning with supervised learning, and high-level learning constitutes policy improvement. First, the Gaussian mixture model (GMM)-based dynamical system is formulated to encode a motion from the demonstration. We then derive the sufficient conditions of the GMM parameters that guarantee the global stability of the dynamical system from any initial state, using the Lyapunov stability theorem. Generally, imitation learning should reason about the motion well into the future for a wide range of tasks; it is significant to improve the adaptability of the learning method by policy improvement. Finally, a method based on exponential natural evolution strategies is proposed to optimize the parameters of the dynamical system associated with the stiffness of variable impedance control, in which the exploration noise is subject to stability conditions of the dynamical system in the exploration space, thus guaranteeing the global stability. Empirical evaluations are conducted on manipulators for different scenarios, including motion planning with obstacle avoidance and stiffness learning.

摘要

通过动觉教学进行机器人学习是一种很有前景的克隆人类行为的方式，但由于误差的累积，在处理少量数据的复杂任务时存在局限性。为了提高模仿学习的鲁棒性和适应性，提出了一种分层学习策略：低级学习仅包括基于监督学习的行为克隆，高级学习则是策略改进。首先，构建基于高斯混合模型（GMM）的动态系统，对示范中的运动进行编码。然后，利用李雅普诺夫稳定性定理，推导保证动态系统从任何初始状态全局稳定的GMM参数的充分条件。一般来说，模仿学习需要对广泛任务中的未来运动进行合理推理；通过策略改进提高学习方法的适应性具有重要意义。最后，提出一种基于指数自然进化策略的方法，用于优化与可变阻抗控制刚度相关的动态系统参数，其中探索噪声在探索空间中受动态系统稳定性条件的约束，从而保证全局稳定性。针对不同场景在操纵器上进行了实证评估，包括避障运动规划和刚度学习。

相似文献

Robot Policy Improvement With Natural Evolution Strategies for Stable Nonlinear Dynamical System.基于自然进化策略的稳定非线性动力系统的机器人策略改进

IEEE Trans Cybern. 2023 Jun;53(6):4002-4014. doi: 10.1109/TCYB.2022.3192049. Epub 2023 May 17.

Research on Robot Screwing Skill Method Based on Demonstration Learning.基于示范学习的机器人拧紧技能方法研究

Sensors (Basel). 2023 Dec 19;24(1):21. doi: 10.3390/s24010021.

A study on robot force control based on the GMM/GMR algorithm fusing different compensation strategies.基于融合不同补偿策略的高斯混合模型/高斯混合回归算法的机器人力控制研究

Front Neurorobot. 2024 Jan 29;18:1290853. doi: 10.3389/fnbot.2024.1290853. eCollection 2024.

Guided Stochastic Optimization for Motion Planning.用于运动规划的引导式随机优化

Front Robot AI. 2019 Nov 12;6:105. doi: 10.3389/frobt.2019.00105. eCollection 2019.

An Improvement of Robot Stiffness-Adaptive Skill Primitive Generalization Using the Surface Electromyography in Human-Robot Collaboration.一种在人机协作中利用表面肌电图改进机器人刚度自适应技能原语泛化的方法。

Front Neurosci. 2021 Sep 14;15:694914. doi: 10.3389/fnins.2021.694914. eCollection 2021.

Human-robot skill transmission for mobile robot via learning by demonstration.通过示范学习实现移动机器人的人机技能传递。

Neural Comput Appl. 2021 Sep 22:1-11. doi: 10.1007/s00521-021-06449-x.

Peg-in-hole assembly skill imitation learning method based on ProMPs under task geometric representation.基于任务几何表示的ProMPs的插销入孔装配技能模仿学习方法

Front Neurorobot. 2023 Nov 9;17:1320251. doi: 10.3389/fnbot.2023.1320251. eCollection 2023.

A Learning-Based Hierarchical Control Scheme for an Exoskeleton Robot in Human-Robot Cooperative Manipulation.基于学习的人机协作操作外骨骼机器人分层控制方案。

IEEE Trans Cybern. 2020 Jan;50(1):112-125. doi: 10.1109/TCYB.2018.2864784. Epub 2018 Aug 31.

A New Noise-Tolerant Obstacle Avoidance Scheme for Motion Planning of Redundant Robot Manipulators.一种用于冗余机器人机械臂运动规划的新型抗噪声避障方案。

Front Neurorobot. 2018 Aug 29;12:51. doi: 10.3389/fnbot.2018.00051. eCollection 2018.

An Efficient Motion Planning Method with a Lazy Demonstration Graph for Repetitive Pick-and-Place.一种基于惰性演示图的高效重复抓取与放置运动规划方法。

Biomimetics (Basel). 2022 Nov 21;7(4):210. doi: 10.3390/biomimetics7040210.

引用本文的文献

Leveraging imitation learning in agricultural robotics: a comprehensive survey and comparative analysis.农业机器人中模仿学习的应用：全面综述与比较分析

Front Robot AI. 2024 Oct 17;11:1441312. doi: 10.3389/frobt.2024.1441312. eCollection 2024.

Exploring wireless device-free localization technique to assist home-based neuro-rehabilitation.探索无设备无线定位技术以辅助居家神经康复。

Front Neurosci. 2024 Feb 2;18:1344841. doi: 10.3389/fnins.2024.1344841. eCollection 2024.

Biomimetic Adaptive Pure Pursuit Control for Robot Path Tracking Inspired by Natural Motion Constraints.受自然运动约束启发的机器人路径跟踪仿生自适应纯追踪控制

Biomimetics (Basel). 2024 Jan 9;9(1):41. doi: 10.3390/biomimetics9010041.

Neuromusculoskeletal model-informed machine learning-based control of a knee exoskeleton with uncertainties quantification.基于神经肌肉骨骼模型的不确定性量化的膝关节外骨骼机器学习控制

Front Neurosci. 2023 Aug 30;17:1254088. doi: 10.3389/fnins.2023.1254088. eCollection 2023.

Surrounding-aware representation prediction in Birds-Eye-View using transformers.使用Transformer在鸟瞰视角下进行周围环境感知表示预测。

Front Neurosci. 2023 Jul 4;17:1219363. doi: 10.3389/fnins.2023.1219363. eCollection 2023.

Focus prediction of medical microscopic images based on Lightweight Densely Connected with Squeeze-and-Excitation Network.基于轻量级密集连接与挤压激励网络的医学显微图像焦点预测

Front Neurosci. 2023 Jun 29;17:1213176. doi: 10.3389/fnins.2023.1213176. eCollection 2023.

Constant Force-Tracking Control Based on Deep Reinforcement Learning in Dynamic Auscultation Environment.基于深度强化学习的动态听诊环境下的恒力跟踪控制。

Sensors (Basel). 2023 Feb 15;23(4):2186. doi: 10.3390/s23042186.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于自然进化策略的稳定非线性动力系统的机器人策略改进

Robot Policy Improvement With Natural Evolution Strategies for Stable Nonlinear Dynamical System.

作者信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献