Bridging Reinforcement Learning and Iterative Learning Control: Autonomous Motion Learning for Unknown, Nonlinear Dynamics

Authors

Michael Meindl, Dustin Lehmann, Thomas Seel

Affiliations

Embedded Mechatronics Laboratory, Hochschule Karlsruhe, Karlsruhe, Germany.

Department Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany.

Publication information

Front Robot AI. 2022 Jul 12;9:793512. doi: 10.3389/frobt.2022.793512. eCollection 2022.

DOI:10.3389/frobt.2022.793512

PMID:35903721

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9315427/

Abstract

This work addresses the problem of reference tracking in autonomously learning robots with unknown, nonlinear dynamics. Existing solutions require model information or extensive parameter tuning, and have rarely been validated in real-world experiments. We propose a learning control scheme that approximates the unknown dynamics by a Gaussian Process (GP), which is then used on each trial to optimize and apply a feedforward control input. Unlike existing approaches, the proposed method requires neither knowledge of the system states and their dynamics nor knowledge of an effective feedback control structure. All algorithm parameters are chosen automatically, that is, the learning method works in a plug-and-play fashion. The proposed method is validated in extensive simulations and real-world experiments. In contrast to most existing work, we study learning dynamics for more than one motion task as well as the robustness of performance across a large range of learning parameters. The method's plug-and-play applicability is demonstrated by experiments with a balancing robot, in which the proposed method rapidly learns to track the desired output. Due to its model-agnostic and plug-and-play properties, the proposed method is expected to have high potential for application to a large class of reference tracking problems in systems with unknown, nonlinear dynamics.
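The trial-by-trial loop described in the abstract (fit a GP to measured input/output data, then use it to choose the next feedforward input) can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not say how the input is optimized or how parameters are chosen automatically. Everything below is an assumption made for illustration, including the toy `plant`, the fixed kernel length scale and noise level, the sinusoidal excitation trial, and the one-step grid search in `plan_input`.

```python
import math

def rbf(a, b, ell=1.0):
    """Squared-exponential kernel; the fixed length scale is an assumption."""
    return math.exp(-0.5 * sum((p - q) ** 2 for p, q in zip(a, b)) / ell ** 2)

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (M[r][n] - sum(M[r][k] * x[k] for k in range(r + 1, n))) / M[r][r]
    return x

class GPModel:
    """Zero-mean GP regression; only the posterior mean is used here."""
    def __init__(self, X, y, noise=1e-4):
        self.X = list(X)
        K = [[rbf(a, b) + (noise if i == j else 0.0)
              for j, b in enumerate(self.X)] for i, a in enumerate(self.X)]
        self.alpha = solve(K, list(y))

    def mean(self, x):
        return sum(w * rbf(x, xi) for w, xi in zip(self.alpha, self.X))

def plant(y, u):
    """Hypothetical stand-in for the unknown dynamics; hidden from the learner."""
    return 0.7 * y + math.tanh(u)

def run_trial(u_traj):
    """Apply a feedforward input open loop and return the measured outputs."""
    ys = [0.0]
    for u in u_traj:
        ys.append(plant(ys[-1], u))
    return ys

def plan_input(gp, ref, candidates):
    """Grid-search each u_k so the GP's one-step prediction matches ref[k + 1]."""
    y_hat, u_traj = 0.0, []
    for k in range(len(ref) - 1):
        u = min(candidates, key=lambda c: abs(gp.mean((y_hat, c)) - ref[k + 1]))
        u_traj.append(u)
        y_hat = gp.mean((y_hat, u))  # roll the model forward, not the plant
    return u_traj

def learn(ref, n_trials=5):
    """Alternate between running a trial, refitting the GP, and replanning."""
    candidates = [-1.0 + 0.05 * i for i in range(41)]
    u_traj = [math.sin(0.9 * k) for k in range(len(ref) - 1)]  # excitation trial
    X, Y, errors = [], [], []
    for _ in range(n_trials):
        ys = run_trial(u_traj)
        errors.append(math.sqrt(sum((y - r) ** 2 for y, r in zip(ys, ref)) / len(ref)))
        X += [(ys[k], u_traj[k]) for k in range(len(u_traj))]
        Y += [ys[k + 1] for k in range(len(u_traj))]
        gp = GPModel(X, Y)
        u_traj = plan_input(gp, ref, candidates)
    return errors

ref = [0.8 * math.sin(2 * math.pi * k / 15) for k in range(16)]
errors = learn(ref)
print([round(e, 3) for e in errors])  # RMS tracking error per trial
```

The first entry of `errors` belongs to the excitation trial, which does not attempt to track the reference. One would hope the later entries shrink as the GP sees more data, but this sketch carries no convergence guarantee, and its hyperparameters are fixed by hand rather than chosen automatically as the paper describes.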
