Meindl Michael, Lehmann Dustin, Seel Thomas
Embedded Mechatronics Laboratory, Hochschule Karlsruhe, Karlsruhe, Germany.
Department Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany.
Front Robot AI. 2022 Jul 12;9:793512. doi: 10.3389/frobt.2022.793512. eCollection 2022.
This work addresses the problem of reference tracking in autonomously learning robots with unknown, nonlinear dynamics. Existing solutions require model information or extensive parameter tuning, and have rarely been validated in real-world experiments. We propose a learning control scheme that learns to approximate the unknown dynamics by a Gaussian Process (GP), which is used to optimize and apply a feedforward control input on each trial. Unlike existing approaches, the proposed method neither requires knowledge of the system states and their dynamics nor knowledge of an effective feedback control structure. All algorithm parameters are chosen automatically, i.e. the learning method works plug and play. The proposed method is validated in extensive simulations and real-world experiments. In contrast to most existing work, we study learning dynamics for more than one motion task as well as the robustness of performance across a large range of learning parameters. The method's plug and play applicability is demonstrated by experiments with a balancing robot, in which the proposed method rapidly learns to track the desired output. Due to its model-agnostic and plug and play properties, the proposed method is expected to have high potential for application to a large class of reference tracking problems in systems with unknown, nonlinear dynamics.
这项工作解决了具有未知非线性动力学的自主学习机器人中的参考跟踪问题。现有解决方案需要模型信息或大量参数调整,并且很少在实际实验中得到验证。我们提出了一种学习控制方案,该方案通过高斯过程(GP)学习来逼近未知动力学,该高斯过程用于在每次试验中优化并应用前馈控制输入。与现有方法不同,所提出的方法既不需要系统状态及其动力学的知识,也不需要有效的反馈控制结构的知识。所有算法参数都是自动选择的,即该学习方法可以即插即用。所提出的方法在广泛的模拟和实际实验中得到了验证。与大多数现有工作不同,我们研究了多种运动任务的学习动力学以及在大范围学习参数下性能的鲁棒性。通过对平衡机器人的实验证明了该方法的即插即用适用性,在该实验中,所提出的方法能够快速学习跟踪期望输出。由于其模型无关和即插即用的特性,预计所提出的方法在应用于具有未知非线性动力学的系统中的一大类参考跟踪问题时具有很高的潜力。