
Conditional variational auto encoder based dynamic motion for multitask imitation learning.

Authors

Xu Binzhao, Ud Din Muhayy, Hussain Irfan

Affiliations

Khalifa University Center for Autonomous Robotic Systems (KUCARS), Khalifa University of Science and Technology, Abu Dhabi, UAE.

Publication

Sci Rep. 2025 Mar 17;15(1):9196. doi: 10.1038/s41598-025-93888-4.

DOI: 10.1038/s41598-025-93888-4
PMID: 40097597
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11914600/
Abstract

Dynamic motion primitive (DMP) based methods are effective for learning from demonstrations. However, most current DMP-based methods learn only one task per module. Although some deep learning-based frameworks can learn multiple tasks simultaneously, they require large amounts of training data and generalize poorly to untrained states. In this paper, we propose a framework that combines the advantages of the traditional DMP-based method and the conditional variational auto-encoder (cVAE). The encoder and decoder comprise a dynamic system and a deep neural network. Instead of generating a trajectory directly, the deep neural network generates a torque conditioned on the task parameters; this torque then drives the dynamic system to produce the desired trajectory toward the final state. In this way, the generated trajectory can adapt to a new goal position, as in DMP. We also propose a fine-tuning method to guarantee via-point constraints. Our model is trained and tested on a handwritten digit dataset and on robotic manipulation tasks such as pushing, reaching, and grasping, and is further validated in a real robotic environment with a UR10 manipulator. Compared to traditional, data-demanding deep learning-based methods, our method achieves a 100% success rate on the reaching task and a 93.33% success rate on the pushing and grasping tasks, with only one demonstration provided per task.
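The torque-then-integrate idea in the abstract can be sketched as follows. This is an illustrative reconstruction, not the authors' code: `decoder_torque` is a stand-in for the trained cVAE decoder (here just a fixed random projection), and the critically damped spring-damper system plays the role of the DMP transformation system, whose attractor at the goal makes the trajectory adapt to new goal positions regardless of the decoded torque.

```python
import numpy as np

def decoder_torque(z, n_steps):
    """Stand-in for the trained cVAE decoder: maps a task code z to a
    torque (forcing) profile. A fixed random projection is used here
    purely for illustration; in the paper this is a neural network."""
    basis = np.random.default_rng(0).standard_normal((n_steps, z.size))
    return basis @ z  # shape (n_steps,)

def rollout(z, y0, g, n_steps=200, dt=0.01, alpha=25.0):
    """Euler-integrate a critically damped spring-damper system driven
    by the decoded torque; the attractor at the goal g guarantees
    convergence toward g, as in DMP."""
    beta = alpha / 4.0  # critical damping
    f = decoder_torque(z, n_steps)
    y, dy, traj = y0, 0.0, []
    for t in range(n_steps):
        ddy = alpha * (beta * (g - y) - dy) + f[t]
        dy += ddy * dt
        y += dy * dt
        traj.append(y)
    return np.array(traj)

# With a zero task code the forcing vanishes and the system settles at g.
traj = rollout(z=np.zeros(4), y0=0.0, g=1.0)
```

Changing `g` moves the whole trajectory's endpoint without retraining the decoder, which is the goal-adaptation property the abstract contrasts with direct trajectory-generating networks.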


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/61ba3e445377/41598_2025_93888_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/bf0fee730b51/41598_2025_93888_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/78801343da5c/41598_2025_93888_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/2bc5ec0e31c8/41598_2025_93888_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/d7bb5ef1dd41/41598_2025_93888_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/ca09401c1ed0/41598_2025_93888_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/d25d8aa11efe/41598_2025_93888_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/656d9607e9bd/41598_2025_93888_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/f495e3253f0a/41598_2025_93888_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/ed061d0876c6/41598_2025_93888_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/21e905b3764a/41598_2025_93888_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/c807d61e3edf/41598_2025_93888_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/b859a01bf684/41598_2025_93888_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/11061a97e1fa/41598_2025_93888_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/7e4ee39b2c59/41598_2025_93888_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/e23da29a3481/41598_2025_93888_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/4baceeeb5d1e/41598_2025_93888_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/a304658641bc/41598_2025_93888_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea54/11914600/1161f710189a/41598_2025_93888_Fig19_HTML.jpg

Similar articles

1. Conditional variational auto encoder based dynamic motion for multitask imitation learning.
Sci Rep. 2025 Mar 17;15(1):9196. doi: 10.1038/s41598-025-93888-4.
2. Posterior estimation using deep learning: a simulation study of compartmental modeling in dynamic positron emission tomography.
Med Phys. 2023 Mar;50(3):1539-1548. doi: 10.1002/mp.16078. Epub 2022 Nov 18.
3. Posterior Estimation Using Deep Learning: A Simulation Study of Compartmental Modeling in Dynamic PET.
ArXiv. 2023 Mar 17:arXiv:2303.10057v1.
4. Peg-in-hole assembly skill imitation learning method based on ProMPs under task geometric representation.
Front Neurorobot. 2023 Nov 9;17:1320251. doi: 10.3389/fnbot.2023.1320251. eCollection 2023.
5. Supervised Multi-Layer Conditional Variational Auto-Encoder for Process Modeling and Soft Sensor.
Sensors (Basel). 2023 Nov 14;23(22):9175. doi: 10.3390/s23229175.
6. Modeling and augmenting of fMRI data using deep recurrent variational auto-encoder.
J Neural Eng. 2021 Jul 23;18(4). doi: 10.1088/1741-2552/ac1179.
7. A trajectory outlier detection method based on variational auto-encoder.
Math Biosci Eng. 2023 Jul 14;20(8):15075-15093. doi: 10.3934/mbe.2023675.
8. Imitation and mirror systems in robots through Deep Modality Blending Networks.
Neural Netw. 2022 Feb;146:22-35. doi: 10.1016/j.neunet.2021.11.004. Epub 2021 Nov 16.
9. Deep imitation learning for 3D navigation tasks.
Neural Comput Appl. 2018;29(7):389-404. doi: 10.1007/s00521-017-3241-z. Epub 2017 Dec 4.
10. Behavior policy learning: Learning multi-stage tasks solution sketches and model-based controllers.
Front Robot AI. 2022 Oct 12;9:974537. doi: 10.3389/frobt.2022.974537. eCollection 2022.
