Park Jongcheon, Han Seungyong, Lee S M
Cyber Physical Systems & Control Laboratory, School of Electronic and Electrical Engineering, Kyungpook National University, Daehak-ro 80, Republic of Korea.
ISA Trans. 2022 Oct;129(Pt B):684-690. doi: 10.1016/j.isatra.2022.02.041. Epub 2022 Mar 7.
In this paper, a new imitation learning algorithm, Restored Action Generative Adversarial Imitation Learning (RAGAIL), is proposed for learning from observation. An action policy is trained to move a robot manipulator in a manner similar to a demonstrator's behavior by using actions restored from state-only demonstrations. To imitate the demonstrator, a trajectory is generated by Recurrent Generative Adversarial Networks (RGAN), and the action is restored from the output of a tracking controller constructed from the current state and the generated target trajectory. The proposed imitation learning algorithm does not require access to the demonstrator's actions (internal control signals such as force/torque commands) and provides better learning performance. The effectiveness of the proposed method is validated through experiments on a robot manipulator.
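The action-restoration step described in the abstract can be illustrated with a minimal sketch: a tracking controller receives the current state and a generated target trajectory, and its control output serves as the "restored" action that replaces the unobserved demonstrator command. This is an assumption-laden illustration, not the authors' implementation; the PD control law, gains, and function names below are all hypothetical.

```python
import numpy as np

def restore_actions(states, target_traj, kp=5.0, kd=1.0, dt=0.01):
    """Hypothetical action restoration via a PD tracking controller.

    states, target_traj: arrays of shape (T, dim) holding joint positions
    from the state-only demonstration and the RGAN-generated target.
    Returns the control signal the tracking controller would emit at each
    step, used as the restored action for imitation learning.
    """
    states = np.asarray(states, dtype=float)
    target = np.asarray(target_traj, dtype=float)
    vel = np.gradient(states, dt, axis=0)          # finite-difference velocity
    target_vel = np.gradient(target, dt, axis=0)
    # PD law (illustrative): u = Kp * position error + Kd * velocity error
    return kp * (target - states) + kd * (target_vel - vel)

# Usage: if the state already tracks the target perfectly,
# the restored action is zero.
traj = np.linspace(0.0, 1.0, 50).reshape(-1, 1)
print(np.allclose(restore_actions(traj, traj), 0.0))  # True
```

The key idea this sketch captures is that no force/torque command from the demonstrator is ever read; the action label is synthesized entirely from observed states and the generated trajectory.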