基于质量-多样性的强化学习半自动遥操作。

Quality-diversity based semi-autonomous teleoperation using reinforcement learning.

机构信息

Department of Artificial Intelligence, Korea University, Seoul, 02841, South Korea.

出版信息

Neural Netw. 2024 Nov;179:106543. doi: 10.1016/j.neunet.2024.106543. Epub 2024 Jul 22.

DOI:10.1016/j.neunet.2024.106543

Abstract

Recent successes in robot learning have significantly enhanced autonomous systems across a wide range of tasks. However, they are prone to generate similar or the same solutions, limiting the controllability of the robot to behave according to user intentions. These limited robot behaviors may lead to collisions and potential harm to humans. To resolve these limitations, we introduce a semi-autonomous teleoperation framework that enables users to operate a robot by selecting a high-level command, referred to as option. Our approach aims to provide effective and diverse options by a learned policy, thereby enhancing the efficiency of the proposed framework. In this work, we propose a quality-diversity (QD) based sampling method that simultaneously optimizes both the quality and diversity of options using reinforcement learning (RL). Additionally, we present a mixture of latent variable models to learn multiple policy distributions defined as options. In experiments, we show that the proposed method achieves superior performance in terms of the success rate and diversity of the options in simulation environments. We further demonstrate that our method outperforms manual keyboard control for time duration over cluttered real-world environments.

摘要

最近机器人学习方面的成功显著增强了自主系统在广泛任务中的能力。然而，它们容易生成相似或相同的解决方案，限制了机器人根据用户意图进行可控行为的能力。这些有限的机器人行为可能导致碰撞和对人类的潜在伤害。为了解决这些限制，我们引入了一种半自主遥操作框架，使用户能够通过选择高级命令（称为选项）来操作机器人。我们的方法旨在通过学习策略提供有效且多样化的选项，从而提高所提出框架的效率。在这项工作中，我们提出了一种基于质量多样性 (QD) 的采样方法，该方法使用强化学习 (RL) 同时优化选项的质量和多样性。此外，我们提出了一种混合潜在变量模型来学习多个策略分布，这些策略分布被定义为选项。在实验中，我们表明，所提出的方法在模拟环境中的选项成功率和多样性方面表现出优越的性能。我们进一步证明，我们的方法在杂乱的现实环境中，在耗时方面优于手动键盘控制。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

基于质量-多样性的强化学习半自动遥操作。

Quality-diversity based semi-autonomous teleoperation using reinforcement learning.

机构信息

出版信息

相似文献

引用本文的文献

基于质量-多样性的强化学习半自动遥操作。

Quality-diversity based semi-autonomous teleoperation using reinforcement learning.

机构信息

出版信息

相似文献

引用本文的文献