Cuan Catie, Okamura Allison, Khansari Mohi
IEEE Trans Haptics. 2024 Oct-Dec;17(4):984-991. doi: 10.1109/TOH.2024.3384482. Epub 2024 Dec 19.
Learning from demonstration is a proven technique to teach robots new skills. Data quality and quantity play a critical role in the performance of models trained using data collected from human demonstrations. In this paper we enhance an existing teleoperation data collection system with real-time haptic feedback to the human demonstrators; we observe improvements in the collected data throughput and in the performance of autonomous policies using models trained with the data. Our experimental testbed was a mobile manipulator robot that opened doors with latch handles. Evaluation of teleoperated data collection on eight real conference room doors found that adding haptic feedback improved data throughput by 6%. We additionally used the collected data to train six image-based deep imitation learning models, three with haptic feedback and three without it. These models were used to implement autonomous door-opening with the same type of robot used during data collection. A policy from a imitation learning model trained with data collected while the human demonstrators received haptic feedback performed on average 11% better than its counterpart trained with data collected without haptic feedback, indicating that haptic feedback provided during data collection resulted in improved autonomous policies.
从示范中学习是一种经证实的向机器人传授新技能的技术。数据质量和数量在使用从人类示范中收集的数据训练的模型性能中起着关键作用。在本文中,我们增强了现有的遥操作数据收集系统,为人类示范者提供实时触觉反馈;我们观察到收集到的数据吞吐量以及使用这些数据训练的自主策略的性能都有所提高。我们的实验测试平台是一个带有闩锁把手的移动操纵机器人。在八个真实会议室门上进行的遥操作数据收集评估发现,添加触觉反馈使数据吞吐量提高了6%。我们还使用收集到的数据训练了六个基于图像的深度模仿学习模型,三个带有触觉反馈,三个没有。这些模型用于使用数据收集期间使用的同类型机器人实现自主开门。与没有触觉反馈时收集的数据训练的模型相比,使用人类示范者收到触觉反馈时收集的数据训练的模仿学习模型的策略平均表现要好11%,这表明数据收集期间提供的触觉反馈带来了更好的自主策略。