Akkaladevi Sharath Chandra, Plasch Matthias, Maddukuri Sriniwas, Eitzinger Christian, Pichler Andreas, Rinner Bernhard
Profactor GmbH, Steyr-Gleink, Steyr, Austria.
Institute of Networked and Embedded Systems, Alpen-Adria-Universität Klagenfurt, Klagenfurt, Austria.
Front Robot AI. 2018 Nov 22;5:126. doi: 10.3389/frobt.2018.00126. eCollection 2018.
As manufacturing demographics change from mass production to mass customization, advances in human-robot interaction in industries have taken many forms. However, the topic of reducing the programming effort required by an expert using natural modes of communication is still open. To answer this challenge, we propose an approach based on Interactive Reinforcement Learning that learns a complete collaborative assembly process. The learning approach is done in two steps. First step consists of modeling simple tasks that compose the assembly process, using task based formalism. The robotic system then uses these modeled simple tasks and proposes to the user a set of possible actions at each step of the assembly process via a GUI. The user then "interacts" with the robotic system by selecting an option from the given choice. The robot records the action chosen and performs it, progressing the assembly process. Thereby, the user teaches the system which task to perform when. In order to reduce the number of actions proposed, the system considers additional information such as user and robot capabilities and object affordances. These set of action proposals are further reduced by modeling the proposed actions into a goal based hierarchy and by including action prerequisites. The learning framework highlights its ability to learn a complicated human robot collaborative assembly process in a user intuitive fashion. The framework also allows different users to teach different assembly processes to the robot.
随着制造业人口结构从大规模生产向大规模定制转变,工业中人机交互的进步呈现出多种形式。然而,减少专家使用自然通信模式所需编程工作量这一话题仍未解决。为应对这一挑战,我们提出一种基于交互式强化学习的方法,该方法可学习完整的协作装配过程。学习方法分两步进行。第一步包括使用基于任务的形式化方法对构成装配过程的简单任务进行建模。然后,机器人系统使用这些建模的简单任务,并通过图形用户界面(GUI)在装配过程的每个步骤向用户提出一组可能的操作。用户随后通过从给定选项中选择来与机器人系统“交互”。机器人记录所选操作并执行它,从而推进装配过程。由此,用户教导系统何时执行哪个任务。为了减少提出的操作数量,系统会考虑诸如用户和机器人能力以及物体可供性等附加信息。通过将提议的操作建模为基于目标的层次结构并纳入操作先决条件,这组操作提议进一步减少。该学习框架突出了其以用户直观的方式学习复杂人机协作装配过程的能力。该框架还允许不同用户向机器人教导不同的装配过程。