使用强化学习优化人机辅助交互。

Optimized Assistive Human-Robot Interaction Using Reinforcement Learning.

作者信息

Modares Hamidreza, Ranatunga Isura, Lewis Frank L, Popa Dan O

出版信息

IEEE Trans Cybern. 2016 Mar;46(3):655-67. doi: 10.1109/TCYB.2015.2412554. Epub 2015 Mar 24.

DOI:10.1109/TCYB.2015.2412554

Abstract

An intelligent human-robot interaction (HRI) system with adjustable robot behavior is presented. The proposed HRI system assists the human operator to perform a given task with minimum workload demands and optimizes the overall human-robot system performance. Motivated by human factor studies, the presented control structure consists of two control loops. First, a robot-specific neuro-adaptive controller is designed in the inner loop to make the unknown nonlinear robot behave like a prescribed robot impedance model as perceived by a human operator. In contrast to existing neural network and adaptive impedance-based control methods, no information of the task performance or the prescribed robot impedance model parameters is required in the inner loop. Then, a task-specific outer-loop controller is designed to find the optimal parameters of the prescribed robot impedance model to adjust the robot's dynamics to the operator skills and minimize the tracking error. The outer loop includes the human operator, the robot, and the task performance details. The problem of finding the optimal parameters of the prescribed robot impedance model is transformed into a linear quadratic regulator (LQR) problem which minimizes the human effort and optimizes the closed-loop behavior of the HRI system for a given task. To obviate the requirement of the knowledge of the human model, integral reinforcement learning is used to solve the given LQR problem. Simulation results on an x - y table and a robot arm, and experimental implementation results on a PR2 robot confirm the suitability of the proposed method.

摘要

提出了一种具有可调节机器人行为的智能人机交互（HRI）系统。所提出的HRI系统帮助人类操作员以最小的工作量需求执行给定任务，并优化整体人机系统性能。受人为因素研究的启发，所提出的控制结构由两个控制回路组成。首先，在内环设计了一个特定于机器人的神经自适应控制器，以使未知的非线性机器人表现得像人类操作员所感知的规定机器人阻抗模型。与现有的基于神经网络和自适应阻抗的控制方法相比，内环不需要任务性能或规定机器人阻抗模型参数的信息。然后，设计了一个特定于任务的外环控制器，以找到规定机器人阻抗模型的最优参数，从而将机器人的动力学调整到操作员的技能水平，并最小化跟踪误差。外环包括人类操作员、机器人和任务性能细节。将寻找规定机器人阻抗模型最优参数的问题转化为一个线性二次调节器（LQR）问题，该问题可在给定任务中最小化人力并优化HRI系统的闭环行为。为了避免对人类模型知识的需求，使用积分强化学习来解决给定的LQR问题。在xy工作台和机器人手臂上的仿真结果以及在PR2机器人上的实验实现结果证实了所提方法的适用性。

相似文献

Optimized Assistive Human-Robot Interaction Using Reinforcement Learning.

IEEE Trans Cybern. 2016 Mar;46(3):655-67. doi: 10.1109/TCYB.2015.2412554. Epub 2015 Mar 24.

Codevelopmental learning between human and humanoid robot using a dynamic neural-network model.

IEEE Trans Syst Man Cybern B Cybern. 2008 Feb;38(1):43-59. doi: 10.1109/TSMCB.2007.907738.

An Integrated Framework for Human-Robot Collaborative Manipulation.

IEEE Trans Cybern. 2015 Oct;45(10):2030-41. doi: 10.1109/TCYB.2014.2363664. Epub 2014 Oct 31.

A Human-Robot Co-Manipulation Approach Based on Human Sensorimotor Information.

IEEE Trans Neural Syst Rehabil Eng. 2017 Jul;25(7):811-822. doi: 10.1109/TNSRE.2017.2694553. Epub 2017 Apr 17.

Promoting Interactions Between Humans and Robots Using Robotic Emotional Behavior.

IEEE Trans Cybern. 2016 Dec;46(12):2911-2923. doi: 10.1109/TCYB.2015.2492999. Epub 2015 Nov 2.

On learning, representing, and generalizing a task in a humanoid robot.

IEEE Trans Syst Man Cybern B Cybern. 2007 Apr;37(2):286-98. doi: 10.1109/tsmcb.2006.886952.

Impedance learning for robotic contact tasks using natural actor-critic algorithm.

IEEE Trans Syst Man Cybern B Cybern. 2010 Apr;40(2):433-43. doi: 10.1109/TSMCB.2009.2026289. Epub 2009 Aug 18.

Oscillators and crank turning: exploiting natural dynamics with a humanoid robot arm.

Philos Trans A Math Phys Eng Sci. 2003 Oct 15;361(1811):2207-23. doi: 10.1098/rsta.2003.1272.

Research on Robot Fuzzy Neural Network Motion System Based on Artificial Intelligence.

Comput Intell Neurosci. 2022 Feb 9;2022:4347772. doi: 10.1155/2022/4347772. eCollection 2022.

Adaptive fuzzy neural network control design via a T-S fuzzy model for a robot manipulator including actuator dynamics.

IEEE Trans Syst Man Cybern B Cybern. 2008 Oct;38(5):1326-46. doi: 10.1109/TSMCB.2008.925749.

引用本文的文献

Human-like Dexterous Grasping Through Reinforcement Learning and Multimodal Perception.

Biomimetics (Basel). 2025 Mar 18;10(3):186. doi: 10.3390/biomimetics10030186.

Human-in-the-Loop Modeling and Bilateral Skill Transfer Control of Soft Exoskeleton.

Sensors (Basel). 2024 Dec 8;24(23):7845. doi: 10.3390/s24237845.

A human-centered safe robot reinforcement learning framework with interactive behaviors.

Front Neurorobot. 2023 Nov 9;17:1280341. doi: 10.3389/fnbot.2023.1280341. eCollection 2023.

A Self-Coordinating Controller with Balance-Guiding Ability for Lower-Limb Rehabilitation Exoskeleton Robot.

Sensors (Basel). 2023 Jun 3;23(11):5311. doi: 10.3390/s23115311.

Continuous mode adaptation for cable-driven rehabilitation robot using reinforcement learning.

Front Neurorobot. 2022 Dec 22;16:1068706. doi: 10.3389/fnbot.2022.1068706. eCollection 2022.

Finite-Time Interactive Control of Robots with Multiple Interaction Modes.

Sensors (Basel). 2022 May 11;22(10):3668. doi: 10.3390/s22103668.

Configuration-Dependent Optimal Impedance Control of an Upper Extremity Stroke Rehabilitation Manipulandum.

Front Robot AI. 2018 Nov 1;5:124. doi: 10.3389/frobt.2018.00124. eCollection 2018.

Variable Admittance Control Based on Fuzzy Reinforcement Learning for Minimally Invasive Surgery Manipulator.

Sensors (Basel). 2017 Apr 12;17(4):844. doi: 10.3390/s17040844.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用强化学习优化人机辅助交互。

Optimized Assistive Human-Robot Interaction Using Reinforcement Learning.

作者信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献