机器人辅助运动训练：在强化学习过程中，辅助会减少探索行为。

Robot-assisted motor training: assistance decreases exploration during reinforcement learning.

作者信息

Sans-Muntadas Albert, Duarte Jaime E, Reinkensmeyer David J

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2014;2014:3516-20. doi: 10.1109/EMBC.2014.6944381.

DOI:10.1109/EMBC.2014.6944381

Abstract

Reinforcement learning (RL) is a form of motor learning that robotic therapy devices could potentially manipulate to promote neurorehabilitation. We developed a system that requires trainees to use RL to learn a predefined target movement. The system provides higher rewards for movements that are more similar to the target movement. We also developed a novel algorithm that rewards trainees of different abilities with comparable reward sizes. This algorithm measures a trainee's performance relative to their best performance, rather than relative to an absolute target performance, to determine reward. We hypothesized this algorithm would permit subjects who cannot normally achieve high reward levels to do so while still learning. In an experiment with 21 unimpaired human subjects, we found that all subjects quickly learned to make a first target movement with and without the reward equalization. However, artificially increasing reward decreased the subjects' tendency to engage in exploration and therefore slowed learning, particularly when we changed the target movement. An anti-slacking watchdog algorithm further slowed learning. These results suggest that robotic algorithms that assist trainees in achieving rewards or in preventing slacking might, over time, discourage the exploration needed for reinforcement learning.

摘要

强化学习（RL）是一种运动学习形式，机器人治疗设备有可能通过操纵它来促进神经康复。我们开发了一个系统，要求受训者使用强化学习来学习预定义的目标动作。该系统对于与目标动作更相似的动作给予更高的奖励。我们还开发了一种新颖的算法，以相当的奖励大小对不同能力的受训者进行奖励。该算法衡量受训者相对于其最佳表现的绩效，而非相对于绝对目标绩效来确定奖励。我们假设该算法将使通常无法获得高奖励水平的受试者能够做到这一点，同时仍能进行学习。在一项针对21名未受损人类受试者的实验中，我们发现，无论有无奖励均衡，所有受试者都能快速学会做出第一个目标动作。然而，人为增加奖励会降低受试者进行探索的倾向，从而减缓学习，尤其是当我们改变目标动作时。一种防懈怠监督算法进一步减缓了学习。这些结果表明，随着时间的推移，帮助受训者获得奖励或防止懈怠的机器人算法可能会抑制强化学习所需的探索。

相似文献

Robot-assisted motor training: assistance decreases exploration during reinforcement learning.机器人辅助运动训练：在强化学习过程中，辅助会减少探索行为。

Annu Int Conf IEEE Eng Med Biol Soc. 2014;2014:3516-20. doi: 10.1109/EMBC.2014.6944381.

Human-robot cooperative movement training: learning a novel sensory motor transformation during walking with robotic assistance-as-needed.人机协作运动训练：在按需机器人辅助行走过程中学习一种新的感觉运动转换。

J Neuroeng Rehabil. 2007 Mar 28;4:8. doi: 10.1186/1743-0003-4-8.

Training an Actor-Critic Reinforcement Learning Controller for Arm Movement Using Human-Generated Rewards.使用人类生成的奖励训练用于手臂运动的 Actor-Critic 强化学习控制器。

IEEE Trans Neural Syst Rehabil Eng. 2017 Oct;25(10):1892-1905. doi: 10.1109/TNSRE.2017.2700395. Epub 2017 May 2.

Clustering analysis of movement kinematics in reinforcement learning.强化学习中运动运动学的聚类分析。

J Neurophysiol. 2022 Feb 1;127(2):341-353. doi: 10.1152/jn.00229.2021. Epub 2021 Dec 22.

MOSAIC for multiple-reward environments.多奖励环境下的 MOSAIC 算法。

Neural Comput. 2012 Mar;24(3):577-606. doi: 10.1162/NECO_a_00246. Epub 2011 Dec 14.

Efficient exploration through active learning for value function approximation in reinforcement learning.强化学习中基于主动学习的价值函数逼近的有效探索。

Neural Netw. 2010 Jun;23(5):639-48. doi: 10.1016/j.neunet.2009.12.010. Epub 2010 Jan 11.

A reinforcement learning algorithm acquires demonstration from the training agent by dividing the task space.强化学习算法通过划分任务空间从训练代理那里获取演示。

Neural Netw. 2023 Jul;164:419-427. doi: 10.1016/j.neunet.2023.04.042. Epub 2023 May 5.

LJIR: Learning Joint-Action Intrinsic Reward in cooperative multi-agent reinforcement learning.LJIR：在合作多智能体强化学习中学习联合行动内在奖励

Neural Netw. 2023 Oct;167:450-459. doi: 10.1016/j.neunet.2023.08.016. Epub 2023 Aug 22.

Reinforcement Learning based Decoding Using Internal Reward for Time Delayed Task in Brain Machine Interfaces.基于强化学习的解码：利用内部奖励实现脑机接口中的时延任务

Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:3351-3354. doi: 10.1109/EMBC44109.2020.9175964.

Energy-efficient and damage-recovery slithering gait design for a snake-like robot based on reinforcement learning and inverse reinforcement learning.基于强化学习和逆强化学习的蛇形机器人节能与损伤恢复蠕动步态设计。

Neural Netw. 2020 Sep;129:323-333. doi: 10.1016/j.neunet.2020.05.029. Epub 2020 Jun 16.

引用本文的文献

Self-powered robots to reduce motor slacking during upper-extremity rehabilitation: a proof of concept study.用于减少上肢康复期间肌肉松弛的自供电机器人：一项概念验证研究。

Restor Neurol Neurosci. 2018;36(6):693-708. doi: 10.3233/RNN-180830.

Spatial diversity of spontaneous activity in the cortex.皮质中自发活动的空间多样性。

Front Neural Circuits. 2015 Sep 24;9:48. doi: 10.3389/fncir.2015.00048. eCollection 2015.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

机器人辅助运动训练：在强化学习过程中，辅助会减少探索行为。

Robot-assisted motor training: assistance decreases exploration during reinforcement learning.

作者信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献