Suppr超能文献

机器人辅助运动训练:在强化学习过程中,辅助会减少探索行为。

Robot-assisted motor training: assistance decreases exploration during reinforcement learning.

作者信息

Sans-Muntadas Albert, Duarte Jaime E, Reinkensmeyer David J

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2014;2014:3516-20. doi: 10.1109/EMBC.2014.6944381.

Abstract

Reinforcement learning (RL) is a form of motor learning that robotic therapy devices could potentially manipulate to promote neurorehabilitation. We developed a system that requires trainees to use RL to learn a predefined target movement. The system provides higher rewards for movements that are more similar to the target movement. We also developed a novel algorithm that rewards trainees of different abilities with comparable reward sizes. This algorithm measures a trainee's performance relative to their best performance, rather than relative to an absolute target performance, to determine reward. We hypothesized this algorithm would permit subjects who cannot normally achieve high reward levels to do so while still learning. In an experiment with 21 unimpaired human subjects, we found that all subjects quickly learned to make a first target movement with and without the reward equalization. However, artificially increasing reward decreased the subjects' tendency to engage in exploration and therefore slowed learning, particularly when we changed the target movement. An anti-slacking watchdog algorithm further slowed learning. These results suggest that robotic algorithms that assist trainees in achieving rewards or in preventing slacking might, over time, discourage the exploration needed for reinforcement learning.

摘要

强化学习(RL)是一种运动学习形式,机器人治疗设备有可能通过操纵它来促进神经康复。我们开发了一个系统,要求受训者使用强化学习来学习预定义的目标动作。该系统对于与目标动作更相似的动作给予更高的奖励。我们还开发了一种新颖的算法,以相当的奖励大小对不同能力的受训者进行奖励。该算法衡量受训者相对于其最佳表现的绩效,而非相对于绝对目标绩效来确定奖励。我们假设该算法将使通常无法获得高奖励水平的受试者能够做到这一点,同时仍能进行学习。在一项针对21名未受损人类受试者的实验中,我们发现,无论有无奖励均衡,所有受试者都能快速学会做出第一个目标动作。然而,人为增加奖励会降低受试者进行探索的倾向,从而减缓学习,尤其是当我们改变目标动作时。一种防懈怠监督算法进一步减缓了学习。这些结果表明,随着时间的推移,帮助受训者获得奖励或防止懈怠的机器人算法可能会抑制强化学习所需的探索。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验