Huang Vincent S, Shadmehr Reza, Diedrichsen Jörn
Laboratory for Computational Motor Control, Department of Biomedical Engineering, John Hopkins School of Medicine, Baltimore, Maryland, USA.
J Neurophysiol. 2008 Aug;100(2):879-87. doi: 10.1152/jn.01095.2007. Epub 2008 May 28.
When we learn a new skill (e.g., golf) without a coach, we are "active learners": we have to choose the specific components of the task on which to train (e.g., iron, driver, putter, etc.). What guides our selection of the training sequence? How do choices that people make compare with choices made by machine learning algorithms that attempt to optimize performance? We asked subjects to learn the novel dynamics of a robotic tool while moving it in four directions. They were instructed to choose their practice directions to maximize their performance in subsequent tests. We found that their choices were strongly influenced by motor errors: subjects tended to immediately repeat an action if that action had produced a large error. This strategy was correlated with better performance on test trials. However, even when participants performed perfectly on a movement, they did not avoid repeating that movement. The probability of repeating an action did not drop below chance even when no errors were observed. This behavior led to suboptimal performance. It also violated a strong prediction of current machine learning algorithms, which solve the active learning problem by choosing a training sequence that will maximally reduce the learner's uncertainty about the task. While we show that these algorithms do not provide an adequate description of human behavior, our results suggest ways to improve human motor learning by helping people choose an optimal training sequence.
当我们在没有教练的情况下学习一项新技能(例如高尔夫)时,我们就是“主动学习者”:我们必须选择要训练的任务的具体组成部分(例如铁杆、发球杆、推杆等)。是什么指导我们选择训练顺序?人们做出的选择与试图优化性能的机器学习算法所做的选择相比如何?我们要求受试者在向四个方向移动机器人工具时学习其新的动力学。他们被指示选择练习方向以在后续测试中最大化他们的表现。我们发现他们的选择受到运动误差的强烈影响:如果某个动作产生了较大误差,受试者倾向于立即重复该动作。这种策略与测试试验中的更好表现相关。然而,即使参与者在一次动作中表现完美,他们也不会避免重复该动作。即使没有观察到错误,重复一个动作的概率也不会低于随机水平。这种行为导致了次优表现。它还违背了当前机器学习算法的一个有力预测,即通过选择一个将最大程度降低学习者对任务不确定性的训练序列来解决主动学习问题。虽然我们表明这些算法不能充分描述人类行为,但我们的结果提出了通过帮助人们选择最佳训练序列来改善人类运动学习的方法。