Department of Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, Maryland, United States of America.
PLoS Comput Biol. 2011 Mar;7(3):e1002012. doi: 10.1371/journal.pcbi.1002012. Epub 2011 Mar 10.
Voluntary motor commands produce two kinds of consequences. Initially, a sensory consequence is observed in terms of activity in our primary sensory organs (e.g., vision, proprioception). Subsequently, the brain evaluates the sensory feedback and produces a subjective measure of utility or usefulness of the motor commands (e.g., reward). As a result, comparisons between predicted and observed consequences of motor commands produce two forms of prediction error. How do these errors contribute to changes in motor commands? Here, we considered a reach adaptation protocol and found that when high quality sensory feedback was available, adaptation of motor commands was driven almost exclusively by sensory prediction errors. This form of learning had a distinct signature: as motor commands adapted, the subjects altered their predictions regarding sensory consequences of motor commands, and generalized this learning broadly to neighboring motor commands. In contrast, as the quality of the sensory feedback degraded, adaptation of motor commands became more dependent on reward prediction errors. Reward prediction errors produced comparable changes in the motor commands, but produced no change in the predicted sensory consequences of motor commands, and generalized only locally. Because we found that there was a within subject correlation between generalization patterns and sensory remapping, it is plausible that during adaptation an individual's relative reliance on sensory vs. reward prediction errors could be inferred. We suggest that while motor commands change because of sensory and reward prediction errors, only sensory prediction errors produce a change in the neural system that predicts sensory consequences of motor commands.
自愿运动指令会产生两种后果。最初,我们主要的感觉器官(如视觉、本体感觉)会观察到运动的感觉后果。随后,大脑会评估感觉反馈,并对运动指令的效用或有用性产生主观衡量(例如奖励)。因此,运动指令的预测后果和观察后果之间的比较会产生两种形式的预测误差。这些错误如何导致运动指令的变化?在这里,我们考虑了一个适应协议,并发现当高质量的感觉反馈可用时,运动指令的适应几乎完全是由感觉预测错误驱动的。这种形式的学习有一个明显的特征:随着运动指令的适应,受试者改变了他们对运动指令感觉后果的预测,并将这种学习广泛推广到邻近的运动指令。相比之下,随着感觉反馈质量的下降,运动指令的适应越来越依赖于奖励预测错误。奖励预测错误会对运动指令产生类似的变化,但不会对运动指令的感觉后果产生预测变化,而且只能在局部推广。因为我们发现,在主体间的推广模式和感觉重新映射之间存在相关性,所以在适应过程中,个体对感觉和奖励预测错误的相对依赖程度可能会被推断出来。我们认为,虽然运动指令会因为感觉和奖励预测错误而发生变化,但只有感觉预测错误会改变预测运动指令感觉后果的神经系统。