School of Kinesiology and Health, Capital University of Physical Education and Sports, Beijing, People's Republic of China.
J Neurophysiol. 2023 Nov 1;130(5):1150-1161. doi: 10.1152/jn.00242.2023. Epub 2023 Oct 4.
Reward and punishment have long been recognized as potent modulators of human behavior. Although reinforcement learning is a significant motor learning process, the exact mechanisms underlying how the brain learns movements through reward and punishment are not yet fully understood. Beyond the memory of specific examples, investigating the ability to generalize to new situations offers a better understanding of motor learning. This study hypothesizes that reward and punishment engage qualitatively different motivational systems with different neurochemical and neuroanatomical substrates, which would have differential effects on reinforcement-based motor learning and generalization. To test this hypothesis, two groups of participants learn a motor task in one direction and then relearn the same task in a new direction, receiving only performance-based reward or punishment score feedback. Our findings support our hypothesis, showing that reward led to slower learning but promoted generalization. On the other hand, punishment led to faster learning but impaired generalization. These behavioral differences may be due to different tendencies of movement variability in each group. The punishment group tended to explore more actively than the reward group during the initial learning phase, possibly due to loss aversion. In contrast, the reward group tended to explore more actively than the initial learning phase during the generalization test phase, seemingly recalling the strategy that led to the reward. These results suggest that reward and punishment may engage different neural mechanisms during reinforcement-based motor learning and generalization, with important implications for practical applications such as sports training and motor rehabilitation. Although reinforcement learning is a significant motor learning process, the mechanisms underlying how the brain learns movements through reward and punishment are not fully understood. We modified a well-established motor adaptation task and used savings (faster relearning) to measure generalization. We found reward led to slower learning but promoted generalization, whereas punishment led to faster learning but impaired generalization, suggesting that reward and punishment may engage different neural mechanisms during reinforcement-based motor learning and generalization.
奖惩长期以来一直被认为是人类行为的有力调节因素。虽然强化学习是一个重要的运动学习过程,但大脑通过奖惩学习运动的具体机制仍不完全清楚。除了对特定示例的记忆之外,研究对新情况进行概括的能力可以更好地理解运动学习。本研究假设奖励和惩罚涉及具有不同神经化学和神经解剖学基础的不同动机系统,这将对基于强化的运动学习和概括产生不同的影响。为了检验这一假设,两组参与者在一个方向上学习一个运动任务,然后在一个新的方向上重新学习相同的任务,只收到基于绩效的奖励或惩罚分数反馈。我们的研究结果支持了我们的假设,表明奖励导致学习速度较慢,但促进了概括。另一方面,惩罚导致学习速度更快,但损害了概括。这些行为差异可能归因于每组运动变异性的不同趋势。惩罚组在初始学习阶段比奖励组更积极地探索,这可能是由于损失厌恶。相比之下,奖励组在概括测试阶段比初始学习阶段更积极地探索,似乎回忆起导致奖励的策略。这些结果表明,奖励和惩罚可能在基于强化的运动学习和概括过程中涉及不同的神经机制,这对运动训练和运动康复等实际应用具有重要意义。虽然强化学习是一个重要的运动学习过程,但大脑通过奖励和惩罚学习运动的机制仍不完全清楚。我们修改了一个成熟的运动适应任务,并使用节省(更快的重新学习)来衡量概括。我们发现奖励导致学习速度较慢但促进概括,而惩罚导致学习速度更快但损害概括,这表明奖励和惩罚可能在基于强化的运动学习和概括过程中涉及不同的神经机制。