Jin Shuyang, Hull Court
Department of Neurobiology, Duke University, Durham, NC 27710, USA.
Department of Neurobiology, Duke University, Durham, NC 27710, USA.
Curr Biol. 2025 Aug 9. doi: 10.1016/j.cub.2025.07.064.
The cerebellum plays a key role in motor coordination and learning. In contrast to classical supervised learning models, recent work has revealed that climbing fibers (CFs) can signal reward-predictive information in some behaviors. This raises the question of whether CFs may also operate according to the principles of reinforcement learning. To test how CFs operate during reward-guided behavior and evaluate the role of reward-related CF activity in learning, we measured CF responses in Purkinje cells of the lateral cerebellum during a Pavlovian task using two-photon calcium imaging. Specifically, we have performed multi-stimulus experiments to determine whether CF activity meets the requirements of a reward prediction error (rPE) signal for transfer from an unexpected reward to a reward-predictive cue. We find that once CF activity is transferred to a conditioned stimulus, and there is no longer a response to reward, CFs cannot generate learned responses to a second conditioned stimulus that carries the same reward prediction. In addition, by expressing the inhibitory opsin GtACR2 in neurons of the inferior olive and optically inhibiting these neurons during behavioral training at the time of unexpected reward, we find that the transfer of CF signals to the conditioned stimulus is impaired. Moreover, this optogenetic inhibition also impairs learning, resulting in a deficit in anticipatory lick timing. Together, these results indicate that CF signals can exhibit several characteristics in common with the rPEs that have been observed during reinforcement learning and that the cerebellum can harness these reward-related learning signals to generate accurately timed motor behavior.
小脑在运动协调和学习中起关键作用。与经典的监督学习模型不同,最近的研究表明,攀爬纤维(CFs)在某些行为中可以传递奖励预测信息。这就提出了一个问题,即CFs是否也可能按照强化学习的原则运作。为了测试CFs在奖励引导行为中的运作方式,并评估与奖励相关的CF活动在学习中的作用,我们在一项巴甫洛夫任务中使用双光子钙成像测量了外侧小脑浦肯野细胞中的CF反应。具体来说,我们进行了多刺激实验,以确定CF活动是否满足奖励预测误差(rPE)信号从意外奖励转移到奖励预测线索的要求。我们发现,一旦CF活动转移到条件刺激上,并且对奖励不再有反应,CFs就无法对携带相同奖励预测的第二个条件刺激产生习得反应。此外,通过在下橄榄核神经元中表达抑制性视蛋白GtACR2,并在意外奖励出现时的行为训练过程中对这些神经元进行光学抑制,我们发现CF信号向条件刺激的转移受到损害。此外,这种光遗传学抑制也损害了学习,导致预期舔舐时间出现缺陷。总之,这些结果表明,CF信号可以表现出与强化学习过程中观察到的rPEs的几个共同特征,并且小脑可以利用这些与奖励相关的学习信号来产生精确计时的运动行为。