Graduate School of Comprehensive Human Sciences, University of Tsukuba, Tsukuba, Japan.
Division of Biomedical Science, Faculty of Medicine, University of Tsukuba, Tsukuba, Japan.
Elife. 2021 Mar 10;10:e63166. doi: 10.7554/eLife.63166.
Animal behavior is regulated based on the values of future rewards. The phasic activity of midbrain dopamine neurons signals these values. Because reward values often change over time, even on a subsecond-by-subsecond basis, appropriate behavioral regulation requires continuous value monitoring. However, the phasic dopamine activity, which is sporadic and has a short duration, likely fails continuous monitoring. Here, we demonstrate a tonic firing mode of dopamine neurons that effectively tracks changing reward values. We recorded dopamine neuron activity in monkeys during a Pavlovian procedure in which the value of a cued reward gradually increased or decreased. Dopamine neurons tonically increased and decreased their activity as the reward value changed. This tonic activity was evoked more strongly by non-burst spikes than burst spikes producing a conventional phasic activity. Our findings suggest that dopamine neurons change their firing mode to effectively signal reward values in a given situation.
动物行为是根据未来奖励的价值来调节的。中脑多巴胺神经元的相位活动信号这些价值。由于奖励价值通常随时间变化,即使在亚秒级的基础上,适当的行为调节也需要持续的价值监测。然而,相位多巴胺活动是零星的,持续时间短,可能无法进行连续监测。在这里,我们展示了一种多巴胺神经元的紧张性放电模式,它可以有效地跟踪不断变化的奖励价值。我们在猴子身上记录了多巴胺神经元在条件反射程序中的活动,在这个程序中,提示奖励的价值逐渐增加或减少。当奖励价值发生变化时,多巴胺神经元的活动会持续增加和减少。与产生传统相位活动的爆发性尖峰相比,这种紧张性活动被非爆发性尖峰更强烈地激发。我们的发现表明,多巴胺神经元改变其放电模式,以有效地在给定情况下信号奖励价值。