NIDA-Intramural Research Program, Baltimore, Maryland 21224, USA.
J Neurosci. 2012 Jul 25;32(30):10296-305. doi: 10.1523/JNEUROSCI.0832-12.2012.
Neural correlates of reward prediction errors (RPEs) have been found in dorsal striatum. Such signals may be important for updating associative action representations within striatum. In order that the appropriate representations can be updated, it might be important for the RPE signal to be specific for the action that led to that error. However, RPEs signaled by midbrain dopamine neurons, which project heavily to striatum, are not action-specific. Here we tested whether RPE-like activity in dorsal striatum is action-specific; we recorded single-unit activity in posterior dorsomedial and dorsolateral striatum as rats performed a task in which the reward predictions associated with two different actions were repeatedly violated, thereby eliciting RPEs. We separately analyzed fast firing neurons (FFNs) and phasically firing neurons (total n = 1076). Only among FFNs recorded in posterior dorsomedial striatum did we find a population with RPE-like characteristics (19 of all 196 FFNs, 10%). This population showed a phasic increase in activity during unexpected rewards, a phasic decrease in activity during unexpected omission of rewards, and a phasic increase in activity during cues when they predicted high-value reward. However, unlike a classical RPE signal, this signal was linked to the action that elicited the prediction error, in that neurons tended to signal RPEs only after their anti-preferred action. This action-specific RPE-like signal could provide a mechanism for updating specific associative action representations in posterior dorsomedial striatum.
背侧纹状体中发现了与奖励预测误差(RPE)相关的神经相关物。这种信号可能对更新纹状体内的关联动作表示很重要。为了能够更新适当的表示,与导致该错误的动作相关的 RPE 信号可能是特定的,这可能很重要。然而,从中脑多巴胺神经元发出的 RPE 信号,这些神经元大量投射到纹状体,不是特定于动作的。在这里,我们测试了背侧纹状体中的 RPE 样活动是否是特定于动作的;当大鼠执行一项任务时,我们记录了后背侧腹侧和背外侧纹状体中的单个单元活动,在该任务中,与两个不同动作相关的奖励预测反复被违反,从而引起 RPE。我们分别分析了快速放电神经元(FFN)和相位放电神经元(总共 n = 1076)。只有在记录于后背侧腹侧纹状体中的 FFN 中,我们才发现具有 RPE 样特征的群体(196 个 FFN 中的 19 个,10%)。该群体在意外奖励期间表现出活动的相位增加,在意外奖励缺失期间表现出活动的相位减少,在预测高价值奖励时表现出活动的相位增加。然而,与经典的 RPE 信号不同,该信号与引起预测误差的动作有关,即神经元仅在其反优先动作之后才倾向于发出 RPE 信号。这种特定于动作的 RPE 样信号可以为更新后背侧腹侧纹状体中特定的关联动作表示提供一种机制。