Department of Psychiatry, University of Pittsburgh, Pittsburgh, PA, USA; Department of Bioengineering, University of Pittsburgh, Pittsburgh, PA, USA.
Department of Psychiatry, University of Pittsburgh, Pittsburgh, PA, USA; Department of Neurology, Washington University School of Medicine, St. Louis, MO, USA.
Neuroimage. 2023 Feb 15;267:119831. doi: 10.1016/j.neuroimage.2022.119831. Epub 2022 Dec 28.
Converging evidence from both human neuroimaging and animal studies has supported a model of mesolimbic processing underlying reward learning behaviors, based on the computation of reward prediction errors. However, competing evidence supports human dopamine signaling in the basal ganglia as also contributing to the generation of higher order learning heuristics. Here, we present data from a large (N = 81, 18-30yo), multi-modal neuroimaging study using simultaneously acquired task fMRI, affording temporal resolution of reward system function, and PET imaging with [C]Raclopride (RAC), assessing striatal dopamine (DA) D2/3 receptor binding, during performance of a probabilistic reward learning task. Both fMRI activation and PET DA measures showed ventral striatum involvement for signaling rewards. However, greater DA release was uniquely associated with learning strategies (i.e., learning rates) that were more task-optimal within the best fitting reinforcement learning model. This DA response was associated with BOLD activation of a network of regions including anterior cingulate cortex, medial prefrontal cortex, thalamus and posterior parietal cortex, primarily during expectation, rather than prediction error, task epochs. Together, these data provide novel, human in vivo evidence that striatal dopaminergic signaling interacts with a network of cortical regions to generate task-optimal learning strategies, rather than representing reward outcomes in isolation.
越来越多的人类神经影像学和动物研究证据支持了一种中脑边缘处理模型,该模型是基于奖励预测误差的计算,用于解释奖励学习行为。然而,也有竞争的证据表明,人类基底神经节的多巴胺信号也有助于产生更高阶的学习启发式策略。在这里,我们展示了一项大型(N=81,18-30 岁)多模态神经影像学研究的数据,该研究使用同时采集的任务 fMRI,提供了奖励系统功能的时间分辨率,以及使用 [C]Raclopride(RAC)进行的正电子发射断层扫描(PET),评估纹状体多巴胺(DA)D2/3 受体结合,在进行概率奖励学习任务时。fMRI 激活和 PET DA 测量都显示腹侧纹状体参与了奖励信号的传递。然而,更大的 DA 释放与学习策略(即学习率)独特相关,这些策略在最佳拟合强化学习模型内更符合任务要求。这种 DA 反应与包括前扣带皮层、内侧前额叶皮层、丘脑和顶叶后皮质在内的一系列区域的 BOLD 激活有关,主要发生在期望而不是预测误差任务期间。总之,这些数据提供了新的、人类体内的证据,表明纹状体多巴胺能信号与皮质区域网络相互作用,产生符合任务要求的学习策略,而不是孤立地代表奖励结果。