Kishida Kenneth T, Saez Ignacio, Lohrenz Terry, Witcher Mark R, Laxton Adrian W, Tatter Stephen B, White Jason P, Ellis Thomas L, Phillips Paul E M, Montague P Read
Virginia Tech Carilion Research Institute, Virginia Tech, Roanoke, VA 24016;
Department of Neurosurgery, Wake Forest Health Sciences, Winston-Salem, NC 27157;
Proc Natl Acad Sci U S A. 2016 Jan 5;113(1):200-5. doi: 10.1073/pnas.1513619112. Epub 2015 Nov 23.
In the mammalian brain, dopamine is a critical neuromodulator whose actions underlie learning, decision-making, and behavioral control. Degeneration of dopamine neurons causes Parkinson's disease, whereas dysregulation of dopamine signaling is believed to contribute to psychiatric conditions such as schizophrenia, addiction, and depression. Experiments in animal models suggest the hypothesis that dopamine release in human striatum encodes reward prediction errors (RPEs) (the difference between actual and expected outcomes) during ongoing decision-making. Blood oxygen level-dependent (BOLD) imaging experiments in humans support the idea that RPEs are tracked in the striatum; however, BOLD measurements cannot be used to infer the action of any one specific neurotransmitter. We monitored dopamine levels with subsecond temporal resolution in humans (n = 17) with Parkinson's disease while they executed a sequential decision-making task. Participants placed bets and experienced monetary gains or losses. Dopamine fluctuations in the striatum fail to encode RPEs, as anticipated by a large body of work in model organisms. Instead, subsecond dopamine fluctuations encode an integration of RPEs with counterfactual prediction errors, the latter defined by how much better or worse the experienced outcome could have been. How dopamine fluctuations combine the actual and counterfactual is unknown. One possibility is that this process is the normal behavior of reward processing dopamine neurons, which previously had not been tested by experiments in animal models. Alternatively, this superposition of error terms may result from an additional yet-to-be-identified subclass of dopamine neurons.
在哺乳动物大脑中,多巴胺是一种关键的神经调质,其作用是学习、决策和行为控制的基础。多巴胺神经元的退化会导致帕金森病,而多巴胺信号失调被认为与精神疾病如精神分裂症、成瘾和抑郁症有关。动物模型实验提出了一个假说,即在人类纹状体中,多巴胺释放编码了正在进行的决策过程中的奖励预测误差(RPE,实际结果与预期结果之间的差异)。人类的血氧水平依赖(BOLD)成像实验支持了RPE在纹状体中被追踪的观点;然而,BOLD测量不能用于推断任何一种特定神经递质的作用。我们以亚秒级的时间分辨率监测了17名帕金森病患者在执行顺序决策任务时的多巴胺水平。参与者下注并经历金钱得失。正如模型生物中的大量研究预期的那样,纹状体中的多巴胺波动未能编码RPE。相反,亚秒级的多巴胺波动编码了RPE与反事实预测误差的整合,后者由经历的结果本可以好多少或坏多少来定义。多巴胺波动如何结合实际和反事实情况尚不清楚。一种可能性是,这个过程是奖励处理多巴胺神经元的正常行为,而之前动物模型实验尚未对此进行测试。或者,这种误差项的叠加可能是由一个尚未被识别的多巴胺神经元亚类导致的。