Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, UK.
Department of Neurobiology, Systems Neuroscience Institute, University of Pittsburgh, Pittsburgh, PA 15261, USA.
Curr Opin Neurobiol. 2017 Apr;43:139-148. doi: 10.1016/j.conb.2017.03.013. Epub 2017 Apr 6.
The phasic dopamine reward prediction error response is a major brain signal underlying learning, approach and decision making. This dopamine response consists of two components that reflect, initially, stimulus detection from physical impact and, subsequenttly, reward valuation; dopamine activations by punishers reflect physical impact rather than aversiveness. The dopamine reward signal is distinct from earlier reported and recently confirmed phasic changes with behavioural activation. Optogenetic activation of dopamine neurones in monkeys causes value learning and biases economic choices. The dopamine reward signal conforms to formal economic utility and thus constitutes a utility prediction error signal. In these combined ways, the dopamine reward prediction error signal constitutes a potential neuronal substrate for the crucial economic decision variable of utility.
相位多巴胺奖励预测误差反应是学习、接近和决策的主要大脑信号。这种多巴胺反应由两个组成部分组成,最初反映了物理冲击的刺激检测,随后反映了奖励估值;惩罚者的多巴胺激活反映了物理冲击,而不是厌恶。多巴胺奖励信号与之前报道的、最近证实的与行为激活相关的相位变化不同。猴子中多巴胺神经元的光遗传学激活导致价值学习和经济选择的偏差。多巴胺奖励信号符合形式经济学效用,因此构成了效用预测误差信号。通过这些组合方式,多巴胺奖励预测误差信号构成了效用这一关键经济决策变量的潜在神经元基质。