Princeton Neuroscience Institute & Department of Psychology, Princeton University, Princeton, NJ 08540, United States.
Princeton Neuroscience Institute & Department of Psychology, Princeton University, Princeton, NJ 08540, United States; National Institute on Drug Abuse, Baltimore, MD 21224, United States; School of Psychology, University of New South Wales, Australia.
Curr Opin Neurobiol. 2018 Apr;49:1-7. doi: 10.1016/j.conb.2017.10.006. Epub 2017 Oct 31.
Phasic dopamine responses are thought to encode a prediction-error signal consistent with model-free reinforcement learning theories. However, a number of recent findings highlight the influence of model-based computations on dopamine responses, and suggest that dopamine prediction errors reflect more dimensions of an expected outcome than scalar reward value. Here, we review a selection of these recent results and discuss the implications and complications of model-based predictions for computational theories of dopamine and learning.
相位多巴胺反应被认为编码了与无模型强化学习理论一致的预测误差信号。然而,最近的一些发现强调了基于模型计算对多巴胺反应的影响,并表明多巴胺预测误差反映了预期结果的多个维度,而不仅仅是标量奖励值。在这里,我们回顾了其中的一些最新结果,并讨论了基于模型的预测对多巴胺和学习的计算理论的影响和复杂性。