Daw Nathaniel D, Doya Kenji
Gatsby Computational Neuroscience Unit, UCL, Alexandra House, 17 Queen Square, London, WC1N 3AR, UK.
Curr Opin Neurobiol. 2006 Apr;16(2):199-204. doi: 10.1016/j.conb.2006.03.006. Epub 2006 Mar 24.
Following the suggestion that midbrain dopaminergic neurons encode a signal, known as a 'reward prediction error', used by artificial intelligence algorithms for learning to choose advantageous actions, the study of the neural substrates for reward-based learning has been strongly influenced by computational theories. In recent work, such theories have been increasingly integrated into experimental design and analysis. Such hybrid approaches have offered detailed new insights into the function of a number of brain areas, especially the cortex and basal ganglia. In part this is because these approaches enable the study of neural correlates of subjective factors (such as a participant's beliefs about the reward to be received for performing some action) that the computational theories purport to quantify.
中脑多巴胺能神经元编码一种被称为“奖励预测误差”的信号,人工智能算法利用该信号来学习选择有利的行动。基于这一观点,基于奖励学习的神经基础研究受到了计算理论的强烈影响。在最近的研究中,这些理论越来越多地被整合到实验设计和分析中。这种混合方法为许多脑区的功能提供了详细的新见解,尤其是皮层和基底神经节。部分原因在于,这些方法能够研究计算理论旨在量化的主观因素(例如参与者对执行某种行动所获奖励的信念)的神经关联。