Suppr超能文献

多巴胺依赖的预测误差是人类寻求奖励行为的基础。

Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans.

作者信息

Pessiglione Mathias, Seymour Ben, Flandin Guillaume, Dolan Raymond J, Frith Chris D

机构信息

Wellcome Department of Imaging Neuroscience, 12 Queen Square, London WC1N 3BG, UK.

出版信息

Nature. 2006 Aug 31;442(7106):1042-5. doi: 10.1038/nature05051. Epub 2006 Aug 23.

Abstract

Theories of instrumental learning are centred on understanding how success and failure are used to improve future decisions. These theories highlight a central role for reward prediction errors in updating the values associated with available actions. In animals, substantial evidence indicates that the neurotransmitter dopamine might have a key function in this type of learning, through its ability to modulate cortico-striatal synaptic efficacy. However, no direct evidence links dopamine, striatal activity and behavioural choice in humans. Here we show that, during instrumental learning, the magnitude of reward prediction error expressed in the striatum is modulated by the administration of drugs enhancing (3,4-dihydroxy-L-phenylalanine; L-DOPA) or reducing (haloperidol) dopaminergic function. Accordingly, subjects treated with L-DOPA have a greater propensity to choose the most rewarding action relative to subjects treated with haloperidol. Furthermore, incorporating the magnitude of the prediction errors into a standard action-value learning algorithm accurately reproduced subjects' behavioural choices under the different drug conditions. We conclude that dopamine-dependent modulation of striatal activity can account for how the human brain uses reward prediction errors to improve future decisions.

摘要

工具性学习理论的核心在于理解成功与失败是如何被用于改进未来决策的。这些理论强调了奖励预测误差在更新与可用行动相关联的价值方面的核心作用。在动物身上,大量证据表明神经递质多巴胺可能在这类学习中具有关键作用,通过其调节皮质 - 纹状体突触效能的能力。然而,在人类中,尚无直接证据将多巴胺、纹状体活动和行为选择联系起来。在此我们表明,在工具性学习过程中,纹状体中表达的奖励预测误差的大小会受到增强(3,4 - 二羟基 - L - 苯丙氨酸;L - 多巴)或降低(氟哌啶醇)多巴胺能功能的药物给药的调节。相应地,与接受氟哌啶醇治疗的受试者相比,接受L - 多巴治疗的受试者更倾向于选择最具奖励性的行动。此外,将预测误差的大小纳入标准行动价值学习算法能够准确重现不同药物条件下受试者的行为选择。我们得出结论,多巴胺对纹状体活动依赖性的调节能够解释人类大脑如何利用奖励预测误差来改进未来决策。

相似文献

4

引用本文的文献

5
Impaired effort allocation in schizophrenia.精神分裂症中努力分配受损。
Schizophr Res Cogn. 2025 Jul 15;42:100378. doi: 10.1016/j.scog.2025.100378. eCollection 2025 Dec.
6
Basal ganglia activation localized in MEG using a reward task.使用奖励任务在脑磁图中定位基底神经节激活。
Neuroimage Rep. 2021 Jul 28;1(3):100034. doi: 10.1016/j.ynirp.2021.100034. eCollection 2021 Sep.

本文引用的文献

3
Distributed neural representation of expected value.预期值的分布式神经表征。
J Neurosci. 2005 May 11;25(19):4806-12. doi: 10.1523/JNEUROSCI.0642-05.2005.
8
Dopamine, learning and motivation.多巴胺、学习与动机。
Nat Rev Neurosci. 2004 Jun;5(6):483-94. doi: 10.1038/nrn1406.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验