• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多巴胺反应符合形式学习理论的基本假设。

Dopamine responses comply with basic assumptions of formal learning theory.

作者信息

Waelti P, Dickinson A, Schultz W

机构信息

Institute of Physiology and Programme in Neuroscience, University of Fribourg, CH-1700 Fribourg, Switzerland.

出版信息

Nature. 2001 Jul 5;412(6842):43-8. doi: 10.1038/35083500.

DOI:10.1038/35083500
PMID:11452299
Abstract

According to contemporary learning theories, the discrepancy, or error, between the actual and predicted reward determines whether learning occurs when a stimulus is paired with a reward. The role of prediction errors is directly demonstrated by the observation that learning is blocked when the stimulus is paired with a fully predicted reward. By using this blocking procedure, we show that the responses of dopamine neurons to conditioned stimuli was governed differentially by the occurrence of reward prediction errors rather than stimulus-reward associations alone, as was the learning of behavioural reactions. Both behavioural and neuronal learning occurred predominantly when dopamine neurons registered a reward prediction error at the time of the reward. Our data indicate that the use of analytical tests derived from formal behavioural learning theory provides a powerful approach for studying the role of single neurons in learning.

摘要

根据当代学习理论,实际奖励与预测奖励之间的差异或误差决定了在刺激与奖励配对时学习是否发生。当刺激与完全可预测的奖励配对时学习受到阻碍这一观察结果直接证明了预测误差的作用。通过使用这种阻碍程序,我们表明,多巴胺神经元对条件刺激的反应,与行为反应的学习一样,是由奖励预测误差的出现而非仅由刺激-奖励关联差异性地控制的。当多巴胺神经元在奖励出现时记录到奖励预测误差时,行为学习和神经元学习主要都会发生。我们的数据表明,使用源自正式行为学习理论的分析测试为研究单个神经元在学习中的作用提供了一种强有力的方法。

相似文献

1
Dopamine responses comply with basic assumptions of formal learning theory.多巴胺反应符合形式学习理论的基本假设。
Nature. 2001 Jul 5;412(6842):43-8. doi: 10.1038/35083500.
2
Dopamine neurons report an error in the temporal prediction of reward during learning.多巴胺神经元在学习过程中报告奖励时间预测的误差。
Nat Neurosci. 1998 Aug;1(4):304-9. doi: 10.1038/1124.
3
Predictive reward signal of dopamine neurons.多巴胺神经元的预测性奖励信号。
J Neurophysiol. 1998 Jul;80(1):1-27. doi: 10.1152/jn.1998.80.1.1.
4
Coding of predicted reward omission by dopamine neurons in a conditioned inhibition paradigm.在条件性抑制范式中多巴胺神经元对预测奖励缺失的编码。
J Neurosci. 2003 Nov 12;23(32):10402-10. doi: 10.1523/JNEUROSCI.23-32-10402.2003.
5
Neural coding of reward-prediction error signals during classical conditioning with attractive faces.在对有吸引力面孔进行经典条件反射过程中奖励预测误差信号的神经编码
J Neurophysiol. 2007 Apr;97(4):3036-45. doi: 10.1152/jn.01211.2006. Epub 2007 Feb 15.
6
Tonically active neurons in the striatum differentiate between delivery and omission of expected reward in a probabilistic task context.在概率性任务情境中,纹状体中的紧张性活动神经元能够区分预期奖励的给予和未给予。
Eur J Neurosci. 2009 Aug;30(3):515-26. doi: 10.1111/j.1460-9568.2009.06872.x. Epub 2009 Jul 28.
7
Involvement of basal ganglia and orbitofrontal cortex in goal-directed behavior.基底神经节和眶额皮质在目标导向行为中的参与。
Prog Brain Res. 2000;126:193-215. doi: 10.1016/S0079-6123(00)26015-9.
8
A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task.一种具有类似多巴胺强化信号的神经网络模型,用于学习空间延迟反应任务。
Neuroscience. 1999;91(3):871-90. doi: 10.1016/s0306-4522(98)00697-6.
9
Neuronal coding of prediction errors.预测误差的神经元编码。
Annu Rev Neurosci. 2000;23:473-500. doi: 10.1146/annurev.neuro.23.1.473.
10
Responses of monkey dopamine neurons during learning of behavioral reactions.猴子多巴胺神经元在行为反应学习过程中的反应
J Neurophysiol. 1992 Jan;67(1):145-63. doi: 10.1152/jn.1992.67.1.145.

引用本文的文献

1
Dopamine supports reward prediction to constrain reward seeking.多巴胺支持奖励预测以限制奖励寻求行为。
bioRxiv. 2025 Aug 23:2025.08.22.671841. doi: 10.1101/2025.08.22.671841.
2
Reward-driven cerebellar climbing fiber activity influences both neural and behavioral learning.奖励驱动的小脑攀缘纤维活动影响神经和行为学习。
Curr Biol. 2025 Aug 9. doi: 10.1016/j.cub.2025.07.064.
3
Tonic dopamine and biases in value learning linked through a biologically inspired reinforcement learning model.通过生物启发式强化学习模型,紧张性多巴胺与价值学习中的偏差相联系。
Nat Commun. 2025 Aug 13;16(1):7529. doi: 10.1038/s41467-025-62280-1.
4
Oppositional and competitive instigation of hippocampal synaptic plasticity by the VTA and locus coeruleus.腹侧被盖区和蓝斑对海马突触可塑性的对立性和竞争性刺激
Proc Natl Acad Sci U S A. 2025 Jan 7;122(1):e2402356122. doi: 10.1073/pnas.2402356122. Epub 2024 Dec 30.
5
Dopaminergic responses to identity prediction errors depend differently on the orbitofrontal cortex and hippocampus.多巴胺能对身份预测误差的反应在眶额皮质和海马体上的依赖方式有所不同。
bioRxiv. 2024 Dec 17:2024.12.11.628003. doi: 10.1101/2024.12.11.628003.
6
Dopamine transients encode reward prediction errors independent of learning rates.多巴胺瞬变独立于学习率编码奖励预测误差。
Cell Rep. 2024 Oct 22;43(10):114840. doi: 10.1016/j.celrep.2024.114840. Epub 2024 Oct 11.
7
Computational Modeling Differentiates Learning Rate From Reward Sensitivity Deficits Produced by Early-Life Adversity in a Rodent Touchscreen Probabilistic Reward Task.在啮齿动物触屏概率奖励任务中,计算模型区分了早期生活逆境所产生的学习率与奖励敏感性缺陷。
Biol Psychiatry Glob Open Sci. 2024 Jul 20;4(6):100362. doi: 10.1016/j.bpsgos.2024.100362. eCollection 2024 Nov.
8
Pavlovian safety learning: An integrative theoretical review.巴甫洛夫式安全学习:一项综合性理论综述。
Psychon Bull Rev. 2025 Feb;32(1):176-202. doi: 10.3758/s13423-024-02559-4. Epub 2024 Aug 21.
9
The influence of emotion on temporal context models.情绪对时间背景模型的影响。
Cogn Emot. 2025 Feb;39(1):18-46. doi: 10.1080/02699931.2024.2371075. Epub 2024 Jul 15.
10
Learning to express reward prediction error-like dopaminergic activity requires plastic representations of time.学习表达类似于奖励预测误差的多巴胺能活动需要时间的可塑性表示。
Nat Commun. 2024 Jul 12;15(1):5856. doi: 10.1038/s41467-024-50205-3.