• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多巴胺神经元的预测性奖励信号。

Predictive reward signal of dopamine neurons.

作者信息

Schultz W

机构信息

Institute of Physiology and Program in Neuroscience, University of Fribourg, CH-1700 Fribourg, Switzerland.

出版信息

J Neurophysiol. 1998 Jul;80(1):1-27. doi: 10.1152/jn.1998.80.1.1.

DOI:10.1152/jn.1998.80.1.1
PMID:9658025
Abstract

The effects of lesions, receptor blocking, electrical self-stimulation, and drugs of abuse suggest that midbrain dopamine systems are involved in processing reward information and learning approach behavior. Most dopamine neurons show phasic activations after primary liquid and food rewards and conditioned, reward-predicting visual and auditory stimuli. They show biphasic, activation-depression responses after stimuli that resemble reward-predicting stimuli or are novel or particularly salient. However, only few phasic activations follow aversive stimuli. Thus dopamine neurons label environmental stimuli with appetitive value, predict and detect rewards and signal alerting and motivating events. By failing to discriminate between different rewards, dopamine neurons appear to emit an alerting message about the surprising presence or absence of rewards. All responses to rewards and reward-predicting stimuli depend on event predictability. Dopamine neurons are activated by rewarding events that are better than predicted, remain uninfluenced by events that are as good as predicted, and are depressed by events that are worse than predicted. By signaling rewards according to a prediction error, dopamine responses have the formal characteristics of a teaching signal postulated by reinforcement learning theories. Dopamine responses transfer during learning from primary rewards to reward-predicting stimuli. This may contribute to neuronal mechanisms underlying the retrograde action of rewards, one of the main puzzles in reinforcement learning. The impulse response releases a short pulse of dopamine onto many dendrites, thus broadcasting a rather global reinforcement signal to postsynaptic neurons. This signal may improve approach behavior by providing advance reward information before the behavior occurs, and may contribute to learning by modifying synaptic transmission. The dopamine reward signal is supplemented by activity in neurons in striatum, frontal cortex, and amygdala, which process specific reward information but do not emit a global reward prediction error signal. A cooperation between the different reward signals may assure the use of specific rewards for selectively reinforcing behaviors. Among the other projection systems, noradrenaline neurons predominantly serve attentional mechanisms and nucleus basalis neurons code rewards heterogeneously. Cerebellar climbing fibers signal errors in motor performance or errors in the prediction of aversive events to cerebellar Purkinje cells. Most deficits following dopamine-depleting lesions are not easily explained by a defective reward signal but may reflect the absence of a general enabling function of tonic levels of extracellular dopamine. Thus dopamine systems may have two functions, the phasic transmission of reward information and the tonic enabling of postsynaptic neurons.

摘要

损伤、受体阻断、电自我刺激以及滥用药物的影响表明,中脑多巴胺系统参与奖励信息的处理和学习趋近行为。大多数多巴胺神经元在初次液体和食物奖励以及条件性、奖励预测性视觉和听觉刺激后呈现相位激活。在类似于奖励预测性刺激、新颖或特别突出的刺激后,它们表现出双相的激活-抑制反应。然而,只有少数相位激活跟随厌恶刺激。因此,多巴胺神经元用具有吸引力的价值标记环境刺激,预测和检测奖励,并发出警报和激发事件的信号。由于无法区分不同的奖励,多巴胺神经元似乎发出了关于奖励意外出现或缺失的警报信息。对奖励和奖励预测性刺激的所有反应都取决于事件的可预测性。多巴胺神经元在奖励事件比预期更好时被激活,在与预期一样好的事件中不受影响,在比预期更差的事件中被抑制。通过根据预测误差发出奖励信号,多巴胺反应具有强化学习理论假设的教学信号的形式特征。在学习过程中,多巴胺反应从初级奖励转移到奖励预测性刺激。这可能有助于强化学习中主要谜题之一的奖励逆行作用的神经元机制。冲动反应将一小股多巴胺释放到许多树突上,从而向突触后神经元广播一个相当全局性的强化信号。这个信号可以通过在行为发生前提供提前的奖励信息来改善趋近行为,并且可能通过改变突触传递来促进学习。多巴胺奖励信号由纹状体、额叶皮质和杏仁核中的神经元活动补充,这些神经元处理特定的奖励信息,但不发出全局性的奖励预测误差信号。不同奖励信号之间的合作可能确保使用特定奖励来选择性地强化行为。在其他投射系统中,去甲肾上腺素神经元主要服务于注意力机制,基底核神经元对奖励进行异质性编码。小脑攀缘纤维向小脑浦肯野细胞发出运动表现误差或厌恶事件预测误差的信号。多巴胺耗竭性损伤后的大多数缺陷不容易用有缺陷的奖励信号来解释,但可能反映了细胞外多巴胺紧张水平的一般促进功能的缺失。因此,多巴胺系统可能有两种功能,奖励信息的相位传递和突触后神经元的紧张促进作用。

相似文献

1
Predictive reward signal of dopamine neurons.多巴胺神经元的预测性奖励信号。
J Neurophysiol. 1998 Jul;80(1):1-27. doi: 10.1152/jn.1998.80.1.1.
2
Reward signaling by dopamine neurons.多巴胺神经元的奖赏信号传导
Neuroscientist. 2001 Aug;7(4):293-302. doi: 10.1177/107385840100700406.
3
Involvement of basal ganglia and orbitofrontal cortex in goal-directed behavior.基底神经节和眶额皮质在目标导向行为中的参与。
Prog Brain Res. 2000;126:193-215. doi: 10.1016/S0079-6123(00)26015-9.
4
A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task.一种具有类似多巴胺强化信号的神经网络模型,用于学习空间延迟反应任务。
Neuroscience. 1999;91(3):871-90. doi: 10.1016/s0306-4522(98)00697-6.
5
Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli.与厌恶刺激相比,奖赏性刺激对中脑多巴胺神经元具有优先激活作用。
Nature. 1996 Feb 1;379(6564):449-51. doi: 10.1038/379449a0.
6
Dopamine signals for reward value and risk: basic and recent data.多巴胺信号与奖励价值和风险:基础与近期数据。
Behav Brain Funct. 2010 Apr 23;6:24. doi: 10.1186/1744-9081-6-24.
7
Neuronal coding of prediction errors.预测误差的神经元编码。
Annu Rev Neurosci. 2000;23:473-500. doi: 10.1146/annurev.neuro.23.1.473.
8
Dopamine neurons report an error in the temporal prediction of reward during learning.多巴胺神经元在学习过程中报告奖励时间预测的误差。
Nat Neurosci. 1998 Aug;1(4):304-9. doi: 10.1038/1124.
9
Dopamine reward prediction error coding.多巴胺奖励预测误差编码。
Dialogues Clin Neurosci. 2016 Mar;18(1):23-32. doi: 10.31887/DCNS.2016.18.1/wschultz.
10
Behavior-related activity of primate dopamine neurons.灵长类动物多巴胺能神经元的行为相关活动。
Rev Neurol (Paris). 1994 Aug-Sep;150(8-9):634-9.

引用本文的文献

1
Age-dependent predictors of effective reinforcement motor learning across childhood.儿童期有效强化运动学习的年龄依赖性预测因素。
Elife. 2025 Aug 28;13:RP101036. doi: 10.7554/eLife.101036.
2
Identification of conserved frontal neurophysiological markers of cognitive flexibility in humans and rats.人类和大鼠认知灵活性的保守额叶神经生理标志物的鉴定。
Commun Biol. 2025 Aug 23;8(1):1268. doi: 10.1038/s42003-025-08729-x.
3
Concurrent representations of reinstated and transformed memories and their modulation by reward.恢复和转换记忆的并发表征及其奖励调节
Imaging Neurosci (Camb). 2025 Feb 18;3. doi: 10.1162/imag_a_00476. eCollection 2025.
4
Unintended bias in the pursuit of collinearity solutions in fMRI analysis.功能磁共振成像分析中寻求共线性解决方案时的意外偏差。
bioRxiv. 2025 Jun 18:2025.01.14.633053. doi: 10.1101/2025.01.14.633053.
5
Error encoding in human speech motor cortex.人类言语运动皮层中的错误编码。
bioRxiv. 2025 Jun 8:2025.06.07.658426. doi: 10.1101/2025.06.07.658426.
6
Neural Mechanisms of Feedback Processing and Regulation Recalibration During Neurofeedback Training.神经反馈训练期间反馈处理与调节重新校准的神经机制
Hum Brain Mapp. 2025 Jul;46(10):e70279. doi: 10.1002/hbm.70279.
7
Brain Neurotrophins and Plant Polyphenols: A Powerful Connection.脑神经营养因子与植物多酚:一种强大的联系。
Molecules. 2025 Jun 19;30(12):2657. doi: 10.3390/molecules30122657.
8
Reinforced liquid state machines-new training strategies for spiking neural networks based on reinforcements.强化液态机器——基于强化的脉冲神经网络新训练策略
Front Comput Neurosci. 2025 May 23;19:1569374. doi: 10.3389/fncom.2025.1569374. eCollection 2025.
9
Associations Between Taq1A/C957T Polymorphic Variants and Autonomic Responsivity in a Slot Machine Task: Influence of Real-Life Gambling Exposure and Sex.老虎机任务中Taq1A/C957T多态性变体与自主反应性之间的关联:现实生活中赌博暴露和性别的影响。
J Gambl Stud. 2025 May 30. doi: 10.1007/s10899-025-10398-8.
10
Disrupted functional connectome in a rodent model of autism during social isolation.社交隔离期间自闭症啮齿动物模型中功能连接组的破坏
Front Neural Circuits. 2025 May 14;19:1525130. doi: 10.3389/fncir.2025.1525130. eCollection 2025.