• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

奖励概率安排下习得的纹状体和多巴胺细胞反应的局部回路模型。

A local circuit model of learned striatal and dopamine cell responses under probabilistic schedules of reward.

作者信息

Tan Can Ozan, Bullock Daniel

机构信息

Cognitive and Neural Systems Department, Boston University, Boston, Massachusetts 02215, USA.

出版信息

J Neurosci. 2008 Oct 1;28(40):10062-74. doi: 10.1523/JNEUROSCI.0259-08.2008.

DOI:10.1523/JNEUROSCI.0259-08.2008
PMID:18829964
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6671259/
Abstract

Recently, dopamine (DA) neurons of the substantia nigra pars compacta (SNc) were found to exhibit sustained responses related to reward uncertainty, in addition to the phasic responses related to reward-prediction errors (RPEs). Thus, cue-dependent anticipations of the timing, magnitude, and uncertainty of rewards are learned and reflected in components of DA signals. Here we simulate a local circuit model to show how learned uncertainty responses are generated, along with phasic RPE responses, on single trials. Both types of simulated DA responses exhibit the empirically observed dependencies on conditional probability, expected value of reward, and time since onset of the reward-predicting cue. The model's three major pathways compute expected values of cues, timed predictions of reward magnitudes, and uncertainties associated with these predictions. The first two pathways' computations refine those modeled by Brown et al. (1999). The third, newly modeled, pathway involves medium spiny projection neurons (MSPNs) of the striatal matrix, whose axons corelease GABA and substance P, both at synapses with GABAergic neurons in the substantia nigra pars reticulata (SNr) and with distal dendrites (in SNr) of DA neurons whose somas are located in ventral SNc. Corelease enables efficient computation of uncertainty responses that are a nonmonotonic function of the conditional probability of reward, and variability in striatal cholinergic transmission can explain observed individual differences in the amplitudes of uncertainty responses. The involvement of matricial MSPNs and cholinergic transmission within the striatum implies a relation between uncertainty in cue-reward contingencies and action-selection functions of the basal ganglia.

摘要

最近,人们发现黑质致密部(SNc)的多巴胺(DA)神经元除了表现出与奖励预测误差(RPE)相关的相位反应外,还表现出与奖励不确定性相关的持续反应。因此,与线索相关的对奖励时间、大小和不确定性的预期在DA信号成分中得到学习和体现。在此,我们模拟了一个局部回路模型,以展示在单次试验中如何产生学习到的不确定性反应以及相位RPE反应。两种类型的模拟DA反应均表现出基于经验观察到的对条件概率、奖励期望值以及自奖励预测线索开始后的时间的依赖性。该模型的三条主要通路计算线索的期望值、奖励大小的定时预测以及与这些预测相关的不确定性。前两条通路的计算完善了Brown等人(1999年)所建立的模型。第三条新建立的通路涉及纹状体基质的中等棘状投射神经元(MSPNs),其轴突在与黑质网状部(SNr)的GABA能神经元以及与位于腹侧SNc的DA神经元的远端树突(在SNr中)形成的突触处共同释放GABA和P物质。共同释放能够高效计算不确定性反应,这种反应是奖励条件概率的非单调函数,并且纹状体胆碱能传递的变异性可以解释观察到的不确定性反应幅度的个体差异。纹状体基质MSPNs和胆碱能传递的参与意味着线索 - 奖励偶联的不确定性与基底神经节的动作选择功能之间存在关联。

相似文献

1
A local circuit model of learned striatal and dopamine cell responses under probabilistic schedules of reward.奖励概率安排下习得的纹状体和多巴胺细胞反应的局部回路模型。
J Neurosci. 2008 Oct 1;28(40):10062-74. doi: 10.1523/JNEUROSCI.0259-08.2008.
2
Can the apparent adaptation of dopamine neurons' mismatch sensitivities be reconciled with their computation of reward prediction errors?多巴胺神经元的失配敏感性的明显适应性能否与它们对奖励预测误差的计算相协调?
Neurosci Lett. 2008 Jun 13;438(1):14-6. doi: 10.1016/j.neulet.2008.04.059. Epub 2008 Apr 22.
3
Involvement of basal ganglia and orbitofrontal cortex in goal-directed behavior.基底神经节和眶额皮质在目标导向行为中的参与。
Prog Brain Res. 2000;126:193-215. doi: 10.1016/S0079-6123(00)26015-9.
4
A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task.一种具有类似多巴胺强化信号的神经网络模型,用于学习空间延迟反应任务。
Neuroscience. 1999;91(3):871-90. doi: 10.1016/s0306-4522(98)00697-6.
5
Striatal dopamine ramping may indicate flexible reinforcement learning with forgetting in the cortico-basal ganglia circuits.纹状体多巴胺爬坡可能表明皮质基底神经节回路具有灵活的强化学习和遗忘能力。
Front Neural Circuits. 2014 Apr 9;8:36. doi: 10.3389/fncir.2014.00036. eCollection 2014.
6
Neuropeptide co-release with GABA may explain functional non-monotonic uncertainty responses in dopamine neurons.神经肽与γ-氨基丁酸的共同释放可能解释多巴胺神经元中功能性非单调不确定性反应。
Neurosci Lett. 2008 Jan 17;430(3):218-23. doi: 10.1016/j.neulet.2007.10.039. Epub 2007 Nov 6.
7
How the basal ganglia use parallel excitatory and inhibitory learning pathways to selectively respond to unexpected rewarding cues.基底神经节如何利用平行的兴奋性和抑制性学习通路来选择性地响应意外的奖励线索。
J Neurosci. 1999 Dec 1;19(23):10502-11. doi: 10.1523/JNEUROSCI.19-23-10502.1999.
8
Modeling functions of striatal dopamine modulation in learning and planning.纹状体多巴胺调节在学习和规划中的建模功能。
Neuroscience. 2001;103(1):65-85. doi: 10.1016/s0306-4522(00)00554-6.
9
Dopaminergic Regulation of Striatal Interneurons in Reward and Addiction: Focus on Alcohol.奖赏与成瘾中纹状体中间神经元的多巴胺能调节:聚焦于酒精
Neural Plast. 2015;2015:814567. doi: 10.1155/2015/814567. Epub 2015 Jul 13.
10
Considerations upon the anatomical model of reward-based learning in the basal ganglia.关于基底神经节中基于奖励学习的解剖学模型的思考。
Med Hypotheses. 2000 Mar;54(3):397-9. doi: 10.1054/mehy.1999.0859.

引用本文的文献

1
Localization of the epileptogenic network from scalp EEG using a patient-specific whole-brain model.使用患者特异性全脑模型从头皮脑电图定位致痫网络。
Netw Neurosci. 2025 Mar 3;9(1):18-37. doi: 10.1162/netn_a_00418. eCollection 2025.
2
Human Substantia Nigra Neurons Encode Reward Expectations.人类黑质神经元编码奖励预期。
bioRxiv. 2024 May 11:2024.05.10.593406. doi: 10.1101/2024.05.10.593406.
3
A gradual temporal shift of dopamine responses mirrors the progression of temporal difference error in machine learning.多巴胺反应的逐渐时间转移反映了机器学习中时间差分误差的进展。
Nat Neurosci. 2022 Aug;25(8):1082-1092. doi: 10.1038/s41593-022-01109-2. Epub 2022 Jul 7.
4
Modulation of Dopamine for Adaptive Learning: A Neurocomputational Model.多巴胺对适应性学习的调节:一种神经计算模型。
Comput Brain Behav. 2021 Mar;4(1):34-52. doi: 10.1007/s42113-020-00083-x. Epub 2020 Jun 12.
5
A systems-neuroscience model of phasic dopamine.相位多巴胺的系统神经科学模型。
Psychol Rev. 2020 Nov;127(6):972-1021. doi: 10.1037/rev0000199. Epub 2020 Jun 11.
6
Synchronicity: The Role of Midbrain Dopamine in Whole-Brain Coordination.同步性:中脑多巴胺在全脑协调中的作用。
eNeuro. 2019 May 3;6(2). doi: 10.1523/ENEURO.0345-18.2019. Print 2019 Mar/Apr.
7
Neural Circuitry of Reward Prediction Error.奖励预测误差的神经回路
Annu Rev Neurosci. 2017 Jul 25;40:373-394. doi: 10.1146/annurev-neuro-072116-031109. Epub 2017 Apr 24.
8
Strain commonalities and differences in response-outcome decision making in mice.小鼠反应-结果决策中的应变共性与差异
Neurobiol Learn Mem. 2016 May;131:101-8. doi: 10.1016/j.nlm.2016.03.016. Epub 2016 Mar 18.
9
Arithmetic and local circuitry underlying dopamine prediction errors.多巴胺预测误差背后的算术和局部神经回路。
Nature. 2015 Sep 10;525(7568):243-6. doi: 10.1038/nature14855. Epub 2015 Aug 31.
10
Learning touch preferences with a tactile robot using dopamine modulated STDP in a model of insular cortex.在岛叶皮质模型中,利用多巴胺调节的突触可塑性,通过触觉机器人学习触觉偏好。
Front Neurorobot. 2015 Jul 22;9:6. doi: 10.3389/fnbot.2015.00006. eCollection 2015.

本文引用的文献

1
Neural dynamics underlying impaired autonomic and conditioned responses following amygdala and orbitofrontal lesions.杏仁核与眶额皮层损伤后自主神经及条件反应受损背后的神经动力学
Behav Neurosci. 2008 Oct;122(5):1100-25. doi: 10.1037/a0012808.
2
A dopamine-acetylcholine cascade: simulating learned and lesion-induced behavior of striatal cholinergic interneurons.多巴胺 - 乙酰胆碱级联反应:模拟纹状体胆碱能中间神经元的习得性行为和损伤诱导行为。
J Neurophysiol. 2008 Oct;100(4):2409-21. doi: 10.1152/jn.90486.2008. Epub 2008 Aug 20.
3
Can the apparent adaptation of dopamine neurons' mismatch sensitivities be reconciled with their computation of reward prediction errors?多巴胺神经元的失配敏感性的明显适应性能否与它们对奖励预测误差的计算相协调?
Neurosci Lett. 2008 Jun 13;438(1):14-6. doi: 10.1016/j.neulet.2008.04.059. Epub 2008 Apr 22.
4
Neuropeptide co-release with GABA may explain functional non-monotonic uncertainty responses in dopamine neurons.神经肽与γ-氨基丁酸的共同释放可能解释多巴胺神经元中功能性非单调不确定性反应。
Neurosci Lett. 2008 Jan 17;430(3):218-23. doi: 10.1016/j.neulet.2007.10.039. Epub 2007 Nov 6.
5
GABAergic control of substantia nigra dopaminergic neurons.黑质多巴胺能神经元的γ-氨基丁酸能调控
Prog Brain Res. 2007;160:189-208. doi: 10.1016/S0079-6123(06)60011-3.
6
PVLV: the primary value and learned value Pavlovian learning algorithm.PVLV:主要价值与习得价值的巴甫洛夫学习算法
Behav Neurosci. 2007 Feb;121(1):31-49. doi: 10.1037/0735-7044.121.1.31.
7
Neurokinin-1 and neurokinin-3 receptors in primate substantia nigra.灵长类动物黑质中的神经激肽-1和神经激肽-3受体
Neurosci Res. 2007 Mar;57(3):362-71. doi: 10.1016/j.neures.2006.11.002. Epub 2006 Nov 28.
8
Frames, biases, and rational decision-making in the human brain.人类大脑中的框架、偏差与理性决策
Science. 2006 Aug 4;313(5787):684-7. doi: 10.1126/science.1128356.
9
Neuropeptides as synaptic transmitters.作为突触递质的神经肽。
Cell Tissue Res. 2006 Nov;326(2):583-98. doi: 10.1007/s00441-006-0268-3. Epub 2006 Jul 18.
10
Representation of action-specific reward values in the striatum.纹状体中特定动作奖励值的表征。
Science. 2005 Nov 25;310(5752):1337-40. doi: 10.1126/science.1115270.