Midbrain dopamine neurons compute inferred and cached value prediction errors in a common framework.

Authors

Sadacca Brian F, Jones Joshua L, Schoenbaum Geoffrey

Affiliations

Intramural Research program of the National Institute on Drug Abuse, National Institutes of Health, Bethesda, United States.

Department of Anatomy and Neurobiology, University of Maryland School of Medicine, Baltimore, United States.

Publication information

Elife. 2016 Mar 7;5:e13665. doi: 10.7554/eLife.13665.

DOI:10.7554/eLife.13665
PMID:26949249
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4805544/
Abstract

Midbrain dopamine neurons have been proposed to signal reward prediction errors as defined in temporal difference (TD) learning algorithms. While these models have been extremely powerful in interpreting dopamine activity, they typically do not use value derived through inference in computing errors. This is important because much real world behavior - and thus many opportunities for error-driven learning - is based on such predictions. Here, we show that error-signaling rat dopamine neurons respond to the inferred, model-based value of cues that have not been paired with reward and do so in the same framework as they track the putative cached value of cues previously paired with reward. This suggests that dopamine neurons access a wider variety of information than contemplated by standard TD models and that, while their firing conforms to predictions of TD models in some cases, they may not be restricted to signaling errors from TD predictions.
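
As a rough illustration of the distinction the abstract draws, the sketch below contrasts a cached TD(0) value with a value inferred through a learned transition model. The two-phase task (cue A followed by cue B without reward, then cue B followed by reward), the parameter values, and all function and variable names are assumptions made for illustration; they are not taken from the abstract and do not represent the authors' analysis.

```python
# Illustrative sketch only: the task layout, parameters, and names are assumptions.

def td_error(reward, v_next, v_current, gamma=1.0):
    """Temporal difference prediction error: delta = r + gamma * V(s') - V(s)."""
    return reward + gamma * v_next - v_current


def run_td(trials, values, alpha=0.1, gamma=1.0):
    """Update cached values in place with TD(0).

    Each trial is a list of (state, reward, next_state) steps;
    next_state is None when the trial ends.
    """
    for trial in trials:
        for state, reward, next_state in trial:
            v_next = 0.0 if next_state is None else values[next_state]
            delta = td_error(reward, v_next, values[state], gamma)
            values[state] += alpha * delta
    return values


def inferred_value(cue, values, transitions, gamma=1.0):
    """Model-based value: expected discounted value of the cue's successors."""
    successors = transitions.get(cue, {})
    return sum(p * gamma * values[nxt] for nxt, p in successors.items())


# Assumed phase 1: cue A is followed by cue B, and no reward is delivered.
phase1 = [[("A", 0.0, "B"), ("B", 0.0, None)]] * 50
# Assumed phase 2: cue B alone is followed by reward.
phase2 = [[("B", 1.0, None)]] * 200

values = {"A": 0.0, "B": 0.0}
run_td(phase1, values)
run_td(phase2, values)

# Assumed transition model learned in phase 1: A is always followed by B.
transitions = {"A": {"B": 1.0}}

cached_a = values["A"]
inferred_a = inferred_value("A", values, transitions)
print(f"cached V(A) = {cached_a:.3f}, inferred V(A) = {inferred_a:.3f}, V(B) = {values['B']:.3f}")
```

Under these assumptions, cue A was never paired with reward, so its cached value stays at zero, while its inferred value approaches that of cue B. A response to cue A that scales with the inferred value is the kind of signal the abstract describes as lying outside standard TD predictions.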


