Dopamine errors drive excitatory and inhibitory components of backward conditioning in an outcome-specific manner.

Author information

Department of Psychology, University of California, Los Angeles, Portola Plaza, Los Angeles, CA 91602, USA.

Publication information

Curr Biol. 2022 Jul 25;32(14):3210-3218.e3. doi: 10.1016/j.cub.2022.06.035. Epub 2022 Jun 24.

DOI: 10.1016/j.cub.2022.06.035
PMID: 35752165
Abstract

For over two decades, phasic activity in midbrain dopamine neurons was considered synonymous with the prediction error in temporal-difference reinforcement learning. Central to this proposal is the notion that reward-predictive stimuli become endowed with the scalar value of predicted rewards. When these cues are subsequently encountered, their predictive value is compared to the value of the actual reward received, allowing for the calculation of prediction errors. Phasic firing of dopamine neurons was proposed to reflect this computation, facilitating the backpropagation of value from the predicted reward to the reward-predictive stimulus, thus reducing future prediction errors. There are two critical assumptions of this proposal: (1) that dopamine errors can only facilitate learning about scalar value and not more complex features of predicted rewards, and (2) that the dopamine signal can only be involved in anticipatory cue-reward learning in which cues or actions precede rewards. Recent work has challenged the first assumption, demonstrating that phasic dopamine signals across species are involved in learning about more complex features of the predicted outcomes, in a manner that transcends this value computation. Here, we tested the validity of the second assumption. Specifically, we examined whether phasic midbrain dopamine activity would be necessary for backward conditioning (when a neutral cue reliably follows a rewarding outcome). Using a specific Pavlovian-to-instrumental transfer (PIT) procedure, we show rats learn both excitatory and inhibitory components of a backward association, and that this association entails knowledge of the specific identity of the reward and cue. We demonstrate that brief optogenetic inhibition of VTA neurons timed to the transition between the reward and cue reduces both of these components of backward conditioning. These findings suggest VTA neurons are capable of facilitating associations between contiguously occurring events, regardless of the content of those events. We conclude that these data may be in line with suggestions that the VTA error acts as a universal teaching signal. This may provide insight into why dopamine function has been implicated in myriad psychological disorders that are characterized by very distinct reinforcement-learning deficits.
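
The temporal-difference account summarized in the abstract (a cue's predicted value is compared with the reward received, and the resulting error moves value back onto the cue) can be illustrated with a toy simulation. This is a minimal sketch and not the authors' model: the two-step trial layout, the function name `td_learning`, and the parameters `alpha` (learning rate) and `gamma` (discount factor) are all assumptions chosen for illustration.

```python
def td_learning(n_trials=200, alpha=0.1, gamma=1.0, reward=1.0):
    """Toy TD(0) simulation of forward cue -> reward conditioning.

    Hypothetical trial layout (an assumption, not taken from the paper):
      state 0 = cue, state 1 = reward delivery, state 2 = end of trial.
    """
    values = [0.0, 0.0, 0.0]   # learned value of each state; state 2 stays at 0
    errors_at_reward = []      # prediction error when the reward arrives

    for _ in range(n_trials):
        # Cue -> reward-delivery transition: no reward is received yet.
        delta_cue = 0.0 + gamma * values[1] - values[0]
        values[0] += alpha * delta_cue

        # Reward-delivery -> end transition: the reward is received here.
        delta_reward = reward + gamma * values[2] - values[1]
        values[1] += alpha * delta_reward
        errors_at_reward.append(delta_reward)

    return values, errors_at_reward


if __name__ == "__main__":
    values, errors = td_learning()
    print("cue value after training:", round(values[0], 3))
    print("first and last error at reward:", errors[0], errors[-1])
```

In this sketch the error at reward starts at the full reward magnitude and shrinks across trials while the cue's value grows, which is the forward, anticipatory case. It has no mechanism for a cue that follows the reward, the backward arrangement the study tests.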

Similar articles

1. Dopamine errors drive excitatory and inhibitory components of backward conditioning in an outcome-specific manner.
   Curr Biol. 2022 Jul 25;32(14):3210-3218.e3. doi: 10.1016/j.cub.2022.06.035. Epub 2022 Jun 24.
2. Dopamine Release in the Nucleus Accumbens Core Encodes the General Excitatory Components of Learning.
   J Neurosci. 2024 Aug 28;44(35):e0120242024. doi: 10.1523/JNEUROSCI.0120-24.2024.
3. Cue and Reward Evoked Dopamine Activity Is Necessary for Maintaining Learned Pavlovian Associations.
   J Neurosci. 2021 Jun 9;41(23):5004-5014. doi: 10.1523/JNEUROSCI.2744-20.2021. Epub 2021 Apr 22.
4. Ventral Tegmental Dopamine Neurons Participate in Reward Identity Predictions.
   Curr Biol. 2019 Jan 7;29(1):93-103.e3. doi: 10.1016/j.cub.2018.11.050. Epub 2018 Dec 20.
5. Dopamine projections to the basolateral amygdala drive the encoding of identity-specific reward memories.
   Nat Neurosci. 2024 Apr;27(4):728-736. doi: 10.1038/s41593-024-01586-7. Epub 2024 Feb 23.
6. Tonic or Phasic Stimulation of Dopaminergic Projections to Prefrontal Cortex Causes Mice to Maintain or Deviate from Previously Learned Behavioral Strategies.
   J Neurosci. 2017 Aug 30;37(35):8315-8329. doi: 10.1523/JNEUROSCI.1221-17.2017. Epub 2017 Jul 24.
7. Optogenetic Blockade of Dopamine Transients Prevents Learning Induced by Changes in Reward Features.
   Curr Biol. 2017 Nov 20;27(22):3480-3486.e3. doi: 10.1016/j.cub.2017.09.049. Epub 2017 Nov 2.
8. Adolescent Dopamine Neurons Represent Reward Differently during Action and State Guided Learning.
   J Neurosci. 2021 Nov 10;41(45):9419-9430. doi: 10.1523/JNEUROSCI.1321-21.2021. Epub 2021 Oct 5.
9. A causal link between prediction errors, dopamine neurons and learning.
   Nat Neurosci. 2013 Jul;16(7):966-73. doi: 10.1038/nn.3413. Epub 2013 May 26.
10. Predictive reward signal of dopamine neurons.
    J Neurophysiol. 1998 Jul;80(1):1-27. doi: 10.1152/jn.1998.80.1.1.

Cited by

1. A Bio-Inspired Dopamine Model for Robots with Autonomous Decision-Making.
   Biomimetics (Basel). 2024 Aug 21;9(8):504. doi: 10.3390/biomimetics9080504.
2. Humans adaptively deploy forward and backward prediction.
   Nat Hum Behav. 2024 Sep;8(9):1726-1737. doi: 10.1038/s41562-024-01930-8. Epub 2024 Jul 16.
3. Dopamine Release in the Nucleus Accumbens Core Encodes the General Excitatory Components of Learning.
   J Neurosci. 2024 Aug 28;44(35):e0120242024. doi: 10.1523/JNEUROSCI.0120-24.2024.
4. Age-related changes of dopamine D1 and D2 receptors expression in parvalbumin-positive cells of the orbitofrontal and prelimbic cortices of mice.
   Front Neurosci. 2024 Jun 6;18:1364067. doi: 10.3389/fnins.2024.1364067. eCollection 2024.
5. Mesostriatal dopamine is sensitive to changes in specific cue-reward contingencies.
   Sci Adv. 2024 May 31;10(22):eadn4203. doi: 10.1126/sciadv.adn4203. Epub 2024 May 29.
6. Dopamine projections to the basolateral amygdala drive the encoding of identity-specific reward memories.
   Nat Neurosci. 2024 Apr;27(4):728-736. doi: 10.1038/s41593-024-01586-7. Epub 2024 Feb 23.
7. Audible pain squeaks can mediate emotional contagion across pre-exposed rats with a potential effect of auto-conditioning.
   Commun Biol. 2023 Oct 25;6(1):1085. doi: 10.1038/s42003-023-05474-x.
8. Association learning is impaired in insulin resistance and restored by liraglutide.
   Nat Metab. 2023 Aug;5(8):1262-1263. doi: 10.1038/s42255-023-00870-3.
9. Mesolimbic dopamine release conveys causal associations.
   Science. 2022 Dec 23;378(6626):eabq6740. doi: 10.1126/science.abq6740.