• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

强化学习的动机神经回路。

Motivational neural circuits underlying reinforcement learning.

机构信息

Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda, Maryland, USA.

出版信息

Nat Neurosci. 2017 Mar 29;20(4):505-512. doi: 10.1038/nn.4506.

DOI:10.1038/nn.4506
PMID:28352111
Abstract

Reinforcement learning (RL) is the behavioral process of learning the values of actions and objects. Most models of RL assume that the dopaminergic prediction error signal drives plasticity in frontal-striatal circuits. The striatum then encodes value representations that drive decision processes. However, the amygdala has also been shown to play an important role in forming Pavlovian stimulus-outcome associations. These Pavlovian associations can drive motivated behavior via the amygdala projections to the ventral striatum or the ventral tegmental area. The amygdala may, therefore, play a central role in RL. Here we compare the contributions of the amygdala and the striatum to RL and show that both the amygdala and striatum learn and represent expected values in RL tasks. Furthermore, value representations in the striatum may be inherited, to some extent, from the amygdala. The striatum may, therefore, play less of a primary role in learning stimulus-outcome associations in RL than previously suggested.

摘要

强化学习(RL)是学习行为和对象值的行为过程。大多数 RL 模型假设多巴胺预测误差信号驱动额纹状体系电路的可塑性。然后,纹状体对驱动决策过程的值表示进行编码。然而,杏仁核在形成条件刺激-反应关联方面也被证明具有重要作用。这些条件刺激-反应关联可以通过杏仁核投射到腹侧纹状体或腹侧被盖区来驱动动机行为。因此,杏仁核可能在 RL 中发挥核心作用。在这里,我们比较了杏仁核和纹状体对 RL 的贡献,并表明杏仁核和纹状体都在 RL 任务中学习和表示预期值。此外,纹状体中的价值表示在某种程度上可能是从杏仁核继承而来的。因此,与之前的建议相比,纹状体在 RL 中学习刺激-反应关联的主要作用可能较小。

相似文献

1
Motivational neural circuits underlying reinforcement learning.强化学习的动机神经回路。
Nat Neurosci. 2017 Mar 29;20(4):505-512. doi: 10.1038/nn.4506.
2
Effects of Ventral Striatum Lesions on Stimulus-Based versus Action-Based Reinforcement Learning.腹侧纹状体损伤对基于刺激与基于动作的强化学习的影响。
J Neurosci. 2017 Jul 19;37(29):6902-6914. doi: 10.1523/JNEUROSCI.0631-17.2017. Epub 2017 Jun 16.
3
Amygdala and Ventral Striatum Make Distinct Contributions to Reinforcement Learning.杏仁核与腹侧纹状体对强化学习有不同贡献。
Neuron. 2016 Oct 19;92(2):505-517. doi: 10.1016/j.neuron.2016.09.025. Epub 2016 Oct 6.
4
Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation.强化学习中的遗忘将持续的多巴胺信号与动机联系起来。
PLoS Comput Biol. 2016 Oct 13;12(10):e1005145. doi: 10.1371/journal.pcbi.1005145. eCollection 2016 Oct.
5
Distinct prediction errors in mesostriatal circuits of the human brain mediate learning about the values of both states and actions: evidence from high-resolution fMRI.人类大脑中脑纹状体回路中不同的预测误差介导了对状态和动作价值的学习:来自高分辨率功能磁共振成像的证据。
PLoS Comput Biol. 2017 Oct 19;13(10):e1005810. doi: 10.1371/journal.pcbi.1005810. eCollection 2017 Oct.
6
Hypothalamic Interactions with Large-Scale Neural Circuits Underlying Reinforcement Learning and Motivated Behavior.下丘脑与强化学习和动机行为相关的大规模神经回路的相互作用。
Trends Neurosci. 2020 Sep;43(9):681-694. doi: 10.1016/j.tins.2020.06.006. Epub 2020 Aug 3.
7
The motivational role of the ventral striatum and amygdala in learning from gains and losses.腹侧纹状体和杏仁核在从得失中学习的激励作用。
Behav Neurosci. 2023 Aug;137(4):268-280. doi: 10.1037/bne0000558. Epub 2023 May 4.
8
Drive and Reinforcement Circuitry in the Brain: Origins, Neurotransmitters, and Projection Fields.大脑的驱动和强化回路:起源、神经递质和投射区域。
Neuropsychopharmacology. 2018 Mar;43(4):680-689. doi: 10.1038/npp.2017.228. Epub 2017 Oct 6.
9
Striatal contributions to reward and decision making: making sense of regional variations in a reiterated processing matrix.纹状体对奖励和决策的贡献:理解重复处理矩阵中的区域差异
Ann N Y Acad Sci. 2007 May;1104:192-212. doi: 10.1196/annals.1390.016. Epub 2007 Apr 7.
10
Reinforcement learning models and their neural correlates: An activation likelihood estimation meta-analysis.强化学习模型及其神经关联:一项激活可能性估计元分析。
Cogn Affect Behav Neurosci. 2015 Jun;15(2):435-59. doi: 10.3758/s13415-015-0338-7.

引用本文的文献

1
Non-invasive Ultrasonic Neuromodulation of the Human Nucleus Accumbens Impacts Reward Sensitivity.对人类伏隔核进行非侵入性超声神经调节会影响奖赏敏感性。
bioRxiv. 2025 Aug 6:2024.07.25.605068. doi: 10.1101/2024.07.25.605068.
2
Higher-order and distributed synergistic functional interactions encode information gain in goal-directed learning.高阶和分布式协同功能相互作用在目标导向学习中编码信息增益。
Nat Commun. 2025 Aug 5;16(1):7179. doi: 10.1038/s41467-025-62507-1.
3
Schizophrenia-associated 22q11.2 deletion elevates striatal acetylcholine and disrupts thalamostriatal projections to produce amotivation in mice.
与精神分裂症相关的22q11.2缺失会提高纹状体乙酰胆碱水平,并破坏丘脑纹状体投射,从而在小鼠中产生无动机状态。
bioRxiv. 2025 Jun 21:2025.06.20.660782. doi: 10.1101/2025.06.20.660782.
4
Empowering safer socially sensitive autonomous vehicles using human-plausible cognitive encoding.利用似人类认知编码赋能更安全的社会敏感型自动驾驶车辆。
Proc Natl Acad Sci U S A. 2025 May 27;122(21):e2401626122. doi: 10.1073/pnas.2401626122. Epub 2025 May 19.
5
Transcutaneous Vagus Nerve Stimulation Enhances Probabilistic Learning.经皮迷走神经刺激增强概率学习。
Psychophysiology. 2025 Mar;62(3):e70037. doi: 10.1111/psyp.70037.
6
Dorsal-Ventral Reinforcement Learning Network Connectivity and Incentive-Driven Changes in Exploration.背腹侧强化学习网络连接性与探索中动机驱动的变化
J Neurosci. 2025 Apr 9;45(15):e0422242025. doi: 10.1523/JNEUROSCI.0422-24.2025.
7
Pharmacological Modulation of Dopamine Receptors Reveals Distinct Brain-Wide Networks Associated with Learning and Motivation in Nonhuman Primates.多巴胺受体的药理学调节揭示了与非人灵长类动物学习和动机相关的独特全脑网络。
J Neurosci. 2025 Feb 5;45(6):e1301242024. doi: 10.1523/JNEUROSCI.1301-24.2024.
8
Neurophysiological and Autonomic Dynamics of Threat Processing during Sustained Social Fear Generalization.持续性社交恐惧泛化过程中威胁处理的神经生理和自主神经动力学
J Cogn Neurosci. 2025 Feb 1;37(2):482-497. doi: 10.1162/jocn_a_02276.
9
Ventral frontostriatal circuitry mediates the computation of reinforcement from symbolic gains and losses.腹侧额眶皮质环路介导了来自符号收益和损失的强化计算。
Neuron. 2024 Nov 20;112(22):3782-3795.e5. doi: 10.1016/j.neuron.2024.08.018. Epub 2024 Sep 24.
10
Beta activity in human anterior cingulate cortex mediates reward biases.人类前扣带皮层的β活动介导了奖励偏见。
Nat Commun. 2024 Jul 15;15(1):5528. doi: 10.1038/s41467-024-49600-7.