• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多巴胺在学习和行动推断中的作用。

Dopamine role in learning and action inference.

机构信息

MRC Brain Networks Dynamics Unit, University of Oxford, Oxford, United Kingdom.

出版信息

Elife. 2020 Jul 7;9:e53262. doi: 10.7554/eLife.53262.

DOI:10.7554/eLife.53262
PMID:32633715
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7392608/
Abstract

This paper describes a framework for modelling dopamine function in the mammalian brain. It proposes that both learning and action planning involve processes minimizing prediction errors encoded by dopaminergic neurons. In this framework, dopaminergic neurons projecting to different parts of the striatum encode errors in predictions made by the corresponding systems within the basal ganglia. The dopaminergic neurons encode differences between rewards and expectations in the goal-directed system, and differences between the chosen and habitual actions in the habit system. These prediction errors trigger learning about rewards and habit formation, respectively. Additionally, dopaminergic neurons in the goal-directed system play a key role in action planning: They compute the difference between a desired reward and the reward expected from the current motor plan, and they facilitate action planning until this difference diminishes. Presented models account for dopaminergic responses during movements, effects of dopamine depletion on behaviour, and make several experimental predictions.

摘要

本文提出了一个用于模拟哺乳动物大脑中多巴胺功能的框架。该框架认为学习和动作规划都涉及到最小化由多巴胺能神经元编码的预测误差的过程。在这个框架中,投射到纹状体不同部位的多巴胺能神经元编码了基底神经节内相应系统做出的预测误差。多巴胺能神经元在目标导向系统中编码了奖励与期望之间的差异,在习惯系统中编码了选择动作与习惯动作之间的差异。这些预测误差分别触发了关于奖励和习惯形成的学习。此外,目标导向系统中的多巴胺能神经元在动作规划中起着关键作用:它们计算期望奖励与当前运动计划预期奖励之间的差异,并在这个差异减小时促进动作规划。提出的模型解释了运动过程中的多巴胺能反应、多巴胺耗竭对行为的影响,并做出了一些实验预测。

相似文献

1
Dopamine role in learning and action inference.多巴胺在学习和行动推断中的作用。
Elife. 2020 Jul 7;9:e53262. doi: 10.7554/eLife.53262.
2
Involvement of basal ganglia and orbitofrontal cortex in goal-directed behavior.基底神经节和眶额皮质在目标导向行为中的参与。
Prog Brain Res. 2000;126:193-215. doi: 10.1016/S0079-6123(00)26015-9.
3
Modeling the effects of motivation on choice and learning in the basal ganglia.在基底神经节中建模动机对选择和学习的影响。
PLoS Comput Biol. 2020 May 26;16(5):e1007465. doi: 10.1371/journal.pcbi.1007465. eCollection 2020 May.
4
Reward functions of the basal ganglia.基底神经节的奖赏功能。
J Neural Transm (Vienna). 2016 Jul;123(7):679-693. doi: 10.1007/s00702-016-1510-0. Epub 2016 Feb 2.
5
Reward prediction error does not explain movement selectivity in DMS-projecting dopamine neurons.奖励预测误差不能解释投射到 DMS 的多巴胺神经元的运动选择性。
Elife. 2019 Apr 4;8:e42992. doi: 10.7554/eLife.42992.
6
Subsecond dopamine fluctuations in human striatum encode superposed error signals about actual and counterfactual reward.人类纹状体中次秒级的多巴胺波动编码了关于实际和反事实奖励的叠加误差信号。
Proc Natl Acad Sci U S A. 2016 Jan 5;113(1):200-5. doi: 10.1073/pnas.1513619112. Epub 2015 Nov 23.
7
Feasibility of dopamine as a vector-valued feedback signal in the basal ganglia.多巴胺作为基底神经节中向量值反馈信号的可行性。
Proc Natl Acad Sci U S A. 2023 Aug 8;120(32):e2221994120. doi: 10.1073/pnas.2221994120. Epub 2023 Aug 1.
8
Striatal dopamine ramping may indicate flexible reinforcement learning with forgetting in the cortico-basal ganglia circuits.纹状体多巴胺爬坡可能表明皮质基底神经节回路具有灵活的强化学习和遗忘能力。
Front Neural Circuits. 2014 Apr 9;8:36. doi: 10.3389/fncir.2014.00036. eCollection 2014.
9
Phasic dopamine as a prediction error of intrinsic and extrinsic reinforcements driving both action acquisition and reward maximization: a simulated robotic study.相位多巴胺作为内在和外在强化的预测误差,驱动着动作获取和奖励最大化:一项模拟机器人研究。
Neural Netw. 2013 Mar;39:40-51. doi: 10.1016/j.neunet.2012.12.012. Epub 2013 Jan 14.
10
A Selective Role for Dopamine in Learning to Maximize Reward But Not to Minimize Effort: Evidence from Patients with Parkinson's Disease.多巴胺在学习以最大化奖励而非最小化努力中的选择性作用:来自帕金森病患者的证据。
J Neurosci. 2017 Jun 21;37(25):6087-6097. doi: 10.1523/JNEUROSCI.2081-16.2017. Epub 2017 May 24.

引用本文的文献

1
Modelling cognitive flexibility with deep neural networks.使用深度神经网络对认知灵活性进行建模。
Curr Opin Behav Sci. 2024 Jun;57:101361. doi: 10.1016/j.cobeha.2024.101361.
2
Tabula rasa agents display emergent in-group behavior.白板智能体表现出新兴的群体行为。
Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2319947121. doi: 10.1073/pnas.2319947121. Epub 2025 Jun 16.
3
The interoceptive origin of reinforcement learning.强化学习的内感受起源

本文引用的文献

1
Modeling the effects of motivation on choice and learning in the basal ganglia.在基底神经节中建模动机对选择和学习的影响。
PLoS Comput Biol. 2020 May 26;16(5):e1007465. doi: 10.1371/journal.pcbi.1007465. eCollection 2020 May.
2
Dopaminergic Transmission Rapidly and Persistently Enhances Excitability of D1 Receptor-Expressing Striatal Projection Neurons.多巴胺能传递快速且持久地增强 D1 受体表达的纹状体投射神经元的兴奋性。
Neuron. 2020 Apr 22;106(2):277-290.e6. doi: 10.1016/j.neuron.2020.01.028. Epub 2020 Feb 18.
3
Time-dependent competition between goal-directed and habitual response preparation.
Trends Cogn Sci. 2025 Sep;29(9):840-854. doi: 10.1016/j.tics.2025.05.008. Epub 2025 Jun 10.
4
Striatal Dopamine Actions and Movement: Inferences from Parkinson Disease.纹状体多巴胺的作用与运动:来自帕金森病的推断
J Neurosci. 2025 Jun 11;45(24):e0022252025. doi: 10.1523/JNEUROSCI.0022-25.2025.
5
Dopaminergic action prediction errors serve as a value-free teaching signal.多巴胺能动作预测误差作为一种无价值的教学信号。
Nature. 2025 May 14. doi: 10.1038/s41586-025-09008-9.
6
Noradrenergic and Dopaminergic modulation of meta-cognition and meta-control.去甲肾上腺素能和多巴胺能对元认知和元控制的调节。
PLoS Comput Biol. 2025 Feb 26;21(2):e1012675. doi: 10.1371/journal.pcbi.1012675. eCollection 2025 Feb.
7
Hippocampal neuronal activity is aligned with action plans.海马体神经元活动与行动计划一致。
Nature. 2025 Mar;639(8053):153-161. doi: 10.1038/s41586-024-08397-7. Epub 2025 Jan 8.
8
Substantial Heritability Underlies Fairness Norm Adaptation Capability and its Neural Basis.公平规范适应能力及其神经基础存在显著遗传力。
Adv Sci (Weinh). 2025 Mar;12(9):e2411070. doi: 10.1002/advs.202411070. Epub 2024 Dec 16.
9
Iron deficiency negatively affects behavioral measures of learning, indirect neural measures of dopamine, and neural efficiency.缺铁对学习的行为指标、多巴胺的间接神经指标以及神经效率产生负面影响。
Cogn Affect Behav Neurosci. 2025 Feb;25(1):89-113. doi: 10.3758/s13415-024-01241-5. Epub 2024 Dec 5.
10
Reward Bases: A simple mechanism for adaptive acquisition of multiple reward types.奖励基础:一种用于适应性获取多种奖励类型的简单机制。
PLoS Comput Biol. 2024 Nov 19;20(11):e1012580. doi: 10.1371/journal.pcbi.1012580. eCollection 2024 Nov.
目标导向和习惯反应准备之间的时变竞争。
Nat Hum Behav. 2019 Dec;3(12):1252-1262. doi: 10.1038/s41562-019-0725-0. Epub 2019 Sep 30.
4
Where Does Value Come From?价值从何而来?
Trends Cogn Sci. 2019 Oct;23(10):836-850. doi: 10.1016/j.tics.2019.07.012. Epub 2019 Sep 4.
5
Specialized coding of sensory, motor and cognitive variables in VTA dopamine neurons.腹侧被盖区多巴胺神经元中感觉、运动和认知变量的特异性编码。
Nature. 2019 Jun;570(7762):509-513. doi: 10.1038/s41586-019-1261-9. Epub 2019 May 29.
6
Dissociable dopamine dynamics for learning and motivation.学习和动机的多巴胺动态可分离。
Nature. 2019 Jun;570(7759):65-70. doi: 10.1038/s41586-019-1235-y. Epub 2019 May 22.
7
Reward prediction error does not explain movement selectivity in DMS-projecting dopamine neurons.奖励预测误差不能解释投射到 DMS 的多巴胺神经元的运动选择性。
Elife. 2019 Apr 4;8:e42992. doi: 10.7554/eLife.42992.
8
Learning the payoffs and costs of actions.学习行为的收益和成本。
PLoS Comput Biol. 2019 Feb 28;15(2):e1006285. doi: 10.1371/journal.pcbi.1006285. eCollection 2019 Feb.
9
Habits without values.无价值观的习惯。
Psychol Rev. 2019 Mar;126(2):292-311. doi: 10.1037/rev0000120. Epub 2019 Jan 24.
10
Rethinking dopamine as generalized prediction error.重新思考多巴胺作为一般性预测误差。
Proc Biol Sci. 2018 Nov 21;285(1891):20181645. doi: 10.1098/rspb.2018.1645.