学习中获得的相位多巴胺反应的神经机制。

Neural mechanisms of acquired phasic dopamine responses in learning.

机构信息

Department of Psychology and Neuroscience, University of Colorado Boulder, 345 UCB, Boulder, CO 80309, United States.

出版信息

Neurosci Biobehav Rev. 2010 Apr;34(5):701-20. doi: 10.1016/j.neubiorev.2009.11.019. Epub 2009 Nov 26.

DOI:10.1016/j.neubiorev.2009.11.019

PMID:19944716

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2839018/

Abstract

What biological mechanisms underlie the reward-predictive firing properties of midbrain dopaminergic neurons, and how do they relate to the complex constellation of empirical findings understood as Pavlovian and instrumental conditioning? We previously presented PVLV, a biologically inspired Pavlovian learning algorithm accounting for DA activity in terms of two interrelated systems: a primary value (PV) system, which governs how DA cells respond to a US (reward) and; a learned value (LV) system, which governs how DA cells respond to a CS. Here, we provide a more extensive review of the biological mechanisms supporting phasic DA firing and their relation to the spate of Pavlovian conditioning phenomena and their sensitivity to focal brain lesions. We further extend the model by incorporating a new NV (novelty value) component reflecting the ability of novel stimuli to trigger phasic DA firing, providing "novelty bonuses" which encourages exploratory working memory updating and in turn speeds learning in trace conditioning and other working memory-dependent paradigms. The evolving PVLV model builds upon insights developed in many earlier computational models, especially reinforcement learning models based on the ideas of Sutton and Barto, biological models, and the psychological model developed by Savastano and Miller. The PVLV framework synthesizes these various approaches, overcoming important shortcomings of each by providing a coherent and specific mapping to much of the relevant empirical data at both the micro- and macro-levels, and examines their relevance for higher order cognitive functions.

摘要

中脑多巴胺能神经元的奖赏预测发射特性的生物学机制是什么，它们与作为经典条件作用和工具条件作用的复杂经验发现有何关系？我们之前提出了 PVLV，这是一种受生物启发的经典条件作用学习算法，根据两个相互关联的系统来解释 DA 活动：一个是主要价值（PV）系统，它决定了 DA 细胞对 US（奖励）的反应方式；另一个是习得价值（LV）系统，它决定了 DA 细胞对 CS 的反应方式。在这里，我们更全面地回顾了支持相位 DA 发射的生物学机制及其与大量经典条件作用现象的关系，以及它们对焦点脑损伤的敏感性。我们通过引入一个新的 NV（新颖价值）组件进一步扩展了该模型，该组件反映了新刺激触发相位 DA 发射的能力，提供了“新颖性奖励”，鼓励探索性工作记忆更新，从而加快痕迹条件作用和其他依赖工作记忆的范式中的学习。不断发展的 PVLV 模型建立在许多早期计算模型的见解之上，特别是基于 Sutton 和 Barto 的强化学习模型、生物模型以及 Savastano 和 Miller 开发的心理模型。PVLV 框架综合了这些不同的方法，通过为微观和宏观层面的大量相关经验数据提供一致和具体的映射，克服了每种方法的重要缺点，并考察了它们对更高阶认知功能的相关性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/701a/2839018/aa285f3483d6/nihms167036f1.jpg

相似文献

Neural mechanisms of acquired phasic dopamine responses in learning.学习中获得的相位多巴胺反应的神经机制。

Neurosci Biobehav Rev. 2010 Apr;34(5):701-20. doi: 10.1016/j.neubiorev.2009.11.019. Epub 2009 Nov 26.

PVLV: the primary value and learned value Pavlovian learning algorithm.PVLV：主要价值与习得价值的巴甫洛夫学习算法

Behav Neurosci. 2007 Feb;121(1):31-49. doi: 10.1037/0735-7044.121.1.31.

A systems-neuroscience model of phasic dopamine.相位多巴胺的系统神经科学模型。

Psychol Rev. 2020 Nov;127(6):972-1021. doi: 10.1037/rev0000199. Epub 2020 Jun 11.

Absence of NMDA receptors in dopamine neurons attenuates dopamine release but not conditioned approach during Pavlovian conditioning.多巴胺神经元中 NMDA 受体的缺失可减弱多巴胺释放，但不影响条件性趋近反应在 Pavlovian 条件作用中的表达。

Proc Natl Acad Sci U S A. 2010 Jul 27;107(30):13491-6. doi: 10.1073/pnas.1007827107. Epub 2010 Jul 7.

A Specific Component of the Evoked Potential Mirrors Phasic Dopamine Neuron Activity during Conditioning.条件反射期间，诱发电位的一个特定成分反映了相位性多巴胺能神经元的活动。

J Neurosci. 2015 Jul 22;35(29):10451-9. doi: 10.1523/JNEUROSCI.4096-14.2015.

The emergence of saliency and novelty responses from Reinforcement Learning principles.基于强化学习原理的显著性和新颖性反应的出现。

Neural Netw. 2008 Dec;21(10):1493-9. doi: 10.1016/j.neunet.2008.09.004. Epub 2008 Sep 25.

Dopamine errors drive excitatory and inhibitory components of backward conditioning in an outcome-specific manner.多巴胺错误以特定于结果的方式驱动反向条件作用的兴奋性和抑制性成分。

Curr Biol. 2022 Jul 25;32(14):3210-3218.e3. doi: 10.1016/j.cub.2022.06.035. Epub 2022 Jun 24.

Adolescent Dopamine Neurons Represent Reward Differently during Action and State Guided Learning.青少年多巴胺神经元在动作和状态引导学习中对奖励的表现不同。

J Neurosci. 2021 Nov 10;41(45):9419-9430. doi: 10.1523/JNEUROSCI.1321-21.2021. Epub 2021 Oct 5.

Alternative time representation in dopamine models.多巴胺模型中的交替时间表示。

J Comput Neurosci. 2010 Feb;28(1):107-30. doi: 10.1007/s10827-009-0191-1. Epub 2009 Oct 22.

Ablation of NMDA receptors in dopamine neurons disrupts attribution of incentive salience to reward-paired stimuli.多巴胺能神经元中NMDA受体的消融破坏了对奖励配对刺激的动机显著性归因。

Behav Brain Res. 2019 May 2;363:77-82. doi: 10.1016/j.bbr.2019.01.037. Epub 2019 Jan 31.

引用本文的文献

Therapeutic ketogenic diet as treatment for anorexia nervosa.治疗性生酮饮食作为神经性厌食症的治疗方法。

Front Nutr. 2024 Sep 4;11:1392135. doi: 10.3389/fnut.2024.1392135. eCollection 2024.

Human Substantia Nigra Neurons Encode Reward Expectations.人类黑质神经元编码奖励预期。

bioRxiv. 2024 May 11:2024.05.10.593406. doi: 10.1101/2024.05.10.593406.

Choice-selective sequences dominate in cortical relative to thalamic inputs to NAc to support reinforcement learning.皮层对 NAc 的输入比丘脑的输入更具有选择选择性，从而支持强化学习。

Cell Rep. 2022 May 17;39(7):110756. doi: 10.1016/j.celrep.2022.110756.

Modulation of Dopamine for Adaptive Learning: A Neurocomputational Model.多巴胺对适应性学习的调节：一种神经计算模型。

Comput Brain Behav. 2021 Mar;4(1):34-52. doi: 10.1007/s42113-020-00083-x. Epub 2020 Jun 12.

Wave-like dopamine dynamics as a mechanism for spatiotemporal credit assignment.波状多巴胺动力学作为时空信用分配的机制。

Cell. 2021 May 13;184(10):2733-2749.e16. doi: 10.1016/j.cell.2021.03.046. Epub 2021 Apr 15.

A systems-neuroscience model of phasic dopamine.相位多巴胺的系统神经科学模型。

Psychol Rev. 2020 Nov;127(6):972-1021. doi: 10.1037/rev0000199. Epub 2020 Jun 11.

Unraveling the Mysteries of Motivation.揭开动机的神秘面纱。

Trends Cogn Sci. 2020 Jun;24(6):425-434. doi: 10.1016/j.tics.2020.03.001. Epub 2020 Apr 3.

The Unexplored Territory of Neural Models: Potential Guides for Exploring the Function of Metabotropic Neuromodulation.神经模型的未知领域：探索代谢型神经调节功能的潜在指南。

Neuroscience. 2021 Feb 21;456:143-158. doi: 10.1016/j.neuroscience.2020.03.048. Epub 2020 Apr 8.

Synchronicity: The Role of Midbrain Dopamine in Whole-Brain Coordination.同步性：中脑多巴胺在全脑协调中的作用。

eNeuro. 2019 May 3;6(2). doi: 10.1523/ENEURO.0345-18.2019. Print 2019 Mar/Apr.

The Virtual Personalities Neural Network Model: Neurobiological Underpinnings.虚拟人格神经网络模型：神经生物学基础

Personal Neurosci. 2018 Aug 10;1. doi: 10.1017/pen.2018.6.

本文引用的文献

Conditioned reflexes: An investigation of the physiological activity of the cerebral cortex.条件反射：大脑皮层生理活动的研究

Ann Neurosci. 2010 Jul;17(3):136-41. doi: 10.5214/ans.0972-7531.1017309.

Time as content in Pavlovian conditioning.作为经典条件作用中内容的时间。

Behav Processes. 1998 Dec;44(2):147-62. doi: 10.1016/s0376-6357(98)00046-1.

Pharmacological modulation of subliminal learning in Parkinson's and Tourette's syndromes.药物调节帕金森病和妥瑞氏综合征的潜意识学习。

Proc Natl Acad Sci U S A. 2009 Nov 10;106(45):19179-84. doi: 10.1073/pnas.0904035106. Epub 2009 Oct 22.

Afferent projections to A10 dopaminergic neurones in the rat as shown by the retrograde transport of horseradisd peroxidase.用辣根过氧化物酶逆行运输法显示的大鼠A10多巴胺能神经元的传入投射。

Neurosci Lett. 1978 Oct;9(4):353-9. doi: 10.1016/0304-3940(78)90208-2.

Reward-learning and the novelty-seeking personality: a between- and within-subjects study of the effects of dopamine agonists on young Parkinson's patients.奖赏学习与寻求新奇人格：多巴胺激动剂对年轻帕金森病患者影响的组间和组内研究

Brain. 2009 Sep;132(Pt 9):2385-95. doi: 10.1093/brain/awp094. Epub 2009 May 4.

Striatal dopamine predicts outcome-specific reversal learning and its sensitivity to dopaminergic drug administration.纹状体多巴胺可预测特定结果的逆向学习及其对多巴胺能药物给药的敏感性。

J Neurosci. 2009 Feb 4;29(5):1538-43. doi: 10.1523/JNEUROSCI.4467-08.2009.

Representation of negative motivational value in the primate lateral habenula.灵长类动物外侧缰核中负性动机值的表征。

Nat Neurosci. 2009 Jan;12(1):77-84. doi: 10.1038/nn.2233. Epub 2008 Nov 30.

Understanding risk: a guide for the perplexed.理解风险：给困惑者的指南。

Cogn Affect Behav Neurosci. 2008 Dec;8(4):348-54. doi: 10.3758/CABN.8.4.348.

A role for dopamine in temporal decision making and reward maximization in parkinsonism.多巴胺在帕金森病时间决策和奖励最大化中的作用。

J Neurosci. 2008 Nov 19;28(47):12294-304. doi: 10.1523/JNEUROSCI.3116-08.2008.

A local circuit model of learned striatal and dopamine cell responses under probabilistic schedules of reward.奖励概率安排下习得的纹状体和多巴胺细胞反应的局部回路模型。

J Neurosci. 2008 Oct 1;28(40):10062-74. doi: 10.1523/JNEUROSCI.0259-08.2008.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验