背侧纹状体中的多巴胺释放平台和结果信号与经典的强化学习公式形成对比。

Dopamine release plateau and outcome signals in dorsal striatum contrast with classic reinforcement learning formulations.

机构信息

McGovern Institute for Brain Research and Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, 43 Vassar St., Cambridge, MA, 02139, USA.

Advanced Imaging Research Center, University of Texas, Southwestern Medical Center, Dallas, TX, 75390, USA.

出版信息

Nat Commun. 2024 Oct 14;15(1):8856. doi: 10.1038/s41467-024-53176-7.

DOI:10.1038/s41467-024-53176-7

PMID:39402067

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11473536/

Abstract

We recorded dopamine release signals in centromedial and centrolateral sectors of the striatum as mice learned consecutive versions of visual cue-outcome conditioning tasks. Dopamine release responses differed for the centromedial and centrolateral sites. In neither sector could these be accounted for by classic reinforcement learning alone as classically applied to the activity of nigral dopamine-containing neurons. Medially, cue responses ranged from initial sharp peaks to modulated plateau responses; outcome (reward) responses during cue conditioning were minimal or, initially, negative. At centrolateral sites, by contrast, strong, transient dopamine release responses occurred at both cue and outcome. Prolonged, plateau release responses to cues emerged in both regions when discriminative behavioral responses became required. At most sites, we found no evidence for a transition from outcome signaling to cue signaling, a hallmark of temporal difference reinforcement learning as applied to midbrain dopaminergic neuronal activity. These findings delineate a reshaping of striatal dopamine release activity during learning and suggest that current views of reward prediction error encoding need review to accommodate distinct learning-related spatial and temporal patterns of striatal dopamine release in the dorsal striatum.

摘要

我们在中脑边缘和中脑侧区记录了多巴胺释放信号，因为老鼠学习了连续的视觉线索-结果条件作用任务版本。多巴胺释放反应在中脑边缘和中脑侧区有所不同。在这两个区域，经典强化学习都不能单独解释这些反应，因为经典强化学习应用于含有多巴胺的黑质神经元的活动。在中脑，线索反应从最初的急剧峰值到调制的平台反应不等；在线索条件作用期间，结果（奖励）反应最小化或最初为负。相比之下，在中脑侧区，线索和结果都会产生强烈的、短暂的多巴胺释放反应。当需要区分行为反应时，两个区域都会出现对线索的延长、平台释放反应。在大多数部位，我们没有发现从结果信号到线索信号的转变的证据，这是应用于中脑多巴胺能神经元活动的时间差强化学习的一个标志。这些发现描绘了学习过程中纹状体多巴胺释放活动的重塑，并表明需要重新审视当前关于奖励预测误差编码的观点，以适应背侧纹状体中纹状体多巴胺释放的不同学习相关的空间和时间模式。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9862/11473536/3825e71eaca3/41467_2024_53176_Fig1_HTML.jpg

相似文献

Dopamine release plateau and outcome signals in dorsal striatum contrast with classic reinforcement learning formulations.背侧纹状体中的多巴胺释放平台和结果信号与经典的强化学习公式形成对比。

Nat Commun. 2024 Oct 14;15(1):8856. doi: 10.1038/s41467-024-53176-7.

Dopamine Release Plateau and Outcome Signals in Dorsal Striatum Contrast with Classic Reinforcement Learning Formulations.背侧纹状体中的多巴胺释放平台和结果信号与经典强化学习公式形成对比。

bioRxiv. 2023 Aug 17:2023.08.15.553421. doi: 10.1101/2023.08.15.553421.

Dopamine neurons drive spatiotemporally heterogeneous striatal dopamine signals during learning.多巴胺神经元在学习过程中驱动纹状体多巴胺信号的时空异质性。

Curr Biol. 2024 Jul 22;34(14):3086-3101.e4. doi: 10.1016/j.cub.2024.05.069. Epub 2024 Jun 25.

Mesostriatal dopamine is sensitive to changes in specific cue-reward contingencies.中脑边缘多巴胺对特定线索-奖励关联的变化敏感。

Sci Adv. 2024 May 31;10(22):eadn4203. doi: 10.1126/sciadv.adn4203. Epub 2024 May 29.

Reward and choice encoding in terminals of midbrain dopamine neurons depends on striatal target.中脑多巴胺神经元终末的奖赏与选择编码取决于纹状体靶点。

Nat Neurosci. 2016 Jun;19(6):845-54. doi: 10.1038/nn.4287. Epub 2016 Apr 25.

Dopamine errors drive excitatory and inhibitory components of backward conditioning in an outcome-specific manner.多巴胺错误以特定于结果的方式驱动反向条件作用的兴奋性和抑制性成分。

Curr Biol. 2022 Jul 25;32(14):3210-3218.e3. doi: 10.1016/j.cub.2022.06.035. Epub 2022 Jun 24.

Phasic dopamine release induced by positive feedback predicts individual differences in reversal learning.由正反馈诱导的阶段性多巴胺释放可预测逆向学习中的个体差异。

Neurobiol Learn Mem. 2015 Nov;125:135-45. doi: 10.1016/j.nlm.2015.08.011. Epub 2015 Sep 5.

Distinct temporal difference error signals in dopamine axons in three regions of the striatum in a decision-making task.在一个决策任务中，纹状体三个区域的多巴胺轴突中存在明显的时间差异错误信号。

Elife. 2020 Dec 21;9:e62390. doi: 10.7554/eLife.62390.

Dopamine prediction error signaling in a unique nigrostriatal circuit is critical for associative fear learning.在一个独特的黑质纹状体回路中，多巴胺预测误差信号传导对于关联性恐惧学习至关重要。

Nat Commun. 2025 Mar 29;16(1):3066. doi: 10.1038/s41467-025-58382-5.

Regionally distinct phasic dopamine release patterns in the striatum during reversal learning.在逆向学习过程中，纹状体内区域特异性的阶段性多巴胺释放模式。

Neuroscience. 2017 Mar 14;345:110-123. doi: 10.1016/j.neuroscience.2016.05.011. Epub 2016 May 13.

本文引用的文献

Learning to express reward prediction error-like dopaminergic activity requires plastic representations of time.学习表达类似于奖励预测误差的多巴胺能活动需要时间的可塑性表示。

Nat Commun. 2024 Jul 12;15(1):5856. doi: 10.1038/s41467-024-50205-3.

A feature-specific prediction error model explains dopaminergic heterogeneity.一种具有特征特异性的预测误差模型解释了多巴胺能异质性。

Nat Neurosci. 2024 Aug;27(8):1574-1586. doi: 10.1038/s41593-024-01689-1. Epub 2024 Jul 3.

Improved green and red GRAB sensors for monitoring dopaminergic activity in vivo.用于监测体内多巴胺能活动的改良绿色和红色 GRAB 传感器。

Nat Methods. 2024 Apr;21(4):680-691. doi: 10.1038/s41592-023-02100-w. Epub 2023 Nov 30.

Intrinsic dopamine and acetylcholine dynamics in the striatum of mice.小鼠纹状体中的多巴胺和乙酰胆碱的内在动力学。

Nature. 2023 Sep;621(7979):543-549. doi: 10.1038/s41586-023-05995-9. Epub 2023 Aug 9.

Dopamine and glutamate regulate striatal acetylcholine in decision-making.多巴胺和谷氨酸调节决策中的纹状体乙酰胆碱。

Nature. 2023 Sep;621(7979):577-585. doi: 10.1038/s41586-023-06492-9. Epub 2023 Aug 9.

Unique functional responses differentially map onto genetic subtypes of dopamine neurons.独特的功能反应差异映射到多巴胺神经元的遗传亚型上。

Nat Neurosci. 2023 Oct;26(10):1762-1774. doi: 10.1038/s41593-023-01401-9. Epub 2023 Aug 3.

Dopaminergic prediction errors in the ventral tegmental area reflect a multithreaded predictive model.腹侧被盖区的多巴胺预测误差反映了一个多线程的预测模型。

Nat Neurosci. 2023 May;26(5):830-839. doi: 10.1038/s41593-023-01310-x. Epub 2023 Apr 20.

Striosomes and Matrisomes: Scaffolds for Dynamic Coupling of Volition and Action.纹状体和基质体：意志和行动动态耦联的支架。

Annu Rev Neurosci. 2023 Jul 10;46:359-380. doi: 10.1146/annurev-neuro-121522-025740. Epub 2023 Apr 17.

Distributed processing for value-based choice by prelimbic circuits targeting anterior-posterior dorsal striatal subregions in male mice.前额皮质回路通过靶向雄性小鼠前后背纹状体内部分区进行基于价值的选择的分布式处理。

Nat Commun. 2023 Apr 6;14(1):1920. doi: 10.1038/s41467-023-36795-4.

Mesolimbic dopamine adapts the rate of learning from action.中脑边缘多巴胺适应动作学习的速度。

Nature. 2023 Feb;614(7947):294-302. doi: 10.1038/s41586-022-05614-z. Epub 2023 Jan 18.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

背侧纹状体中的多巴胺释放平台和结果信号与经典的强化学习公式形成对比。

Dopamine release plateau and outcome signals in dorsal striatum contrast with classic reinforcement learning formulations.

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献