力调节解释了刺激-奖励学习过程中多巴胺相位信号的变化。

Force tuning explains changes in phasic dopamine signaling during stimulus-reward learning.

作者信息

Bakhurin Konstantin I, Hughes Ryan N, Jiang Qiaochu, Hossain Meghdoot, Gutkin Boris, Fallon Isabella P, Yin Henry

出版信息

bioRxiv. 2023 Jun 7:2023.04.23.537994. doi: 10.1101/2023.04.23.537994.

DOI:10.1101/2023.04.23.537994

PMID:37162997

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10168281/

Abstract

According to a popular hypothesis, phasic dopamine (DA) activity encodes a reward prediction error (RPE) necessary for reinforcement learning. However, recent work showed that DA neurons are necessary for performance rather than learning. One limitation of previous work on phasic DA signaling and RPE is the limited behavioral measures. Here, we measured subtle force exertion while recording and manipulating DA activity in the ventral tegmental area (VTA) during stimulus-reward learning. We found two major populations of DA neurons that increased firing before forward and backward force exertion. Force tuning is the same regardless of learning, reward predictability, or outcome valence. Changes in the pattern of force exertion can explain results traditionally used to support the RPE hypothesis, such as modulation by reward magnitude, probability, and unpredicted reward delivery or omission. Thus VTA DA neurons are not used to signal RPE but to regulate force exertion during motivated behavior.

摘要

根据一个流行的假说，阶段性多巴胺（DA）活动编码强化学习所需的奖励预测误差（RPE）。然而，最近的研究表明，DA神经元对行为表现而非学习是必要的。先前关于阶段性DA信号和RPE的研究的一个局限性是行为测量方法有限。在这里，我们在刺激-奖励学习过程中记录和操纵腹侧被盖区（VTA）的DA活动时，测量了细微的力量施加情况。我们发现了两类主要的DA神经元，它们在向前和向后施加力量之前放电增加。无论学习情况、奖励可预测性或结果效价如何，力量调谐都是相同的。力量施加模式的变化可以解释传统上用于支持RPE假说的结果，例如奖励大小、概率以及意外奖励发放或遗漏所产生的调节作用。因此，VTA DA神经元并非用于发出RPE信号，而是在动机行为期间调节力量施加。

相似文献

Force tuning explains changes in phasic dopamine signaling during stimulus-reward learning.

bioRxiv. 2023 Jun 7:2023.04.23.537994. doi: 10.1101/2023.04.23.537994.

Striatal dopamine ramping may indicate flexible reinforcement learning with forgetting in the cortico-basal ganglia circuits.

Front Neural Circuits. 2014 Apr 9;8:36. doi: 10.3389/fncir.2014.00036. eCollection 2014.

Dopamine errors drive excitatory and inhibitory components of backward conditioning in an outcome-specific manner.

Curr Biol. 2022 Jul 25;32(14):3210-3218.e3. doi: 10.1016/j.cub.2022.06.035. Epub 2022 Jun 24.

Context-Dependent Multiplexing by Individual VTA Dopamine Neurons.

J Neurosci. 2020 Sep 23;40(39):7489-7509. doi: 10.1523/JNEUROSCI.0502-20.2020. Epub 2020 Aug 28.

Minimal Circuit Model of Reward Prediction Error Computations and Effects of Nicotinic Modulations.

Front Neural Circuits. 2019 Jan 8;12:116. doi: 10.3389/fncir.2018.00116. eCollection 2018.

Ventral Tegmental Dopamine Neurons Participate in Reward Identity Predictions.

Curr Biol. 2019 Jan 7;29(1):93-103.e3. doi: 10.1016/j.cub.2018.11.050. Epub 2018 Dec 20.

VTA dopamine neuron activity encodes social interaction and promotes reinforcement learning through social prediction error.

Nat Neurosci. 2022 Jan;25(1):86-97. doi: 10.1038/s41593-021-00972-9. Epub 2021 Dec 2.

Tonic or Phasic Stimulation of Dopaminergic Projections to Prefrontal Cortex Causes Mice to Maintain or Deviate from Previously Learned Behavioral Strategies.

J Neurosci. 2017 Aug 30;37(35):8315-8329. doi: 10.1523/JNEUROSCI.1221-17.2017. Epub 2017 Jul 24.

Modulation of cue-induced firing of ventral tegmental area dopamine neurons by leptin and ghrelin.

Int J Obes (Lond). 2015 Dec;39(12):1742-9. doi: 10.1038/ijo.2015.131. Epub 2015 Jul 17.

A Dual Role Hypothesis of the Cortico-Basal-Ganglia Pathways: Opponency and Temporal Difference Through Dopamine and Adenosine.

Front Neural Circuits. 2019 Jan 7;12:111. doi: 10.3389/fncir.2018.00111. eCollection 2018.

引用本文的文献

The learning primacy hypothesis of dopamine: reconsidering dopamine's dual functions.

Front Cell Neurosci. 2025 Apr 15;19:1538500. doi: 10.3389/fncel.2025.1538500. eCollection 2025.

Disruption of dopamine D2/D3 system function impairs the human ability to understand the mental states of other people.

PLoS Biol. 2024 Jun 13;22(6):e3002652. doi: 10.1371/journal.pbio.3002652. eCollection 2024 Jun.

Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts.

PLoS Comput Biol. 2024 Mar 29;20(3):e1011950. doi: 10.1371/journal.pcbi.1011950. eCollection 2024 Mar.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

力调节解释了刺激-奖励学习过程中多巴胺相位信号的变化。

Force tuning explains changes in phasic dopamine signaling during stimulus-reward learning.

作者信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献