国家不确定性在多巴胺动态中的作用。

The role of state uncertainty in the dynamics of dopamine.

机构信息

Program in Neuroscience, Harvard Medical School, Boston, MA 02115, USA; MD-PhD Program, Harvard Medical School, Boston, MA 02115, USA.

Center for Neuroscience Imaging Research, Institute for Basic Science, Suwon 16419, Republic of Korea; Department of Biomedical Engineering, Sungkyunkwan University, Suwon 16419, Republic of Korea; Department of Molecular and Cellular Biology and Center for Brain Science, Harvard University, Cambridge, MA 02138, USA.

出版信息

Curr Biol. 2022 Mar 14;32(5):1077-1087.e9. doi: 10.1016/j.cub.2022.01.025. Epub 2022 Feb 2.

DOI:10.1016/j.cub.2022.01.025

PMID:35114098

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8930519/

Abstract

Reinforcement learning models of the basal ganglia map the phasic dopamine signal to reward prediction errors (RPEs). Conventional models assert that, when a stimulus predicts a reward with fixed delay, dopamine activity during the delay should converge to baseline through learning. However, recent studies have found that dopamine ramps up before reward in certain conditions even after learning, thus challenging the conventional models. In this work, we show that sensory feedback causes an unbiased learner to produce RPE ramps. Our model predicts that when feedback gradually decreases during a trial, dopamine activity should resemble a "bump," whose ramp-up phase should, furthermore, be greater than that of conditions where the feedback stays high. We trained mice on a virtual navigation task with varying brightness, and both predictions were empirically observed. In sum, our theoretical and experimental results reconcile the seemingly conflicting data on dopamine behaviors under the RPE hypothesis.

摘要

基底神经节的强化学习模型将相位多巴胺信号映射到奖励预测误差（RPE）。传统模型断言，当刺激以固定延迟预测奖励时，多巴胺活动在延迟期间应该通过学习收敛到基线。然而，最近的研究发现，在某些条件下，即使在学习之后，多巴胺在奖励之前也会上升，从而挑战了传统模型。在这项工作中，我们表明，感官反馈导致无偏学习者产生 RPE 斜坡。我们的模型预测，当在试验期间逐渐降低反馈时，多巴胺活动应该类似于“凸起”，其上升阶段应该大于反馈保持较高的情况。我们在具有不同亮度的虚拟导航任务中对老鼠进行了训练，并且都观察到了这两个预测。总之，我们的理论和实验结果在 RPE 假设下协调了多巴胺行为的看似冲突的数据。

相似文献

The role of state uncertainty in the dynamics of dopamine.

Curr Biol. 2022 Mar 14;32(5):1077-1087.e9. doi: 10.1016/j.cub.2022.01.025. Epub 2022 Feb 2.

Belief state representation in the dopamine system.

Nat Commun. 2018 May 14;9(1):1891. doi: 10.1038/s41467-018-04397-0.

Dopamine ramps for accurate value learning under uncertainty.

Trends Neurosci. 2022 Apr;45(4):254-256. doi: 10.1016/j.tins.2022.01.008. Epub 2022 Feb 15.

Midbrain dopamine neurons signal phasic and ramping reward prediction error during goal-directed navigation.

Cell Rep. 2022 Oct 11;41(2):111470. doi: 10.1016/j.celrep.2022.111470.

Striatal dopamine ramping may indicate flexible reinforcement learning with forgetting in the cortico-basal ganglia circuits.

Front Neural Circuits. 2014 Apr 9;8:36. doi: 10.3389/fncir.2014.00036. eCollection 2014.

Learning to express reward prediction error-like dopaminergic activity requires plastic representations of time.

Nat Commun. 2024 Jul 12;15(1):5856. doi: 10.1038/s41467-024-50205-3.

An association between prediction errors and risk-seeking: Theory and behavioral evidence.

PLoS Comput Biol. 2021 Jul 16;17(7):e1009213. doi: 10.1371/journal.pcbi.1009213. eCollection 2021 Jul.

A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task.

Neuroscience. 1999;91(3):871-90. doi: 10.1016/s0306-4522(98)00697-6.

Uncertainty-guided learning with scaled prediction errors in the basal ganglia.

PLoS Comput Biol. 2022 May 27;18(5):e1009816. doi: 10.1371/journal.pcbi.1009816. eCollection 2022 May.

Phasic dopamine release in the rat nucleus accumbens symmetrically encodes a reward prediction error term.

J Neurosci. 2014 Jan 15;34(3):698-704. doi: 10.1523/JNEUROSCI.2489-13.2014.

引用本文的文献

Mesolimbic dopamine ramps reflect environmental timescales.

Elife. 2025 Aug 29;13:RP98666. doi: 10.7554/eLife.98666.

Striatal Gradient in Value-Decay Explains Regional Differences in Dopamine Patterns and Reinforcement Learning Computations.

J Neurosci. 2025 Jul 18. doi: 10.1523/JNEUROSCI.0170-25.2025.

Multi-timescale reinforcement learning in the brain.

Nature. 2025 Jun 4. doi: 10.1038/s41586-025-08929-9.

Associations Between Taq1A/C957T Polymorphic Variants and Autonomic Responsivity in a Slot Machine Task: Influence of Real-Life Gambling Exposure and Sex.

J Gambl Stud. 2025 May 30. doi: 10.1007/s10899-025-10398-8.

Quantitative dynamics of neural uncertainty in sensory processing and decision-making during discriminative learning.

Exp Mol Med. 2025 May 7. doi: 10.1038/s12276-025-01456-7.

Striatal dopamine represents valence on dynamic regional scales.

J Neurosci. 2025 Mar 17;45(17). doi: 10.1523/JNEUROSCI.1551-24.2025.

Fiber photometry analysis of spontaneous dopamine signals: The z-scored data are not the data.

bioRxiv. 2025 Feb 24:2025.02.19.639080. doi: 10.1101/2025.02.19.639080.

The devilish details affecting TDRL models in dopamine research.

Trends Cogn Sci. 2025 May;29(5):434-447. doi: 10.1016/j.tics.2025.02.001. Epub 2025 Feb 26.

Dopamine release plateau and outcome signals in dorsal striatum contrast with classic reinforcement learning formulations.

Nat Commun. 2024 Oct 14;15(1):8856. doi: 10.1038/s41467-024-53176-7.

Mating proximity blinds threat perception.

Nature. 2024 Oct;634(8034):635-643. doi: 10.1038/s41586-024-07890-3. Epub 2024 Aug 28.

本文引用的文献

A Unified Framework for Dopamine Signals across Timescales.

Cell. 2020 Dec 10;183(6):1600-1616.e25. doi: 10.1016/j.cell.2020.11.013. Epub 2020 Nov 27.

Believing in dopamine.

Nat Rev Neurosci. 2019 Nov;20(11):703-714. doi: 10.1038/s41583-019-0220-7. Epub 2019 Sep 30.

Dopamine blockade impairs the exploration-exploitation trade-off in rats.

Sci Rep. 2019 May 1;9(1):6770. doi: 10.1038/s41598-019-43245-z.

Rethinking dopamine as generalized prediction error.

Proc Biol Sci. 2018 Nov 21;285(1891):20181645. doi: 10.1098/rspb.2018.1645.

Log versus linear timing in human temporal bisection: A signal detection theory study.

J Exp Psychol Anim Learn Cogn. 2018 Oct;44(4):396-408. doi: 10.1037/xan0000184.

What does dopamine mean?

Nat Neurosci. 2018 Jun;21(6):787-793. doi: 10.1038/s41593-018-0152-y. Epub 2018 May 14.

Belief state representation in the dopamine system.

Nat Commun. 2018 May 14;9(1):1891. doi: 10.1038/s41467-018-04397-0.

The Medial Prefrontal Cortex Shapes Dopamine Reward Prediction Errors under State Uncertainty.

Neuron. 2018 May 2;98(3):616-629.e6. doi: 10.1016/j.neuron.2018.03.036. Epub 2018 Apr 12.

Midbrain Dopamine Neurons Signal Belief in Choice Accuracy during a Perceptual Decision.

Curr Biol. 2017 Mar 20;27(6):821-832. doi: 10.1016/j.cub.2017.02.026. Epub 2017 Mar 9.

Dopamine reward prediction errors reflect hidden-state inference across time.

Nat Neurosci. 2017 Apr;20(4):581-589. doi: 10.1038/nn.4520. Epub 2017 Mar 6.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

国家不确定性在多巴胺动态中的作用。

The role of state uncertainty in the dynamics of dopamine.

机构信息

Program in Neuroscience, Harvard Medical School, Boston, MA 02115, USA; MD-PhD Program, Harvard Medical School, Boston, MA 02115, USA.

出版信息

Curr Biol. 2022 Mar 14;32(5):1077-1087.e9. doi: 10.1016/j.cub.2022.01.025. Epub 2022 Feb 2.

DOI:10.1016/j.cub.2022.01.025

PMID:35114098

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8930519/

Abstract

摘要

国家不确定性在多巴胺动态中的作用。

The role of state uncertainty in the dynamics of dopamine.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

国家不确定性在多巴胺动态中的作用。

The role of state uncertainty in the dynamics of dopamine.

机构信息

出版信息