Dopamine reward prediction error signal codes the temporal evaluation of a perceptual decision report.

Author affiliations

Departamento de Física Teórica, Universidad Autónoma de Madrid, Cantoblanco 28049, Madrid, Spain.

Centro de Investigación Avanzada en Física Fundamental, Universidad Autónoma de Madrid, Cantoblanco 28049, Madrid, Spain.

Publication information

Proc Natl Acad Sci U S A. 2017 Nov 28;114(48):E10494-E10503. doi: 10.1073/pnas.1712479114. Epub 2017 Nov 13.

DOI:10.1073/pnas.1712479114

PMID:29133424

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5715768/

Abstract

Learning to associate unambiguous sensory cues with rewarded choices is known to be mediated by dopamine (DA) neurons. However, little is known about how these neurons behave when choices rely on uncertain reward-predicting stimuli. To study this issue we reanalyzed DA recordings from monkeys engaged in the detection of weak tactile stimuli delivered at random times and formulated a reinforcement learning model based on belief states. Specifically, we investigated how the firing activity of DA neurons should behave if they were coding the error in the prediction of the total future reward when animals made decisions relying on uncertain sensory and temporal information. Our results show that the same signal that codes for reward prediction errors also codes the animal's certainty about the presence of the stimulus and the temporal expectation of sensory cues.
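As a rough illustration of what "the error in the prediction of the total future reward" can mean when the underlying state is uncertain, the sketch below computes a temporal-difference error over belief states. It is not the authors' model: the two hidden states, the value weights, the belief vectors, and the function names are all assumptions chosen for illustration.

```python
import numpy as np

# Hypothetical hidden states: index 0 = "stimulus absent", index 1 = "stimulus present".
# Hypothetical learned values of each hidden state (expected future reward).
weights = np.array([0.2, 0.9])


def value(belief, weights):
    """Value of a belief state: the belief-weighted average of hidden-state values."""
    return float(np.dot(belief, weights))


def td_error(belief, next_belief, reward, weights, gamma=1.0):
    """Temporal-difference error: delta = r + gamma * V(b') - V(b)."""
    return reward + gamma * value(next_belief, weights) - value(belief, weights)


# Belief before any sensory evidence (assumed 50/50 prior).
b_prior = np.array([0.5, 0.5])
# Assumed beliefs after a weak and after a strong tactile stimulus.
b_weak = np.array([0.3, 0.7])
b_strong = np.array([0.05, 0.95])

# No reward is delivered at stimulus time, so reward = 0 in both cases.
delta_weak = td_error(b_prior, b_weak, 0.0, weights)
delta_strong = td_error(b_prior, b_strong, 0.0, weights)

print("delta after weak stimulus:  ", delta_weak)
print("delta after strong stimulus:", delta_strong)
```

Under these assumed numbers the error is larger after the strong stimulus than after the weak one, because the belief shifts further toward the higher-valued "stimulus present" state. This is one simple way a single prediction-error signal could also reflect certainty about stimulus presence.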
