Suppr超能文献

多巴胺奖赏预测误差信号编码了对感知决策报告的时间评估。

Dopamine reward prediction error signal codes the temporal evaluation of a perceptual decision report.

机构信息

Departamento de Física Teórica, Universidad Autónoma de Madrid, Cantoblanco 28049, Madrid, Spain.

Centro de Investigación Avanzada en Física Fundamental, Universidad Autónoma de Madrid, Cantoblanco 28049, Madrid, Spain.

出版信息

Proc Natl Acad Sci U S A. 2017 Nov 28;114(48):E10494-E10503. doi: 10.1073/pnas.1712479114. Epub 2017 Nov 13.

Abstract

Learning to associate unambiguous sensory cues with rewarded choices is known to be mediated by dopamine (DA) neurons. However, little is known about how these neurons behave when choices rely on uncertain reward-predicting stimuli. To study this issue we reanalyzed DA recordings from monkeys engaged in the detection of weak tactile stimuli delivered at random times and formulated a reinforcement learning model based on belief states. Specifically, we investigated how the firing activity of DA neurons should behave if they were coding the error in the prediction of the total future reward when animals made decisions relying on uncertain sensory and temporal information. Our results show that the same signal that codes for reward prediction errors also codes the animal's certainty about the presence of the stimulus and the temporal expectation of sensory cues.

摘要

学习将明确的感官线索与奖励选择联系起来,这被认为是由多巴胺 (DA) 神经元介导的。然而,对于这些神经元在依赖不确定的奖励预测刺激的情况下如何表现,人们知之甚少。为了研究这个问题,我们重新分析了猴子在检测随机时间内给予的微弱触觉刺激时的 DA 神经元记录,并基于信念状态制定了一个强化学习模型。具体来说,我们研究了当动物在依赖不确定的感觉和时间信息做出决策时,DA 神经元的放电活动应该如何表现,如果它们正在编码对未来总奖励预测的误差。我们的结果表明,编码奖励预测误差的相同信号也编码了动物对刺激存在的确定性和对感觉线索的时间期望。

相似文献

5
Components and characteristics of the dopamine reward utility signal.多巴胺奖赏效用信号的组成部分及特征。
J Comp Neurol. 2016 Jun 1;524(8):1699-711. doi: 10.1002/cne.23880. Epub 2015 Sep 8.

引用本文的文献

2
Emergence of belief-like representations through reinforcement learning.通过强化学习产生类信仰的表示。
PLoS Comput Biol. 2023 Sep 11;19(9):e1011067. doi: 10.1371/journal.pcbi.1011067. eCollection 2023 Sep.
6
Expectancy-based rhythmic entrainment as continuous Bayesian inference.基于预期的节奏同步作为连续贝叶斯推断。
PLoS Comput Biol. 2021 Jun 9;17(6):e1009025. doi: 10.1371/journal.pcbi.1009025. eCollection 2021 Jun.
7
How Beat Perception Co-opts Motor Neurophysiology.节拍感知如何借鉴运动神经生理学。
Trends Cogn Sci. 2021 Feb;25(2):137-150. doi: 10.1016/j.tics.2020.11.002. Epub 2020 Dec 24.
8
Dopamine signals as temporal difference errors: recent advances.多巴胺信号作为时间差异误差:最新进展。
Curr Opin Neurobiol. 2021 Apr;67:95-105. doi: 10.1016/j.conb.2020.08.014. Epub 2020 Nov 10.
9
Distributional Reinforcement Learning in the Brain.大脑中的分布强化学习。
Trends Neurosci. 2020 Dec;43(12):980-997. doi: 10.1016/j.tins.2020.09.004. Epub 2020 Oct 19.
10
Motor and Predictive Processes in Auditory Beat and Rhythm Perception.听觉节拍与节奏感知中的运动和预测过程
Front Hum Neurosci. 2020 Sep 11;14:578546. doi: 10.3389/fnhum.2020.578546. eCollection 2020.

本文引用的文献

1
Midbrain dopamine neurons control judgment of time.中脑多巴胺神经元控制时间判断。
Science. 2016 Dec 9;354(6317):1273-1277. doi: 10.1126/science.aah5234.
7
A scalable population code for time in the striatum.纹状体中的时间的可扩展群体代码。
Curr Biol. 2015 May 4;25(9):1113-22. doi: 10.1016/j.cub.2015.02.036. Epub 2015 Apr 23.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验