• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类中脑和腹侧纹状体中的可分离奖励和时间信号。

Dissociable reward and timing signals in human midbrain and ventral striatum.

机构信息

Sobell Department of Motor Neuroscience and Movement Disorders, Institute of Neurology, UCL, London WC1N3BG, UK.

出版信息

Neuron. 2011 Nov 17;72(4):654-64. doi: 10.1016/j.neuron.2011.08.024.

DOI:10.1016/j.neuron.2011.08.024
PMID:22099466
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3219831/
Abstract

Reward prediction error (RPE) signals are central to current models of reward-learning. Temporal difference (TD) learning models posit that these signals should be modulated by predictions, not only of magnitude but also timing of reward. Here we show that BOLD activity in the VTA conforms to such TD predictions: responses to unexpected rewards are modulated by a temporal hazard function and activity between a predictive stimulus and reward is depressed in proportion to predicted reward. By contrast, BOLD activity in ventral striatum (VS) does not reflect a TD RPE, but instead encodes a signal on the variable relevant for behavior, here timing but not magnitude of reward. The results have important implications for dopaminergic models of cortico-striatal learning and suggest a modification of the conventional view that VS BOLD necessarily reflects inputs from dopaminergic VTA neurons signaling an RPE.

摘要

奖励预测误差(RPE)信号是当前奖励学习模型的核心。时频差(TD)学习模型假设这些信号不仅应该受到奖励幅度的预测的调节,还应该受到奖励时间的预测的调节。在这里,我们表明腹侧被盖区(VTA)的 BOLD 活动符合这种 TD 预测:对意外奖励的反应受到时间危险函数的调节,并且在预测刺激和奖励之间的活动根据预测的奖励而被抑制。相比之下,腹侧纹状体(VS)的 BOLD 活动不反映 TD RPE,而是对行为相关的变量(此处是奖励的时间而不是幅度)进行编码。研究结果对皮质纹状体学习的多巴胺能模型具有重要意义,并表明对 VS BOLD 必然反映来自多巴胺能 VTA 神经元信号的 RPE 的传统观点进行了修正。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4b0/3219831/61adb7631380/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4b0/3219831/8e43965d0a30/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4b0/3219831/0cb72255705f/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4b0/3219831/8239fb9fb3ec/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4b0/3219831/f1baf60d72dc/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4b0/3219831/ae8096f17726/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4b0/3219831/61adb7631380/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4b0/3219831/8e43965d0a30/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4b0/3219831/0cb72255705f/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4b0/3219831/8239fb9fb3ec/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4b0/3219831/f1baf60d72dc/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4b0/3219831/ae8096f17726/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4b0/3219831/61adb7631380/gr6.jpg

相似文献

1
Dissociable reward and timing signals in human midbrain and ventral striatum.人类中脑和腹侧纹状体中的可分离奖励和时间信号。
Neuron. 2011 Nov 17;72(4):654-64. doi: 10.1016/j.neuron.2011.08.024.
2
Differential magnitude coding of gains and omitted rewards in the ventral striatum.腹侧纹状体中收益和缺失奖励的差异幅度编码。
Brain Res. 2011 Sep 9;1411:76-86. doi: 10.1016/j.brainres.2011.07.019. Epub 2011 Jul 18.
3
Involvement of basal ganglia and orbitofrontal cortex in goal-directed behavior.基底神经节和眶额皮质在目标导向行为中的参与。
Prog Brain Res. 2000;126:193-215. doi: 10.1016/S0079-6123(00)26015-9.
4
Temporal Specificity of Reward Prediction Errors Signaled by Putative Dopamine Neurons in Rat VTA Depends on Ventral Striatum.大鼠腹侧被盖区中假定多巴胺能神经元发出的奖励预测误差的时间特异性取决于腹侧纹状体。
Neuron. 2016 Jul 6;91(1):182-93. doi: 10.1016/j.neuron.2016.05.015. Epub 2016 Jun 9.
5
Anticipatory reward signals in ventral striatal neurons of behaving rats.行为大鼠腹侧纹状体神经元中的预期奖励信号。
Eur J Neurosci. 2008 Nov;28(9):1849-66. doi: 10.1111/j.1460-9568.2008.06480.x.
6
Contextual interaction between novelty and reward processing within the mesolimbic system.中脑边缘系统中新奇与奖励处理的语境交互作用。
Hum Brain Mapp. 2012 Jun;33(6):1309-24. doi: 10.1002/hbm.21288. Epub 2011 Apr 21.
7
A high-resolution fMRI approach to characterize functionally distinct neural pathways within dopaminergic midbrain and nucleus accumbens during reward and salience processing.一种高分辨率 fMRI 方法,用于在奖励和突显处理过程中描绘多巴胺能中脑和伏隔核内功能不同的神经通路。
Eur Neuropsychopharmacol. 2020 Jul;36:137-150. doi: 10.1016/j.euroneuro.2020.05.005. Epub 2020 Jun 13.
8
BOLD responses reflecting dopaminergic signals in the human ventral tegmental area.反映人类腹侧被盖区多巴胺能信号的血氧水平依赖性功能磁共振成像响应。
Science. 2008 Feb 29;319(5867):1264-7. doi: 10.1126/science.1150605.
9
Impaired reward prediction error encoding and striatal-midbrain connectivity in depression.抑郁症患者的奖励预测误差编码和纹状体-中脑连接受损。
Neuropsychopharmacology. 2018 Jun;43(7):1581-1588. doi: 10.1038/s41386-018-0032-x. Epub 2018 Feb 26.
10
Beta Oscillations in Monkey Striatum Encode Reward Prediction Error Signals.猴子纹状体中的β振荡编码奖励预测误差信号。
J Neurosci. 2023 May 3;43(18):3339-3352. doi: 10.1523/JNEUROSCI.0952-22.2023. Epub 2023 Apr 4.

引用本文的文献

1
Reward signals in the motor cortex: from biology to neurotechnology.运动皮层中的奖赏信号:从生物学到神经技术
Nat Commun. 2025 Feb 3;16(1):1307. doi: 10.1038/s41467-024-55016-0.
2
Distinct Action Signals by Subregions in the Nucleus Accumbens during STOP-Change Performance.伏隔核亚区域在停止-转换任务执行过程中的不同动作信号
J Neurosci. 2024 Jul 17;44(29):e0020242024. doi: 10.1523/JNEUROSCI.0020-24.2024.
3
PET-measured human dopamine synthesis capacity and receptor availability predict trading rewards and time-costs during foraging.

本文引用的文献

1
The impact of physiological noise correction on fMRI at 7 T.生理噪声校正对 7T fMRI 的影响。
Neuroimage. 2011 Jul 1;57(1):101-112. doi: 10.1016/j.neuroimage.2011.04.018. Epub 2011 Apr 15.
2
Signals in human striatum are appropriate for policy update rather than value prediction.人类纹状体中的信号适合用于策略更新,而不是价值预测。
J Neurosci. 2011 Apr 6;31(14):5504-11. doi: 10.1523/JNEUROSCI.6316-10.2011.
3
How the opinion of others affects our valuation of objects.他人的意见如何影响我们对事物的评价。
正电子发射断层扫描(PET)测量的人类多巴胺合成能力和受体可及性可预测觅食过程中的交易奖励和时间成本。
Nat Commun. 2023 Sep 30;14(1):6122. doi: 10.1038/s41467-023-41897-0.
4
Reward timing matters in motor learning.奖励时机在运动学习中很重要。
iScience. 2022 Apr 25;25(5):104290. doi: 10.1016/j.isci.2022.104290. eCollection 2022 May 20.
5
Anhedonia and Suicide.快感缺失与自杀。
Curr Top Behav Neurosci. 2022;58:443-464. doi: 10.1007/7854_2022_358.
6
Multiple systems in macaques for tracking prediction errors and other types of surprise.猕猴的多个系统用于跟踪预测误差和其他类型的意外。
PLoS Biol. 2020 Oct 30;18(10):e3000899. doi: 10.1371/journal.pbio.3000899. eCollection 2020 Oct.
7
Neuronal Computation Underlying Inferential Reasoning in Humans and Mice.人类和小鼠推理思维的神经计算。
Cell. 2020 Oct 1;183(1):228-243.e21. doi: 10.1016/j.cell.2020.08.035. Epub 2020 Sep 17.
8
Local and global consequences of reward-evoked striatal dopamine release.奖赏诱发纹状体多巴胺释放的局部和全局后果。
Nature. 2020 Apr;580(7802):239-244. doi: 10.1038/s41586-020-2158-3. Epub 2020 Apr 1.
9
Computational modelling of social cognition and behaviour-a reinforcement learning primer.社交认知与行为的计算建模——强化学习基础
Soc Cogn Affect Neurosci. 2021 Aug 6;16(8):761-771. doi: 10.1093/scan/nsaa040.
10
The anticipation of events in time.对时间事件的预期。
Nat Commun. 2019 Dec 20;10(1):5802. doi: 10.1038/s41467-019-13849-0.
Curr Biol. 2010 Jul 13;20(13):1165-70. doi: 10.1016/j.cub.2010.04.055. Epub 2010 Jun 17.
4
States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning.状态与奖励:基于模型和无模型强化学习的分离神经预测误差信号。
Neuron. 2010 May 27;66(4):585-95. doi: 10.1016/j.neuron.2010.04.016.
5
A unique adolescent response to reward prediction errors.青少年对奖励预测误差的独特反应。
Nat Neurosci. 2010 Jun;13(6):669-71. doi: 10.1038/nn.2558. Epub 2010 May 16.
6
Striatal prediction error modulates cortical coupling.纹状体预测误差调节皮质耦合。
J Neurosci. 2010 Mar 3;30(9):3210-9. doi: 10.1523/JNEUROSCI.4458-09.2010.
7
Neural regions that underlie reinforcement learning are also active for social expectancy violations.强化学习所依赖的神经区域对于社会期望违反也很活跃。
Soc Neurosci. 2010;5(1):76-91. doi: 10.1080/17470910903135825.
8
Comparing the neural basis of monetary reward and cognitive feedback during information-integration category learning.比较信息整合类别学习过程中金钱奖励和认知反馈的神经基础。
J Neurosci. 2010 Jan 6;30(1):47-55. doi: 10.1523/JNEUROSCI.2205-09.2010.
9
Dissociating the contributions of independent corticostriatal systems to visual categorization learning through the use of reinforcement learning modeling and Granger causality modeling.通过使用强化学习建模和格兰杰因果关系建模,将独立的皮质纹状体系统对视觉分类学习的贡献区分开来。
Neuroimage. 2010 Apr 1;50(2):644-56. doi: 10.1016/j.neuroimage.2009.11.083. Epub 2009 Dec 5.
10
Human reinforcement learning subdivides structured action spaces by learning effector-specific values.人类强化学习通过学习特定效应器的值来细分结构化动作空间。
J Neurosci. 2009 Oct 28;29(43):13524-31. doi: 10.1523/JNEUROSCI.2469-09.2009.