• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类大脑在使用果汁和金钱奖励进行工具性学习过程中,背侧纹状体的预测误差存在重叠。

Overlapping prediction errors in dorsal striatum during instrumental learning with juice and money reward in the human brain.

机构信息

Division of Humanities and Social Sciences, California Institute of Technology, Pasadena, California, USA.

出版信息

J Neurophysiol. 2009 Dec;102(6):3384-91. doi: 10.1152/jn.91195.2008. Epub 2009 Sep 30.

DOI:10.1152/jn.91195.2008
PMID:19793875
Abstract

Prediction error signals have been reported in human imaging studies in target areas of dopamine neurons such as ventral and dorsal striatum during learning with many different types of reinforcers. However, a key question that has yet to be addressed is whether prediction error signals recruit distinct or overlapping regions of striatum and elsewhere during learning with different types of reward. To address this, we scanned 17 healthy subjects with functional magnetic resonance imaging while they chose actions to obtain either a pleasant juice reward (1 ml apple juice), or a monetary gain (5 cents) and applied a computational reinforcement learning model to subjects' behavioral and imaging data. Evidence for an overlapping prediction error signal during learning with juice and money rewards was found in a region of dorsal striatum (caudate nucleus), while prediction error signals in a subregion of ventral striatum were significantly stronger during learning with money but not juice reward. These results provide evidence for partially overlapping reward prediction signals for different types of appetitive reinforcers within the striatum, a finding with important implications for understanding the nature of associative encoding in the striatum as a function of reinforcer type.

摘要

在使用多种不同类型的强化物进行学习时,人类影像学研究报告称在多巴胺神经元的目标区域(如腹侧和背侧纹状体)中存在预测误差信号。然而,一个尚未解决的关键问题是,在使用不同类型的奖励进行学习时,预测误差信号是否会招募纹状体和其他区域的不同或重叠区域。为了解决这个问题,我们对 17 名健康受试者进行了功能磁共振成像扫描,同时他们选择了行动来获得愉悦的果汁奖励(1 毫升苹果汁)或货币收益(5 美分),并将计算强化学习模型应用于受试者的行为和成像数据。在使用果汁和金钱奖励进行学习时,在背侧纹状体(尾状核)的一个区域中发现了重叠的预测误差信号的证据,而在腹侧纹状体的一个亚区中,预测误差信号在学习使用金钱奖励时明显更强,但在使用果汁奖励时则没有。这些结果为不同类型的奖赏预测信号在纹状体中存在部分重叠提供了证据,这一发现对于理解作为强化物类型函数的纹状体中的联想编码性质具有重要意义。

相似文献

1
Overlapping prediction errors in dorsal striatum during instrumental learning with juice and money reward in the human brain.人类大脑在使用果汁和金钱奖励进行工具性学习过程中,背侧纹状体的预测误差存在重叠。
J Neurophysiol. 2009 Dec;102(6):3384-91. doi: 10.1152/jn.91195.2008. Epub 2009 Sep 30.
2
Expected value and prediction error abnormalities in depression and schizophrenia.抑郁和精神分裂症中的预期价值和预测误差异常。
Brain. 2011 Jun;134(Pt 6):1751-64. doi: 10.1093/brain/awr059. Epub 2011 Apr 10.
3
Heterarchical reinforcement-learning model for integration of multiple cortico-striatal loops: fMRI examination in stimulus-action-reward association learning.用于整合多个皮质-纹状体环路的异层级强化学习模型:刺激-动作-奖励关联学习中的功能磁共振成像检查
Neural Netw. 2006 Oct;19(8):1242-54. doi: 10.1016/j.neunet.2006.06.007. Epub 2006 Sep 20.
4
Dorsal striatal-midbrain connectivity in humans predicts how reinforcements are used to guide decisions.人类背侧纹状体与中脑的连接性可预测强化如何用于指导决策。
J Cogn Neurosci. 2009 Jul;21(7):1332-45. doi: 10.1162/jocn.2009.21092.
5
Prediction error in reinforcement learning: a meta-analysis of neuroimaging studies.强化学习中的预测误差:神经影像学研究的荟萃分析。
Neurosci Biobehav Rev. 2013 Aug;37(7):1297-310. doi: 10.1016/j.neubiorev.2013.03.023. Epub 2013 Apr 6.
6
Neural coding of reward-prediction error signals during classical conditioning with attractive faces.在对有吸引力面孔进行经典条件反射过程中奖励预测误差信号的神经编码
J Neurophysiol. 2007 Apr;97(4):3036-45. doi: 10.1152/jn.01211.2006. Epub 2007 Feb 15.
7
Brain mechanism of reward prediction under predictable and unpredictable environmental dynamics.可预测和不可预测环境动态下奖励预测的脑机制。
Neural Netw. 2006 Oct;19(8):1233-41. doi: 10.1016/j.neunet.2006.05.039. Epub 2006 Sep 18.
8
Involvement of basal ganglia and orbitofrontal cortex in goal-directed behavior.基底神经节和眶额皮质在目标导向行为中的参与。
Prog Brain Res. 2000;126:193-215. doi: 10.1016/S0079-6123(00)26015-9.
9
Human dorsal striatum encodes prediction errors during observational learning of instrumental actions.人类背侧纹状体在观察性工具动作学习过程中对预测误差进行编码。
J Cogn Neurosci. 2012 Jan;24(1):106-18. doi: 10.1162/jocn_a_00114. Epub 2011 Aug 3.
10
Activity in human ventral striatum locked to errors of reward prediction.人类腹侧纹状体的活动与奖励预测误差相关。
Nat Neurosci. 2002 Feb;5(2):97-8. doi: 10.1038/nn802.

引用本文的文献

1
The interoceptive origin of reinforcement learning.强化学习的内感受起源
Trends Cogn Sci. 2025 Sep;29(9):840-854. doi: 10.1016/j.tics.2025.05.008. Epub 2025 Jun 10.
2
Representation of Anticipated Rewards and Punishments in the Human Brain.人类大脑中预期奖励与惩罚的表征。
Annu Rev Psychol. 2025 Jan;76(1):197-226. doi: 10.1146/annurev-psych-022324-042614. Epub 2024 Dec 3.
3
State-specific alterations in the neural computations underlying inhibitory control in women remitted from bulimia nervosa.神经性贪食症缓解期女性抑制控制背后神经计算的特定状态改变。
Mol Psychiatry. 2023 Jul;28(7):3055-3062. doi: 10.1038/s41380-023-02063-6. Epub 2023 Apr 27.
4
Beta Oscillations in Monkey Striatum Encode Reward Prediction Error Signals.猴子纹状体中的β振荡编码奖励预测误差信号。
J Neurosci. 2023 May 3;43(18):3339-3352. doi: 10.1523/JNEUROSCI.0952-22.2023. Epub 2023 Apr 4.
5
Learning under social versus nonsocial uncertainty: A meta-analytic approach.在社会不确定性与非社会不确定性下的学习:一项元分析方法。
Hum Brain Mapp. 2022 Sep;43(13):4185-4206. doi: 10.1002/hbm.25948. Epub 2022 May 27.
6
Cortico-Striatal Activity Characterizes Human Safety Learning via Pavlovian Conditioned Inhibition.皮质纹状体活动通过条件性 Pavlovian 抑制来表征人类的安全学习。
J Neurosci. 2022 Jun 22;42(25):5047-5057. doi: 10.1523/JNEUROSCI.2181-21.2022. Epub 2022 May 16.
7
Acute stress blunts prediction error signals in the dorsal striatum during reinforcement learning.急性应激会在强化学习过程中减弱背侧纹状体中的预测误差信号。
Neurobiol Stress. 2021 Oct 27;15:100412. doi: 10.1016/j.ynstr.2021.100412. eCollection 2021 Nov.
8
Modeling changes in probabilistic reinforcement learning during adolescence.建模青少年时期概率强化学习的变化。
PLoS Comput Biol. 2021 Jul 1;17(7):e1008524. doi: 10.1371/journal.pcbi.1008524. eCollection 2021 Jul.
9
Revisiting the importance of model fitting for model-based fMRI: It does matter in computational psychiatry.重新审视模型拟合对基于模型的功能磁共振成像的重要性:它在计算精神病学中确实很重要。
PLoS Comput Biol. 2021 Feb 9;17(2):e1008738. doi: 10.1371/journal.pcbi.1008738. eCollection 2021 Feb.
10
Anhedonia, positive affect dysregulation, and risk and maintenance of binge-eating disorder.快感缺失、正性情绪调节障碍与暴食障碍的风险和维持。
Int J Eat Disord. 2021 Mar;54(3):287-292. doi: 10.1002/eat.23433. Epub 2020 Dec 9.