• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

内在价值观与后天习得价值观的一致性有助于基于奖励的决策。

Congruence of Inherent and Acquired Values Facilitates Reward-Based Decision-Making.

作者信息

Chien Samson, Wiehler Antonius, Spezio Michael, Gläscher Jan

机构信息

Institute for Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Hamburg 20246, Germany, and

Institute for Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Hamburg 20246, Germany, and.

出版信息

J Neurosci. 2016 May 4;36(18):5003-12. doi: 10.1523/JNEUROSCI.3084-15.2016.

DOI:10.1523/JNEUROSCI.3084-15.2016
PMID:27147653
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6601847/
Abstract

UNLABELLED

Most real-life cues exhibit certain inherent values that may interfere with or facilitate the acquisition of new expected values during associative learning. In particular, when inherent and acquired values are congruent, learning may progress more rapidly. Here we investigated such an influence through a 2 × 2 factorial design, using attractiveness (high/low) of the facial picture as a proxy for the inherent value of the cue and its reward probability (high/low) as a surrogate for the acquired value. Each picture was paired with a monetary win or loss either congruently or incongruently. Behavioral results from 32 human participants indicated both faster response time and faster learning rate for value-congruent cue-outcome pairings. Model-based fMRI analysis revealed a fractionation of reinforcement learning (RL) signals in the ventral striatum, including a strong and novel correlation between the cue-specific decaying learning rate and BOLD activity in the ventral caudate. Additionally, we detected a functional link between neural signals of both learning rate and reward prediction error in the ventral striatum, and the signal of expected value in the ventromedial prefrontal cortex, showing a novel confirmation of the mathematical RL model via functional connectivity.

SIGNIFICANCE STATEMENT

Most real-world decisions require the integration of inherent value and sensitivity to outcomes to facilitate adaptive learning. Inherent value is drawing increasing interest from decision scientists because it influences decisions in contexts ranging from advertising to investing. This study provides novel insight into how inherent value influences the acquisition of new expected value during associative learning. Specifically, we find that the congruence between the inherent value and the acquired reward influences the neural coding of learning rate. We also show for the first time that neuroimaging signals coding the learning rate, prediction error, and acquired value follow the multiplicative Rescorla-Wagner learning rule, a finding predicted by reinforcement learning theory.

摘要

未标注

大多数现实生活中的线索都具有某些内在价值,这些价值可能会在联想学习过程中干扰或促进新预期价值的获取。特别是,当内在价值和习得价值一致时,学习可能会进展得更快。在这里,我们通过2×2析因设计研究了这种影响,使用面部图片的吸引力(高/低)作为线索内在价值的代理,其奖励概率(高/低)作为习得价值的替代。每张图片都与金钱输赢进行一致或不一致的配对。32名人类参与者的行为结果表明,价值一致的线索-结果配对的反应时间更快,学习速度也更快。基于模型的功能磁共振成像分析揭示了腹侧纹状体中强化学习(RL)信号的分离,包括线索特异性衰减学习率与腹侧尾状核中的BOLD活动之间存在强烈且新颖的相关性。此外,我们检测到腹侧纹状体中学习率和奖励预测误差的神经信号与腹内侧前额叶皮层中预期价值信号之间的功能联系,通过功能连接展示了对数学RL模型的新证实。

意义声明

大多数现实世界的决策需要整合内在价值和对结果的敏感性,以促进适应性学习。内在价值正越来越受到决策科学家的关注,因为它在从广告到投资等各种情境中影响决策。这项研究为内在价值如何在联想学习过程中影响新预期价值的获取提供了新的见解。具体而言,我们发现内在价值与习得奖励之间的一致性会影响学习率的神经编码。我们还首次表明,编码学习率、预测误差和习得价值的神经成像信号遵循乘法雷斯克拉-瓦格纳学习规则,这是强化学习理论预测的一个发现。

相似文献

1
Congruence of Inherent and Acquired Values Facilitates Reward-Based Decision-Making.内在价值观与后天习得价值观的一致性有助于基于奖励的决策。
J Neurosci. 2016 May 4;36(18):5003-12. doi: 10.1523/JNEUROSCI.3084-15.2016.
2
Contextual modulation of value signals in reward and punishment learning.语境对奖惩学习中价值信号的调节作用。
Nat Commun. 2015 Aug 25;6:8096. doi: 10.1038/ncomms9096.
3
BOLD subjective value signals exhibit robust range adaptation.血氧水平依赖性功能磁共振成像主观价值信号表现出强大的范围适应性。
J Neurosci. 2014 Dec 3;34(49):16533-43. doi: 10.1523/JNEUROSCI.3927-14.2014.
4
Ventromedial Prefrontal Cortex Damage Is Associated with Decreased Ventral Striatum Volume and Response to Reward.腹内侧前额叶皮质损伤与腹侧纹状体体积减小及奖励反应降低有关。
J Neurosci. 2016 May 4;36(18):5047-54. doi: 10.1523/JNEUROSCI.4236-15.2016.
5
Interaction of Instrumental and Goal-Directed Learning Modulates Prediction Error Representations in the Ventral Striatum.工具性学习与目标导向学习的相互作用调节腹侧纹状体中的预测误差表征。
J Neurosci. 2016 Dec 14;36(50):12650-12660. doi: 10.1523/JNEUROSCI.1677-16.2016.
6
The Effect of Counterfactual Information on Outcome Value Coding in Medial Prefrontal and Cingulate Cortex: From an Absolute to a Relative Neural Code.反事实信息对内侧前额叶和扣带回皮层结果价值编码的影响:从绝对神经编码到相对神经编码。
J Neurosci. 2020 Apr 15;40(16):3268-3277. doi: 10.1523/JNEUROSCI.1712-19.2020. Epub 2020 Mar 10.
7
Neural evidence for adaptive strategy selection in value-based decision-making.基于价值的决策中适应性策略选择的神经证据。
Cereb Cortex. 2014 Aug;24(8):2009-21. doi: 10.1093/cercor/bht049. Epub 2013 Mar 8.
8
Signed Reward Prediction Errors in the Ventral Striatum Drive Episodic Memory.腹侧纹状体中的签名奖励预测误差驱动情景记忆。
J Neurosci. 2021 Feb 24;41(8):1716-1726. doi: 10.1523/JNEUROSCI.1785-20.2020. Epub 2020 Dec 17.
9
Overlapping prediction errors in dorsal striatum during instrumental learning with juice and money reward in the human brain.人类大脑在使用果汁和金钱奖励进行工具性学习过程中,背侧纹状体的预测误差存在重叠。
J Neurophysiol. 2009 Dec;102(6):3384-91. doi: 10.1152/jn.91195.2008. Epub 2009 Sep 30.
10
Neural basis of decision making guided by emotional outcomes.由情感结果引导的决策的神经基础。
J Neurophysiol. 2015 May 1;113(9):3056-68. doi: 10.1152/jn.00564.2014. Epub 2015 Feb 18.

引用本文的文献

1
Feature identification learning both shapes and is shaped by spatial object-similarity representations.特征识别学习形状,同时也受空间对象相似性表征的影响。
Commun Psychol. 2025 May 13;3(1):77. doi: 10.1038/s44271-025-00259-w.
2
Additive Effects of Monetary Loss and Positive Emotion in the Human Brain.金钱损失和积极情绪对人脑的附加效应。
eNeuro. 2024 Apr 17;11(4). doi: 10.1523/ENEURO.0374-23.2024. Print 2024 Apr.
3
Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T.状态和动作关联或区分泛化的强化学习:3T 和 7T 的 fMRI。
Hum Brain Mapp. 2022 Oct 15;43(15):4750-4790. doi: 10.1002/hbm.25988. Epub 2022 Jul 21.
4
Learning under social versus nonsocial uncertainty: A meta-analytic approach.在社会不确定性与非社会不确定性下的学习:一项元分析方法。
Hum Brain Mapp. 2022 Sep;43(13):4185-4206. doi: 10.1002/hbm.25948. Epub 2022 May 27.
5
Seeking the "Beauty Center" in the Brain: A Meta-Analysis of fMRI Studies of Beautiful Human Faces and Visual Art.在大脑中寻找“美丽中心”:对美丽人脸和视觉艺术 fMRI 研究的元分析。
Cogn Affect Behav Neurosci. 2020 Dec;20(6):1200-1215. doi: 10.3758/s13415-020-00827-z. Epub 2020 Oct 21.
6
Using reinforcement learning models in social neuroscience: frameworks, pitfalls and suggestions of best practices.在社会神经科学中使用强化学习模型:框架、陷阱和最佳实践建议。
Soc Cogn Affect Neurosci. 2020 Jul 30;15(6):695-707. doi: 10.1093/scan/nsaa089.
7
Functions of Learning Rate in Adaptive Reward Learning.自适应奖励学习中学习率的作用。
Front Hum Neurosci. 2017 Dec 6;11:592. doi: 10.3389/fnhum.2017.00592. eCollection 2017.

本文引用的文献

1
Model-based approaches to neuroimaging: combining reinforcement learning theory with fMRI data.基于模型的神经影像学方法:将强化学习理论与 fMRI 数据相结合。
Wiley Interdiscip Rev Cogn Sci. 2010 Jul;1(4):501-510. doi: 10.1002/wcs.57. Epub 2010 Apr 2.
2
Orthogonalization of regressors in FMRI models.功能磁共振成像模型中回归变量的正交化
PLoS One. 2015 Apr 28;10(4):e0126255. doi: 10.1371/journal.pone.0126255. eCollection 2015.
3
Functionally dissociable influences on learning rate in a dynamic environment.在动态环境中对学习速率的功能可分离影响。
Neuron. 2014 Nov 19;84(4):870-81. doi: 10.1016/j.neuron.2014.10.013.
4
Serotonin and dopamine differentially affect appetitive and aversive general Pavlovian-to-instrumental transfer.血清素和多巴胺对食欲性和厌恶性的一般巴甫洛夫式到工具性转换有不同影响。
Psychopharmacology (Berl). 2015 Jan;232(2):437-51. doi: 10.1007/s00213-014-3682-3. Epub 2014 Jul 18.
5
Fatal attraction: ventral striatum predicts costly choice errors in humans.致命诱惑:腹侧纹状体预测人类代价高昂的选择错误。
Neuroimage. 2014 Apr 1;89:1-9. doi: 10.1016/j.neuroimage.2013.11.039. Epub 2013 Nov 26.
6
Hierarchical prediction errors in midbrain and basal forebrain during sensory learning.中脑和基底前脑在感觉学习过程中的分层预测误差。
Neuron. 2013 Oct 16;80(2):519-30. doi: 10.1016/j.neuron.2013.09.009.
7
Reward prediction error signal enhanced by striatum-amygdala interaction explains the acceleration of probabilistic reward learning by emotion.纹状体-杏仁核相互作用增强的奖励预测误差信号解释了情绪对概率性奖励学习的加速作用。
J Neurosci. 2013 Mar 6;33(10):4487-93. doi: 10.1523/JNEUROSCI.3400-12.2013.
8
Category-dependent and category-independent goal-value codes in human ventromedial prefrontal cortex.人类腹内侧前额叶皮层中的类别相关和类别无关目标价值代码。
Nat Neurosci. 2013 Apr;16(4):479-85. doi: 10.1038/nn.3337. Epub 2013 Feb 17.
9
Neural correlates of specific and general Pavlovian-to-Instrumental Transfer within human amygdalar subregions: a high-resolution fMRI study.人类杏仁核亚区中特定和一般巴甫洛夫到工具性转移的神经相关物:一项高分辨率 fMRI 研究。
J Neurosci. 2012 Jun 13;32(24):8383-90. doi: 10.1523/JNEUROSCI.6237-11.2012.
10
Visual search: a retrospective.视觉搜索:一项回顾性研究。
J Vis. 2011 Dec 30;11(5):14. doi: 10.1167/11.5.14.