• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

奖励在动态决策中的作用。

The role of reward in dynamic decision making.

作者信息

Osman Magda

机构信息

Biological and Experimental Psychology Centre, School of Biological and Chemical Sciences, Queen Mary College, University of London London, UK.

出版信息

Front Neurosci. 2012 Mar 20;6:35. doi: 10.3389/fnins.2012.00035. eCollection 2012.

DOI:10.3389/fnins.2012.00035
PMID:22454616
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3308334/
Abstract

The present study investigates two aspects of decision making that have yet to be explored within a dynamic environment, (1) comparing the accuracy of cue-outcome knowledge under conditions in which knowledge acquisition is either through Prediction or Choice, and (2) examining the effects of reward on both Prediction and Choice. In the present study participants either learnt about the cue-outcome relations in the environment by choosing cue values in order to maintain an outcome to criterion (Choice-based decision making), or learnt to predict the outcome from seeing changes to the cue values (Prediction-based decision making). During training participants received outcome feedback and one of four types of reward manipulations: Positive Reward, Negative Reward, Both Positive + Negative Reward, No Reward. After training both groups of learners were tested on prediction and choice-based tasks. In the main, the findings revealed that cue-outcome knowledge was more accurate when knowledge acquisition was Choice-based rather than Prediction-based. During learning Negative Reward adversely affected Choice-based decision making while Positive Reward adversely affected predictive-based decision making. During the test phase only performance on tests of choice was adversely affected by having received Positive Reward or Negative Reward during training. This article proposes that the adverse effects of reward may reflect the additional demands placed on processing rewards which compete for cognitive resources required to perform the main goal of the task. This in turn implies that, rather than facilitate decision making, the presentation of rewards can interfere with Choice-based and Prediction-based decisions.

摘要

本研究调查了在动态环境中尚未被探索的决策的两个方面

(1)比较在通过预测或选择获取知识的条件下线索-结果知识的准确性;(2)检验奖励对预测和选择的影响。在本研究中,参与者要么通过选择线索值以维持结果达到标准来了解环境中的线索-结果关系(基于选择的决策),要么通过观察线索值的变化来学习预测结果(基于预测的决策)。在训练期间,参与者会收到结果反馈以及四种奖励操纵之一:正奖励、负奖励、正负奖励都有、无奖励。训练后,两组学习者都要接受基于预测和选择的任务测试。总体而言,研究结果表明,当基于选择获取知识而非基于预测时,线索-结果知识更准确。在学习过程中,负奖励对基于选择的决策有不利影响,而正奖励对基于预测的决策有不利影响。在测试阶段,只有选择测试的表现会受到训练期间接受正奖励或负奖励的不利影响。本文提出,奖励的不利影响可能反映了处理奖励所带来的额外需求,这些需求与执行任务主要目标所需的认知资源相竞争。这反过来意味着,奖励的呈现非但促进决策,反而会干扰基于选择和基于预测的决策。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ea/3308334/4769ad4a3381/fnins-06-00035-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ea/3308334/3ff4aaaabc7f/fnins-06-00035-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ea/3308334/0d304d2e95b0/fnins-06-00035-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ea/3308334/e5193e099c46/fnins-06-00035-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ea/3308334/4769ad4a3381/fnins-06-00035-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ea/3308334/3ff4aaaabc7f/fnins-06-00035-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ea/3308334/0d304d2e95b0/fnins-06-00035-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ea/3308334/e5193e099c46/fnins-06-00035-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21ea/3308334/4769ad4a3381/fnins-06-00035-g004.jpg

相似文献

1
The role of reward in dynamic decision making.奖励在动态决策中的作用。
Front Neurosci. 2012 Mar 20;6:35. doi: 10.3389/fnins.2012.00035. eCollection 2012.
2
Differential Contributions of Nucleus Accumbens Subregions to Cue-Guided Risk/Reward Decision Making and Implementation of Conditional Rules.伏隔核亚区对线索引导的风险/回报决策及条件规则实施的差异贡献。
J Neurosci. 2018 Feb 21;38(8):1901-1914. doi: 10.1523/JNEUROSCI.3191-17.2018. Epub 2018 Jan 18.
3
Temporal Dynamics Underlying Prelimbic Prefrontal Cortical Regulation of Action Selection and Outcome Evaluation during Risk/Reward Decision-Making.风险/回报决策过程中边缘前额皮质对动作选择和结果评估的调节的时间动态。
J Neurosci. 2023 Feb 15;43(7):1238-1255. doi: 10.1523/JNEUROSCI.0802-22.2022. Epub 2023 Jan 6.
4
Normative decision rules in changing environments.规范决策规则在不断变化的环境中。
Elife. 2022 Oct 25;11:e79824. doi: 10.7554/eLife.79824.
5
Temporal dynamics of prediction error processing during reward-based decision making.基于奖励的决策过程中预测误差处理的时间动态。
Neuroimage. 2010 Oct 15;53(1):221-32. doi: 10.1016/j.neuroimage.2010.05.052. Epub 2010 May 25.
6
Electrophysiological correlates of prediction formation in anticipation of reward- and punishment-related feedback signals.在预期与奖励和惩罚相关的反馈信号时,预测形成的电生理相关性。
Psychophysiology. 2019 Aug;56(8):e13379. doi: 10.1111/psyp.13379. Epub 2019 Apr 26.
7
How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.我们如何学习做决策:强化学习预测错误在人类中的快速传播。
J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.
8
Credit Assignment in a Motor Decision Making Task Is Influenced by Agency and Not Sensory Prediction Errors.在一项运动决策任务中,信用分配受机构影响,而不受感官预测误差影响。
J Neurosci. 2018 May 9;38(19):4521-4530. doi: 10.1523/JNEUROSCI.3601-17.2018. Epub 2018 Apr 12.
9
Prediction and control in a dynamic environment.动态环境中的预测与控制
Front Psychol. 2012 Mar 13;3:68. doi: 10.3389/fpsyg.2012.00068. eCollection 2012.
10
Deliberative Decision-Making in Macaques Removes Reward-Driven Response Vigor.猕猴的审慎决策消除了奖励驱动的反应活力。
Front Behav Neurosci. 2021 Aug 18;15:674169. doi: 10.3389/fnbeh.2021.674169. eCollection 2021.

引用本文的文献

1
Complex Problem Solving: What It Is and What It Is Not.复杂问题解决:它是什么以及不是什么。
Front Psychol. 2017 Jul 11;8:1153. doi: 10.3389/fpsyg.2017.01153. eCollection 2017.
2
Editorial: Decision-making experiments under a philosophical analysis: human choice as a challenge for neuroscience.社论:哲学分析视角下的决策实验:人类选择对神经科学的挑战
Front Neurosci. 2015 Aug 18;9:288. doi: 10.3389/fnins.2015.00288. eCollection 2015.
3
Neurophilosophical considerations on decision making: Pushing-up the frontiers without disregarding their foundations.

本文引用的文献

1
Prediction and control in a dynamic environment.动态环境中的预测与控制
Front Psychol. 2012 Mar 13;3:68. doi: 10.3389/fpsyg.2012.00068. eCollection 2012.
2
Observation can be as effective as action in problem solving.观察在解决问题方面和行动一样有效。
Cogn Sci. 2008 Jan 2;32(1):162-83. doi: 10.1080/03640210701703683.
3
Choice modulates the neural dynamics of prediction error processing during rewarded learning.选择调节奖励学习过程中预测误差处理的神经动力学。
关于决策的神经哲学思考:拓展前沿而不忽视其基础。
Front Neurosci. 2013 Dec 30;7:261. doi: 10.3389/fnins.2013.00261. eCollection 2013.
Neuroimage. 2011 Jan 15;54(2):1385-94. doi: 10.1016/j.neuroimage.2010.09.051. Epub 2010 Sep 25.
4
Controlling uncertainty: a review of human behavior in complex dynamic environments.控制不确定性:复杂动态环境中的人类行为综述。
Psychol Bull. 2010 Jan;136(1):65-86. doi: 10.1037/a0017815.
5
Dopaminergic drugs modulate learning rates and perseveration in Parkinson's patients in a dynamic foraging task.多巴胺能药物在动态觅食任务中调节帕金森病患者的学习率和持续性。
J Neurosci. 2009 Dec 2;29(48):15104-14. doi: 10.1523/JNEUROSCI.3524-09.2009.
6
Complex problem solving: a case for complex cognition?复杂问题解决:复杂认知的一个实例?
Cogn Process. 2010 May;11(2):133-42. doi: 10.1007/s10339-009-0345-0. Epub 2009 Nov 10.
7
How green is the grass on the other side? Frontopolar cortex and the evidence in favor of alternative courses of action.对岸的草有多绿?额极皮质与支持其他行动方案的证据。
Neuron. 2009 Jun 11;62(5):733-43. doi: 10.1016/j.neuron.2009.05.014.
8
Adaptive coding of action values in the human rostral cingulate zone.人类吻侧扣带区动作值的适应性编码
J Neurosci. 2009 Jun 10;29(23):7489-96. doi: 10.1523/JNEUROSCI.0349-09.2009.
9
Explicit neural signals reflecting reward uncertainty.反映奖励不确定性的明确神经信号。
Philos Trans R Soc Lond B Biol Sci. 2008 Dec 12;363(1511):3801-11. doi: 10.1098/rstb.2008.0152.
10
Patients with Parkinson's disease learn to control complex systems via procedural as well as non-procedural learning.帕金森病患者通过程序性学习和非程序性学习来控制复杂系统。
Neuropsychologia. 2008;46(9):2355-63. doi: 10.1016/j.neuropsychologia.2008.03.009. Epub 2008 Mar 22.