• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

面对波动的奖励率保持活力:一项实验研究。

Vigor in the face of fluctuating rates of reward: an experimental examination.

机构信息

University College London, 17 Queen Square, London, WC1N 3AR, United Kingdom.

出版信息

J Cogn Neurosci. 2011 Dec;23(12):3933-8. doi: 10.1162/jocn_a_00090. Epub 2011 Jul 7.

DOI:10.1162/jocn_a_00090
PMID:21736459
Abstract

Two fundamental questions underlie the expression of behavior, namely what to do and how vigorously to do it. The former is the topic of an overwhelming wealth of theoretical and empirical work particularly in the fields of reinforcement learning and decision-making, with various forms of affective prediction error playing key roles. Although vigor concerns motivation, and so is the subject of many empirical studies in diverse fields, it has suffered a dearth of computational models. Recently, Niv et al. [Niv, Y., Daw, N. D., Joel, D., & Dayan, P. Tonic dopamine: Opportunity costs and the control of response vigor. Psychopharmacology (Berlin), 191, 507-520, 2007] suggested that vigor should be controlled by the opportunity cost of time, which is itself determined by the average rate of reward. This coupling of reward rate and vigor can be shown to be optimal under the theory of average return reinforcement learning for a particular class of tasks but may also be a more general, perhaps hard-wired, characteristic of the architecture of control. We, therefore, tested the hypothesis that healthy human participants would adjust their RTs on the basis of the average rate of reward. We measured RTs in an odd-ball discrimination task for rewards whose magnitudes varied slowly but systematically. Linear regression on the subjects' individual RTs using the time varying average rate of reward as the regressor of interest, and including nuisance regressors such as the immediate reward in a round and in the preceding round, showed that a significant fraction of the variance in subjects' RTs could indeed be explained by the rate of experienced reward. This validates one of the key proposals associated with the model, illuminating an apparently mandatory form of coupling that may involve tonic levels of dopamine.

摘要

两个基本问题是行为表达的基础,即做什么和如何有力地做。前者是大量理论和经验工作的主题,特别是在强化学习和决策领域,各种形式的情感预测误差起着关键作用。尽管活力与动机有关,因此是许多不同领域的实证研究的主题,但它缺乏计算模型。最近,Niv 等人[Niv,Y.,Daw,N.D.,Joel,D.,Dayan,P. Tonic 多巴胺:机会成本和反应活力的控制。精神药理学(柏林),191,507-520,2007]认为,活力应该由时间的机会成本来控制,而时间的机会成本本身又是由平均奖励率决定的。这种奖励率和活力的耦合可以在平均回报强化学习理论下显示为一类特定任务的最优,但也可能是控制架构的更一般的、也许是硬性的特征。因此,我们测试了一个假设,即健康的人类参与者会根据奖励的平均率来调整他们的反应时间。我们在一个奇数球辨别任务中测量了奖励的反应时间,奖励的大小缓慢但系统地变化。使用时间变化的平均奖励率作为感兴趣的回归量,对受试者的个体反应时间进行线性回归,并包括即时奖励在一轮和前一轮的干扰回归量,表明受试者反应时间的很大一部分方差确实可以用经历的奖励率来解释。这验证了与该模型相关的关键假设之一,阐明了一种明显强制性的耦合形式,可能涉及多巴胺的紧张水平。

相似文献

1
Vigor in the face of fluctuating rates of reward: an experimental examination.面对波动的奖励率保持活力:一项实验研究。
J Cogn Neurosci. 2011 Dec;23(12):3933-8. doi: 10.1162/jocn_a_00090. Epub 2011 Jul 7.
2
Dopamine modulates reward-related vigor.多巴胺调节与奖励相关的活力。
Neuropsychopharmacology. 2013 Jul;38(8):1495-503. doi: 10.1038/npp.2013.48. Epub 2013 Feb 18.
3
Tonic dopamine: opportunity costs and the control of response vigor.紧张性多巴胺:机会成本与反应强度的控制
Psychopharmacology (Berl). 2007 Apr;191(3):507-20. doi: 10.1007/s00213-006-0502-4. Epub 2006 Oct 10.
4
Cost, benefit, tonic, phasic: what do response rates tell us about dopamine and motivation?成本、收益、紧张性、相位性:反应率能告诉我们关于多巴胺与动机的哪些信息?
Ann N Y Acad Sci. 2007 May;1104:357-76. doi: 10.1196/annals.1390.018. Epub 2007 Apr 7.
5
Dopamine Manipulation Affects Response Vigor Independently of Opportunity Cost.多巴胺调控对反应活力的影响独立于机会成本。
J Neurosci. 2016 Sep 14;36(37):9516-25. doi: 10.1523/JNEUROSCI.4467-15.2016.
6
Effects of average reward rate on vigor as a function of individual variation in striatal dopamine.纹状体多巴胺个体差异对平均奖励率活力功能的影响。
Psychopharmacology (Berl). 2022 Feb;239(2):465-478. doi: 10.1007/s00213-021-06017-0. Epub 2021 Nov 4.
7
Long-lasting effects of performance-contingent unconscious and conscious reward incentives during cued task-switching.在提示任务转换期间,基于表现的无意识和意识奖励激励的持久影响。
Cortex. 2013 Jul-Aug;49(7):1943-54. doi: 10.1016/j.cortex.2012.05.018. Epub 2012 Jun 12.
8
The Dopaminergic Midbrain Mediates an Effect of Average Reward on Pavlovian Vigor.多巴胺能中脑介导平均奖励对巴甫洛夫式活力的影响。
J Cogn Neurosci. 2016 Sep;28(9):1303-17. doi: 10.1162/jocn_a_00972. Epub 2016 Apr 15.
9
Overlapping prediction errors in dorsal striatum during instrumental learning with juice and money reward in the human brain.人类大脑在使用果汁和金钱奖励进行工具性学习过程中,背侧纹状体的预测误差存在重叠。
J Neurophysiol. 2009 Dec;102(6):3384-91. doi: 10.1152/jn.91195.2008. Epub 2009 Sep 30.
10
Reward-dependent learning in neuronal networks for planning and decision making.用于规划和决策的神经网络中基于奖励的学习。
Prog Brain Res. 2000;126:217-29. doi: 10.1016/S0079-6123(00)26016-0.

引用本文的文献

1
Reaching vigor tracks learned prediction error.达到活力追踪学习到的预测误差。
bioRxiv. 2025 Mar 25:2025.03.24.645035. doi: 10.1101/2025.03.24.645035.
2
Motivational Vigor in Parkinson's Disease Requires the Short and Long Duration Response to Levodopa.帕金森病的激励活力需要对左旋多巴的短期和长期反应。
Mov Disord. 2024 Jan;39(1):76-84. doi: 10.1002/mds.29659. Epub 2023 Dec 7.
3
Reframing dopamine: A controlled controller at the limbic-motor interface.重新定义多巴胺:边缘运动界面的受控控制器。
PLoS Comput Biol. 2023 Oct 17;19(10):e1011569. doi: 10.1371/journal.pcbi.1011569. eCollection 2023 Oct.
4
Opportunity cost determines free-operant action initiation latency and predicts apathy.机会成本决定自由操作行为启动潜伏期,并可预测冷漠。
Psychol Med. 2023 Apr;53(5):1850-1859. doi: 10.1017/S0033291721003469. Epub 2021 Oct 12.
5
The catecholamine precursor Tyrosine reduces autonomic arousal and decreases decision thresholds in reinforcement learning and temporal discounting.儿茶酚胺前体酪氨酸可降低自主唤醒,并降低强化学习和时间折扣中的决策阈值。
PLoS Comput Biol. 2022 Dec 22;18(12):e1010785. doi: 10.1371/journal.pcbi.1010785. eCollection 2022 Dec.
6
Average reward rates enable motivational transfer across independent reinforcement learning tasks.平均奖励率能够实现跨独立强化学习任务的动机转移。
Front Behav Neurosci. 2022 Nov 9;16:1041566. doi: 10.3389/fnbeh.2022.1041566. eCollection 2022.
7
An energizing role for motivation in information-seeking during the early phase of the COVID-19 pandemic.动机在 COVID-19 大流行早期信息搜索中的激励作用。
Nat Commun. 2022 Apr 28;13(1):2310. doi: 10.1038/s41467-022-30011-5.
8
Reward Value Enhances Sequence Monitoring Ramping Dynamics as Ending Rewards Approach in the Rostrolateral Prefrontal Cortex.奖赏值增强了终末奖赏临近时的序列监测上调动力学,该效应出现在额眶部前额皮质。
eNeuro. 2022 Mar 4;9(2). doi: 10.1523/ENEURO.0003-22.2022. Print 2022 Mar-Apr.
9
Cognitive Control as a Multivariate Optimization Problem.认知控制作为一个多元优化问题。
J Cogn Neurosci. 2022 Mar 5;34(4):569-591. doi: 10.1162/jocn_a_01822.
10
How the value of the environment controls persistence in visual search.环境价值如何控制视觉搜索中的持久性。
PLoS Comput Biol. 2021 Dec 14;17(12):e1009662. doi: 10.1371/journal.pcbi.1009662. eCollection 2021 Dec.