Humans can adopt optimal discounting strategy under real-time constraints.

Authors

Schweighofer N, Shishida K, Han CE, Okamoto Y, Tanaka SC, Yamawaki S, Doya K

Affiliation

Biokinesiology and Physical Therapy, University of Southern California, Los Angeles, United States of America.

Publication

PLoS Comput Biol. 2006 Nov 10;2(11):e152. doi: 10.1371/journal.pcbi.0020152. Epub 2006 Oct 4.

DOI:10.1371/journal.pcbi.0020152
PMID:17096592
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1635539/
Abstract

Critical to our many daily choices between larger delayed rewards, and smaller more immediate rewards, are the shape and the steepness of the function that discounts rewards with time. Although research in artificial intelligence favors exponential discounting in uncertain environments, studies with humans and animals have consistently shown hyperbolic discounting. We investigated how humans perform in a reward decision task with temporal constraints, in which each choice affects the time remaining for later trials, and in which the delays vary at each trial. We demonstrated that most of our subjects adopted exponential discounting in this experiment. Further, we confirmed analytically that exponential discounting, with a decay rate comparable to that used by our subjects, maximized the total reward gain in our task. Our results suggest that the particular shape and steepness of temporal discounting is determined by the task that the subject is facing, and question the notion of hyperbolic reward discounting as a universal principle.
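The two discounting shapes contrasted in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the functional forms are the standard textbook ones (exponential `R * exp(-k * D)`, hyperbolic `R / (1 + k * D)`), and the reward sizes, delays, and rate `k` are made-up values, not the task parameters or fitted rates from the paper. The helper name `prefers_larger_later` is hypothetical.

```python
import math


def exponential_value(reward, delay, k):
    """Exponentially discounted value: reward * exp(-k * delay)."""
    return reward * math.exp(-k * delay)


def hyperbolic_value(reward, delay, k):
    """Hyperbolically discounted value: reward / (1 + k * delay)."""
    return reward / (1.0 + k * delay)


def prefers_larger_later(discount, small, small_delay, large, large_delay, k):
    """True if the larger, later reward has the higher discounted value."""
    return discount(large, large_delay, k) > discount(small, small_delay, k)


if __name__ == "__main__":
    # Illustrative values only (assumptions, not taken from the paper).
    small, large, k = 5.0, 10.0, 1.0
    # Same pair of options, offered now and after a common extra wait of 10.
    for extra_wait in (0.0, 10.0):
        for name, fn in (("exponential", exponential_value),
                         ("hyperbolic", hyperbolic_value)):
            choice = prefers_larger_later(
                fn, small, extra_wait, large, extra_wait + 2.0, k
            )
            print(f"extra_wait={extra_wait:4.1f} {name:11s} "
                  f"prefers larger-later: {choice}")
```

With these illustrative numbers, the exponential chooser makes the same choice whether or not a common wait is added to both options, while the hyperbolic chooser switches to the larger, later reward once both options are pushed into the future. That difference in shape is what distinguishes the two models; steepness is controlled by `k` in both.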

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf96/1664701/a89d7b61d1d4/pcbi.0020152.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf96/1664701/f72622bca8cb/pcbi.0020152.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf96/1664701/fd7b185ef279/pcbi.0020152.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf96/1664701/26b5ce603c30/pcbi.0020152.g004.jpg
