Temporal discounting correlates with directed exploration but not with random exploration.

Affiliations

Department of Psychology, University of Arizona, Tucson, USA.

Department of Psychological Science, Missouri University of Science and Technology, Rolla, USA.

Publication information

Sci Rep. 2020 Mar 4;10(1):4020. doi: 10.1038/s41598-020-60576-4.

DOI: 10.1038/s41598-020-60576-4
PMID: 32132573
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7055215/
Abstract

The explore-exploit dilemma describes the trade off that occurs any time we must choose between exploring unknown options and exploiting options we know well. Implicit in this trade off is how we value future rewards - exploiting is usually better in the short term, but in the longer term the benefits of exploration can be huge. Thus, in theory there should be a tight connection between how much people value future rewards, i.e. how much they discount future rewards relative to immediate rewards, and how likely they are to explore, with less 'temporal discounting' associated with more exploration. By measuring individual differences in temporal discounting and correlating them with explore-exploit behavior, we tested whether this theoretical prediction holds in practice. We used the 27-item Delay-Discounting Questionnaire to estimate temporal discounting and the Horizon Task to quantify two strategies of explore-exploit behavior: directed exploration, where information drives exploration by choice, and random exploration, where behavioral variability drives exploration by chance. We find a clear correlation between temporal discounting and directed exploration, with more temporal discounting leading to less directed exploration. Conversely, we find no relationship between temporal discounting and random exploration. Unexpectedly, we find that the relationship with directed exploration appears to be driven by a correlation between temporal discounting and uncertainty seeking at short time horizons, rather than information seeking at long horizons. Taken together our results suggest a nuanced relationship between temporal discounting and explore-exploit behavior that may be mediated by multiple factors.
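
The abstract does not say how questionnaire responses were turned into a discounting estimate. As a minimal sketch, assuming the hyperbolic form V = A / (1 + kD) that is commonly used for delay-discounting questionnaires, the snippet below computes the discount rate k at which a person would be indifferent between the two options of a single item. The `Item` class, the function names, and the amounts are hypothetical illustrations; they are not taken from the paper or from the questionnaire itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    """One hypothetical item: a smaller reward now vs. a larger reward later."""
    immediate: float   # amount available today
    delayed: float     # larger amount available after `delay_days`
    delay_days: float  # length of the delay, in days


def discounted_value(amount: float, delay_days: float, k: float) -> float:
    """Hyperbolic discounting (assumed model): V = A / (1 + k * D)."""
    return amount / (1.0 + k * delay_days)


def indifference_k(item: Item) -> float:
    """Discount rate k at which both options of the item have equal value.

    Solving immediate = delayed / (1 + k * D) for k gives
    k = (delayed / immediate - 1) / D.
    """
    return (item.delayed / item.immediate - 1.0) / item.delay_days


def prefers_delayed(item: Item, k: float) -> bool:
    """True if a person with discount rate k values the delayed option more."""
    return discounted_value(item.delayed, item.delay_days, k) > item.immediate


# Illustrative amounts only (not a real questionnaire item).
item = Item(immediate=50.0, delayed=100.0, delay_days=10.0)
k_star = indifference_k(item)
```

Under this assumed model, a respondent whose k lies below `k_star` would pick the delayed reward on this item and one whose k lies above it would pick the immediate reward, so a larger k corresponds to more temporal discounting.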

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5ee/7055215/0beca5dd28c7/41598_2020_60576_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5ee/7055215/0bc42858cf9a/41598_2020_60576_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5ee/7055215/1bee1d12fb97/41598_2020_60576_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5ee/7055215/13375af03458/41598_2020_60576_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5ee/7055215/67ab3c59f605/41598_2020_60576_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5ee/7055215/7f03b60e3d85/41598_2020_60576_Fig6_HTML.jpg

Similar articles

1
Temporal discounting correlates with directed exploration but not with random exploration.
Sci Rep. 2020 Mar 4;10(1):4020. doi: 10.1038/s41598-020-60576-4.
2
Differential Effects of Psychotic Illness on Directed and Random Exploration.
Comput Psychiatr. 2020;4:18-39. doi: 10.1162/cpsy_a_00027. Epub 2020 Aug 1.
3
Humans use directed and random exploration to solve the explore-exploit dilemma.
J Exp Psychol Gen. 2014 Dec;143(6):2074-81. doi: 10.1037/a0038199. Epub 2014 Oct 27.
4
A causal role for right frontopolar cortex in directed, but not random, exploration.
Elife. 2017 Sep 15;6:e27430. doi: 10.7554/eLife.27430.
5
Dopaminergic genes are associated with both directed and random exploration.
Neuropsychologia. 2018 Nov;120:97-104. doi: 10.1016/j.neuropsychologia.2018.10.009. Epub 2018 Oct 19.
6
Do the Powerful Discount the Future Less? The Effects of Power on Temporal Discounting.
Front Psychol. 2017 Jun 21;8:1007. doi: 10.3389/fpsyg.2017.01007. eCollection 2017.
7
"What time is my next meal?" delay-discounting individuals choose smaller portions under conditions of uncertainty.
Appetite. 2017 Sep 1;116:284-290. doi: 10.1016/j.appet.2017.05.019. Epub 2017 May 10.
8
Enhanced Neural Responses to Imagined Primary Rewards Predict Reduced Monetary Temporal Discounting.
J Neurosci. 2015 Sep 23;35(38):13103-9. doi: 10.1523/JNEUROSCI.1863-15.2015.
9
Development of directed and random exploration in children.
Dev Sci. 2021 Jul;24(4):e13095. doi: 10.1111/desc.13095. Epub 2021 Mar 8.
10
Balancing exploration and exploitation with information and randomization.
Curr Opin Behav Sci. 2021 Apr;38:49-56. doi: 10.1016/j.cobeha.2020.10.001. Epub 2020 Nov 6.

Cited by

1
Human Strategy Adaptation in Reinforcement Learning Resembles Policy Gradient Ascent.
bioRxiv. 2025 Jul 31:2025.07.28.667308. doi: 10.1101/2025.07.28.667308.
2
Deep Learning Improves Parameter Estimation in Reinforcement Learning Models.
bioRxiv. 2025 Jun 18:2025.03.21.644663. doi: 10.1101/2025.03.21.644663.
3
Basal ganglia deep brain stimulation restores cognitive flexibility and exploration-exploitation balance disrupted by NMDA-R antagonism.
Nat Commun. 2025 May 28;16(1):4963. doi: 10.1038/s41467-025-60044-5.
4
TMS-EEG evidence links random exploration to inhibitory mechanisms in the dorsolateral prefrontal cortex.
Sci Rep. 2025 May 5;15(1):15654. doi: 10.1038/s41598-025-00034-1.
5
Signatures of Perseveration and Heuristic-Based Directed Exploration in Two-Step Sequential Decision Task Behaviour.
Comput Psychiatr. 2025 Feb 11;9(1):39-62. doi: 10.5334/cpsy.101. eCollection 2025.
6
Navigating Uncertainty: The Role of Mood and Confidence in Decision-Making Flexibility and Performance.
Behav Sci (Basel). 2024 Nov 28;14(12):1144. doi: 10.3390/bs14121144.
7
A causal role of the right dorsolateral prefrontal cortex in random exploration.
Sci Rep. 2024 Oct 22;14(1):24796. doi: 10.1038/s41598-024-76025-5.
8
Differential impacts of healthy cognitive aging on directed and random exploration.
Psychol Aging. 2024 Feb;39(1):88-101. doi: 10.1037/pag0000791.
9
Meta-Analysis Reveals That Explore-Exploit Decisions are Dissociable by Activation in the Dorsal Lateral Prefrontal Cortex, Anterior Insula, and the Dorsal Anterior Cingulate Cortex.
bioRxiv. 2025 Mar 7:2023.10.21.563317. doi: 10.1101/2023.10.21.563317.
10
Instructed motivational states bias reinforcement learning and memory formation.
Proc Natl Acad Sci U S A. 2023 Aug;120(31):e2304881120. doi: 10.1073/pnas.2304881120. Epub 2023 Jul 25.

References

1
Four converging measures of temporal discounting and their relationships with intelligence, executive functions, thinking dispositions, and behavioral outcomes.
Front Psychol. 2015 Jun 5;6:728. doi: 10.3389/fpsyg.2015.00728. eCollection 2015.