简单的阈值规则解决了资源积累搜索任务中的探索/利用权衡问题。

Simple Threshold Rules Solve Explore/Exploit Trade-offs in a Resource Accumulation Search Task.

机构信息

Cognitive Science Program and Department of Psychological and Brain Sciences, Indiana University Bloomington.

Indeed, Inc.

出版信息

Cogn Sci. 2020 Feb;44(2):e12817. doi: 10.1111/cogs.12817.

DOI:10.1111/cogs.12817

PMID:32065692

Abstract

How, and how well, do people switch between exploration and exploitation to search for and accumulate resources? We study the decision processes underlying such exploration/exploitation trade-offs using a novel card selection task that captures the common situation of searching among multiple resources (e.g., jobs) that can be exploited without depleting. With experience, participants learn to switch appropriately between exploration and exploitation and approach optimal performance. We model participants' behavior on this task with random, threshold, and sampling strategies, and find that a linear decreasing threshold rule best fits participants' results. Further evidence that participants use decreasing threshold-based strategies comes from reaction time differences between exploration and exploitation; however, participants themselves report non-decreasing thresholds. Decreasing threshold strategies that "front-load" exploration and switch quickly to exploitation are particularly effective in resource accumulation tasks, in contrast to optimal stopping problems like the Secretary Problem requiring longer exploration.

摘要

人们如何以及在多大程度上能够在探索和利用之间进行切换，以搜索和积累资源？我们使用一种新颖的卡片选择任务来研究这种探索/利用权衡背后的决策过程，该任务可以在不耗尽资源的情况下捕获对多种资源（例如工作）的常见搜索情况。随着经验的积累，参与者学会在探索和利用之间进行适当的切换，并接近最佳表现。我们使用随机、阈值和抽样策略对参与者在该任务中的行为进行建模，发现线性递减阈值规则最符合参与者的结果。参与者使用基于递减阈值的策略的进一步证据来自于探索和利用之间的反应时间差异；然而，参与者自己报告的阈值是非递减的。在资源积累任务中，递减阈值策略“前置”探索并快速切换到利用，这特别有效，而像秘书问题这样的最优停止问题则需要更长的探索时间。

相似文献

Simple Threshold Rules Solve Explore/Exploit Trade-offs in a Resource Accumulation Search Task.简单的阈值规则解决了资源积累搜索任务中的探索/利用权衡问题。

Cogn Sci. 2020 Feb;44(2):e12817. doi: 10.1111/cogs.12817.

Dopamine blockade impairs the exploration-exploitation trade-off in rats.多巴胺阻断会损害大鼠的探索-利用权衡。

Sci Rep. 2019 May 1;9(1):6770. doi: 10.1038/s41598-019-43245-z.

Humans use directed and random exploration to solve the explore-exploit dilemma.人类利用有向探索和随机探索来解决探索与利用的两难困境。

J Exp Psychol Gen. 2014 Dec;143(6):2074-81. doi: 10.1037/a0038199. Epub 2014 Oct 27.

Sex differences in learning from exploration.从探索中学习的性别差异。

Elife. 2021 Nov 19;10:e69748. doi: 10.7554/eLife.69748.

Transcranial Stimulation over Frontopolar Cortex Elucidates the Choice Attributes and Neural Mechanisms Used to Resolve Exploration-Exploitation Trade-Offs.经颅刺激额极皮层揭示了用于解决探索-利用权衡的选择属性和神经机制。

J Neurosci. 2015 Oct 28;35(43):14544-56. doi: 10.1523/JNEUROSCI.2322-15.2015.

Sources of suboptimality in a minimalistic explore-exploit task.在一个极简探索-利用任务中次优的来源。

Nat Hum Behav. 2019 Apr;3(4):361-368. doi: 10.1038/s41562-018-0526-x. Epub 2019 Feb 11.

Boldness predicts an individual's position along an exploration-exploitation foraging trade-off.大胆程度预示着个体在探索-利用觅食权衡中的位置。

J Anim Ecol. 2017 Sep;86(5):1257-1268. doi: 10.1111/1365-2656.12724. Epub 2017 Jul 24.

Exploration versus exploitation decisions in the human brain: A systematic review of functional neuroimaging and neuropsychological studies.人类大脑中的探索与开发决策：功能神经影像学和神经心理学研究的系统综述。

Neuropsychologia. 2024 Jan 10;192:108740. doi: 10.1016/j.neuropsychologia.2023.108740. Epub 2023 Nov 29.

Finding structure in multi-armed bandits.在多臂老虎机中寻找结构。

Cogn Psychol. 2020 Jun;119:101261. doi: 10.1016/j.cogpsych.2019.101261. Epub 2020 Feb 12.

Children are more exploratory and learn more than adults in an approach-avoid task.儿童在趋近-回避任务中比成人更具探索性，也学得更多。

Cognition. 2022 Jan;218:104940. doi: 10.1016/j.cognition.2021.104940. Epub 2021 Oct 26.

引用本文的文献

One factor to bind them all: visual foraging organization to predict patch leaving behavior with ROC curves.一个因素统揽全局：用视觉觅食组织和ROC曲线预测离开斑块行为

Cogn Res Princ Implic. 2025 Apr 5;10(1):16. doi: 10.1186/s41235-025-00624-7.

Testing the convergent validity, domain generality, and temporal stability of selected measures of people's tendency to explore.检验探索倾向的特定测量指标的聚合效度、领域普遍性和时间稳定性。

Nat Commun. 2024 Sep 4;15(1):7721. doi: 10.1038/s41467-024-51685-z.

Data-driven Interpretable Policy Construction for Personalized Mobile Health.用于个性化移动健康的数据驱动可解释策略构建

2022 IEEE Int Conf Digit Health IEEE IDCH 2022 (2022). 2022 Jul;2022:13-22. doi: 10.1109/ICDH55609.2022.00010. Epub 2022 Aug 24.

From exploration to exploitation: a shifting mental mode in late life development.从探索到开发：晚年发展中思维模式的转变。

Trends Cogn Sci. 2021 Dec;25(12):1058-1071. doi: 10.1016/j.tics.2021.09.001. Epub 2021 Sep 27.

Beyond stereotypes: Using socioemotional selectivity theory to improve messaging to older adults.超越刻板印象：运用社会情感选择理论改进针对老年人的信息传达

Curr Dir Psychol Sci. 2021 Aug 1;30(4):327-334. doi: 10.1177/09637214211011468. Epub 2021 Jun 25.

A linear threshold model for optimal stopping behavior.线性阈值模型用于最优停止行为。

Proc Natl Acad Sci U S A. 2020 Jun 9;117(23):12750-12755. doi: 10.1073/pnas.2002312117. Epub 2020 May 27.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

简单的阈值规则解决了资源积累搜索任务中的探索/利用权衡问题。

Simple Threshold Rules Solve Explore/Exploit Trade-offs in a Resource Accumulation Search Task.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献