Humans adaptively resolve the explore-exploit dilemma under cognitive constraints: Evidence from a multi-armed bandit task.

Affiliations

Department of Psychiatry, University of Pittsburgh, Pittsburgh, PA, USA.

Department of Psychology, Pennsylvania State University, State College, PA, USA; Department of Psychology and Neuroscience, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.

Publication information

Cognition. 2022 Dec;229:105233. doi: 10.1016/j.cognition.2022.105233. Epub 2022 Jul 30.


DOI:10.1016/j.cognition.2022.105233
PMID:35917612
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9530017/
Abstract

When navigating uncertain worlds, humans must balance exploring new options versus exploiting known rewards. Longer horizons and spatially structured option values encourage humans to explore, but the impact of real-world cognitive constraints such as environment size and memory demands on explore-exploit decisions is unclear. In the present study, humans chose between options varying in uncertainty during a multi-armed bandit task with varying environment size and memory demands. Regression and cognitive computational models of choice behavior showed that with a lower cognitive load, humans are more exploratory than a simulated value-maximizing learner, but under cognitive constraints, they adaptively scale down exploration to maintain exploitation. Thus, while humans are curious, cognitive constraints force people to decrease their strategic exploration in a resource-rational-like manner to focus on harvesting known rewards.
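To make the task setup concrete, here is a minimal sketch of a multi-armed bandit in which a simulated learner adds an uncertainty bonus to each option's estimated value. It is not the authors' task or model: the Bernoulli payoffs, the UCB-style bonus formula, and the names `run_bandit`, `n_arms`, and `bonus` are all assumptions chosen for illustration. Setting `bonus` to 0 gives a purely greedy (exploit-only) learner, and `n_arms` stands in loosely for environment size.

```python
import math
import random


def run_bandit(n_arms, n_trials, bonus, seed=0):
    """Simulate a UCB-style learner on a hypothetical Bernoulli bandit.

    Returns (total_reward, number_of_distinct_arms_sampled).
    """
    rng = random.Random(seed)
    # Assumed environment: each arm pays 1 with a fixed hidden probability.
    probs = [rng.random() for _ in range(n_arms)]
    counts = [0] * n_arms    # how often each arm has been chosen
    values = [0.5] * n_arms  # running estimate of each arm's payoff rate
    total = 0

    for t in range(1, n_trials + 1):
        def score(a):
            # Estimated value plus a bonus that shrinks as the arm is sampled.
            return values[a] + bonus * math.sqrt(math.log(t + 1) / (counts[a] + 1))

        arm = max(range(n_arms), key=score)
        reward = 1 if rng.random() < probs[arm] else 0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean
        total += reward

    return total, sum(1 for c in counts if c > 0)
```

Calling, for example, `run_bandit(4, 100, bonus=1.0)` and `run_bandit(16, 100, bonus=1.0)` lets one compare how the same exploration bonus plays out in a small versus a large environment. The sketch does not model memory demands.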
