人类探索性决策的皮质基础。

Cortical substrates for exploratory decisions in humans.

作者信息

Daw Nathaniel D, O'Doherty John P, Dayan Peter, Seymour Ben, Dolan Raymond J

机构信息

Gatsby Computational Neuroscience Unit, University College London (UCL), Alexandra House, 17 Queen Square, London WC1N 3AR, UK.

出版信息

Nature. 2006 Jun 15;441(7095):876-9. doi: 10.1038/nature04766.

DOI:10.1038/nature04766

PMID:16778890

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2635947/

Abstract

Decision making in an uncertain environment poses a conflict between the opposing demands of gathering and exploiting information. In a classic illustration of this 'exploration-exploitation' dilemma, a gambler choosing between multiple slot machines balances the desire to select what seems, on the basis of accumulated experience, the richest option, against the desire to choose a less familiar option that might turn out more advantageous (and thereby provide information for improving future decisions). Far from representing idle curiosity, such exploration is often critical for organisms to discover how best to harvest resources such as food and water. In appetitive choice, substantial experimental evidence, underpinned by computational reinforcement learning (RL) theory, indicates that a dopaminergic, striatal and medial prefrontal network mediates learning to exploit. In contrast, although exploration has been well studied from both theoretical and ethological perspectives, its neural substrates are much less clear. Here we show, in a gambling task, that human subjects' choices can be characterized by a computationally well-regarded strategy for addressing the explore/exploit dilemma. Furthermore, using this characterization to classify decisions as exploratory or exploitative, we employ functional magnetic resonance imaging to show that the frontopolar cortex and intraparietal sulcus are preferentially active during exploratory decisions. In contrast, regions of striatum and ventromedial prefrontal cortex exhibit activity characteristic of an involvement in value-based exploitative decision making. The results suggest a model of action selection under uncertainty that involves switching between exploratory and exploitative behavioural modes, and provide a computationally precise characterization of the contribution of key decision-related brain systems to each of these functions.

摘要

在不确定环境中进行决策会在收集信息和利用信息这两种相互对立的需求之间产生冲突。在这个“探索 - 利用”困境的经典示例中，一名在多个老虎机之间做出选择的赌徒，需要在基于积累经验选择看起来最有收益的选项的欲望，与选择一个可能更具优势（从而为改进未来决策提供信息）但不太熟悉的选项的欲望之间进行权衡。这种探索远非代表着无意义的好奇心，对于生物体发现如何最好地获取食物和水等资源通常至关重要。在偏好选择中，大量基于计算强化学习（RL）理论的实验证据表明，多巴胺能、纹状体和内侧前额叶网络介导了利用性学习。相比之下，尽管从理论和行为学角度对探索都进行了充分研究，但其神经基础却不太明确。在这里，我们在一项赌博任务中表明，人类受试者的选择可以通过一种在计算上备受认可的策略来表征，该策略用于解决探索/利用困境。此外，利用这种表征将决策分类为探索性或利用性，我们采用功能磁共振成像来表明，在探索性决策过程中，额极皮质和顶内沟优先活跃。相比之下，纹状体和腹内侧前额叶皮质区域表现出参与基于价值的利用性决策的活动特征。这些结果提出了一种在不确定情况下的行动选择模型，该模型涉及在探索性和利用性行为模式之间切换，并为关键决策相关脑系统对这些功能的贡献提供了计算上精确的表征。

相似文献

Cortical substrates for exploratory decisions in humans.

Nature. 2006 Jun 15;441(7095):876-9. doi: 10.1038/nature04766.

The neurocomputational bases of explore-exploit decision-making.

Neuron. 2022 Jun 1;110(11):1869-1879.e5. doi: 10.1016/j.neuron.2022.03.014. Epub 2022 Apr 6.

Transcranial Stimulation over Frontopolar Cortex Elucidates the Choice Attributes and Neural Mechanisms Used to Resolve Exploration-Exploitation Trade-Offs.

J Neurosci. 2015 Oct 28;35(43):14544-56. doi: 10.1523/JNEUROSCI.2322-15.2015.

Primate Orbitofrontal Cortex Codes Information Relevant for Managing Explore-Exploit Tradeoffs.

J Neurosci. 2020 Mar 18;40(12):2553-2561. doi: 10.1523/JNEUROSCI.2355-19.2020. Epub 2020 Feb 14.

A frontal dopamine system for reflective exploratory behavior.

Neurobiol Learn Mem. 2015 Sep;123:84-91. doi: 10.1016/j.nlm.2015.05.004. Epub 2015 May 22.

Ready, set, explore! Event-related potentials reveal the time-course of exploratory decisions.

Brain Res. 2019 Sep 15;1719:183-193. doi: 10.1016/j.brainres.2019.05.039. Epub 2019 May 29.

Learning the value of information and reward over time when solving exploration-exploitation problems.

Sci Rep. 2017 Dec 5;7(1):16919. doi: 10.1038/s41598-017-17237-w.

Subcortical Substrates of Explore-Exploit Decisions in Primates.

Neuron. 2019 Aug 7;103(3):533-545.e5. doi: 10.1016/j.neuron.2019.05.017. Epub 2019 Jun 10.

Prefrontal cortex activity is reduced in gambling and nongambling substance users during decision-making.

Hum Brain Mapp. 2007 Dec;28(12):1276-86. doi: 10.1002/hbm.20344.

Reward-dependent learning in neuronal networks for planning and decision making.

Prog Brain Res. 2000;126:217-29. doi: 10.1016/S0079-6123(00)26016-0.

引用本文的文献

Value-Directed Remembering: A Dual-Process Perspective.

Behav Sci (Basel). 2025 Aug 17;15(8):1113. doi: 10.3390/bs15081113.

Rate and noise in human amygdala drive increased exploration in aversive learning.

Nature. 2025 Aug 27. doi: 10.1038/s41586-025-09466-1.

Cortical network modulations associated with prolonged training of the multiple object-tracking task.

Imaging Neurosci (Camb). 2025 May 12;3. doi: 10.1162/imag_a_00577. eCollection 2025.

Data-driven equation discovery reveals nonlinear reinforcement learning in humans.

Proc Natl Acad Sci U S A. 2025 Aug 5;122(31):e2413441122. doi: 10.1073/pnas.2413441122. Epub 2025 Jul 31.

Further examining how animals weigh conflicting information about reward sources over time.

Anim Cogn. 2025 Jul 30;28(1):74. doi: 10.1007/s10071-025-01982-x.

Model-based exploration is measurable across tasks but not linked to personality and psychiatric assessments.

Sci Rep. 2025 Jul 28;15(1):27479. doi: 10.1038/s41598-025-09152-2.

A Foraging-Theory-Based Model Captures the Spectrum of Human Behavioral Diversity in Sequential Decision Making.

bioRxiv. 2025 Jun 10:2025.05.06.652482. doi: 10.1101/2025.05.06.652482.

Assessing social anhedonia in a transdiagnostic sample: Insights from a computational psychiatry lens.

J Mood Anxiety Disord. 2024 Sep 17;8:100088. doi: 10.1016/j.xjmad.2024.100088. eCollection 2024 Dec.

Exploration is associated with socioeconomic disparities in learning and academic achievement in adolescence.

Nat Commun. 2025 Jul 9;16(1):6342. doi: 10.1038/s41467-025-61746-6.

Discovering cognitive strategies with tiny recurrent neural networks.

Nature. 2025 Jul 2. doi: 10.1038/s41586-025-09142-4.

本文引用的文献

Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control.

Nat Neurosci. 2005 Dec;8(12):1704-11. doi: 10.1038/nn1560. Epub 2005 Nov 6.

The functional organization of the intraparietal sulcus in humans and monkeys.

J Anat. 2005 Jul;207(1):3-17. doi: 10.1111/j.1469-7580.2005.00426.x.

Midbrain dopamine neurons encode a quantitative reward prediction error signal.

Neuron. 2005 Jul 7;47(1):129-41. doi: 10.1016/j.neuron.2005.05.020.

Reward representations and reward-related learning in the human brain: insights from neuroimaging.

Curr Opin Neurobiol. 2004 Dec;14(6):769-76. doi: 10.1016/j.conb.2004.10.016.

Separate neural systems value immediate and delayed monetary rewards.

Science. 2004 Oct 15;306(5695):503-7. doi: 10.1126/science.1100907.

Activity in posterior parietal cortex is correlated with the relative subjective desirability of action.

Neuron. 2004 Oct 14;44(2):365-78. doi: 10.1016/j.neuron.2004.09.009.

Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops.

Nat Neurosci. 2004 Aug;7(8):887-93. doi: 10.1038/nn1279. Epub 2004 Jul 4.

Matching behavior and the representation of value in the parietal cortex.

Science. 2004 Jun 18;304(5678):1782-7. doi: 10.1126/science.1094765.

Dissociable roles of ventral and dorsal striatum in instrumental conditioning.

Science. 2004 Apr 16;304(5669):452-4. doi: 10.1126/science.1094285.

Anterior prefrontal cortex: insights into function from anatomy and neuroimaging.

Nat Rev Neurosci. 2004 Mar;5(3):184-94. doi: 10.1038/nrn1343.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

人类探索性决策的皮质基础。

Cortical substrates for exploratory decisions in humans.

作者信息

Daw Nathaniel D, O'Doherty John P, Dayan Peter, Seymour Ben, Dolan Raymond J

机构信息

Gatsby Computational Neuroscience Unit, University College London (UCL), Alexandra House, 17 Queen Square, London WC1N 3AR, UK.

出版信息

Nature. 2006 Jun 15;441(7095):876-9. doi: 10.1038/nature04766.

DOI:10.1038/nature04766

PMID:16778890

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2635947/

Abstract

摘要

人类探索性决策的皮质基础。

Cortical substrates for exploratory decisions in humans.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

人类探索性决策的皮质基础。

Cortical substrates for exploratory decisions in humans.

作者信息

机构信息

出版信息