Subcortical Substrates of Explore-Exploit Decisions in Primates.

Affiliations

Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda, MD 20892, USA; Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, OR 97239, USA; Division of Neuroscience, Oregon National Primate Research Center, Beaverton, OR 97006, USA.

Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Bethesda, MD 20892, USA.

Publication information

Neuron. 2019 Aug 7;103(3):533-545.e5. doi: 10.1016/j.neuron.2019.05.017. Epub 2019 Jun 10.

Abstract

The explore-exploit dilemma refers to the challenge of deciding when to forego immediate rewards and explore new opportunities that could lead to greater rewards in the future. While motivational neural circuits facilitate learning based on past choices and outcomes, it is unclear whether they also support computations relevant for deciding when to explore. We recorded neural activity in the amygdala and ventral striatum of rhesus macaques as they solved a task that required them to balance novelty-driven exploration with exploitation of what they had already learned. Using a partially observable Markov decision process (POMDP) model to quantify explore-exploit trade-offs, we identified that the ventral striatum and amygdala differ in how they represent the immediate value of exploitative choices and the future value of exploratory choices. These findings show that subcortical motivational circuits are important in guiding explore-exploit decisions.
