Suppr超能文献

基于理论的强化学习的神经结构。

The neural architecture of theory-based reinforcement learning.

机构信息

Department of Psychology and Center for Brain Science, Harvard University, Cambridge, MA 02138, USA; Center for Brains, Minds, and Machines, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Motional AD, Inc., Boston, MA 02210, USA.

Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Center for Brains, Minds, and Machines, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.

出版信息

Neuron. 2023 Apr 19;111(8):1331-1344.e8. doi: 10.1016/j.neuron.2023.01.023. Epub 2023 Mar 9.

Abstract

Humans learn internal models of the world that support planning and generalization in complex environments. Yet it remains unclear how such internal models are represented and learned in the brain. We approach this question using theory-based reinforcement learning, a strong form of model-based reinforcement learning in which the model is a kind of intuitive theory. We analyzed fMRI data from human participants learning to play Atari-style games. We found evidence of theory representations in prefrontal cortex and of theory updating in prefrontal cortex, occipital cortex, and fusiform gyrus. Theory updates coincided with transient strengthening of theory representations. Effective connectivity during theory updating suggests that information flows from prefrontal theory-coding regions to posterior theory-updating regions. Together, our results are consistent with a neural architecture in which top-down theory representations originating in prefrontal regions shape sensory predictions in visual areas, where factored theory prediction errors are computed and trigger bottom-up updates of the theory.

摘要

人类学习内部的世界模型,以支持在复杂环境中的规划和泛化。然而,目前尚不清楚大脑中是如何表示和学习这些内部模型的。我们使用基于理论的强化学习来解决这个问题,这是一种强有力的基于模型的强化学习形式,其中模型是一种直观的理论。我们分析了人类参与者学习玩 Atari 风格游戏的 fMRI 数据。我们在额皮质和枕叶皮质以及梭状回中发现了理论表示的证据,以及在额皮质、枕叶皮质和梭状回中发现了理论更新的证据。理论更新与理论表示的短暂增强同时发生。理论更新期间的有效连通性表明,信息从额皮质的理论编码区域流向后部的理论更新区域。总的来说,我们的结果与一种神经架构一致,即源自额皮质区域的自上而下的理论表示塑造了视觉区域的感觉预测,在这些区域中计算了因子化的理论预测误差,并触发了理论的自下而上更新。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb73/10200004/48d4313e5e64/nihms-1882586-f0001.jpg

相似文献

1
The neural architecture of theory-based reinforcement learning.基于理论的强化学习的神经结构。
Neuron. 2023 Apr 19;111(8):1331-1344.e8. doi: 10.1016/j.neuron.2023.01.023. Epub 2023 Mar 9.
2
Causal Inference Gates Corticostriatal Learning.因果推理门控皮质纹状体学习。
J Neurosci. 2021 Aug 11;41(32):6892-6904. doi: 10.1523/JNEUROSCI.2796-20.2021. Epub 2021 Jul 9.
3
The Neural Correlates of Hierarchical Predictions for Perceptual Decisions.层级预测对知觉决策的神经关联。
J Neurosci. 2018 May 23;38(21):5008-5021. doi: 10.1523/JNEUROSCI.2901-17.2018. Epub 2018 Apr 30.

本文引用的文献

1
Inductive biases in theory-based reinforcement learning.基于理论的强化学习中的归纳偏差。
Cogn Psychol. 2022 Nov;138:101509. doi: 10.1016/j.cogpsych.2022.101509. Epub 2022 Sep 21.
3
Causal Inference Gates Corticostriatal Learning.因果推理门控皮质纹状体学习。
J Neurosci. 2021 Aug 11;41(32):6892-6904. doi: 10.1523/JNEUROSCI.2796-20.2021. Epub 2021 Jul 9.
4
Is Activity Silent Working Memory Simply Episodic Memory?活动型工作记忆是否仅仅是情景记忆?
Trends Cogn Sci. 2021 Apr;25(4):284-293. doi: 10.1016/j.tics.2021.01.003. Epub 2021 Feb 4.
5
What Is the Model in Model-Based Planning?基于模型规划中的模型是什么?
Cogn Sci. 2021 Jan;45(1):e12928. doi: 10.1111/cogs.12928.
10
Automated anatomical labelling atlas 3.自动解剖学标注图谱 3.
Neuroimage. 2020 Feb 1;206:116189. doi: 10.1016/j.neuroimage.2019.116189. Epub 2019 Sep 12.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验