Suppr超能文献

多维环境中的强化学习依赖于注意力机制。

Reinforcement learning in multidimensional environments relies on attention mechanisms.

作者信息

Niv Yael, Daniel Reka, Geana Andra, Gershman Samuel J, Leong Yuan Chang, Radulescu Angela, Wilson Robert C

机构信息

Department of Psychology and Neuroscience Institute, Princeton University, Princeton, New Jersey 08540,

Department of Psychology and Neuroscience Institute, Princeton University, Princeton, New Jersey 08540.

出版信息

J Neurosci. 2015 May 27;35(21):8145-57. doi: 10.1523/JNEUROSCI.2978-14.2015.

Abstract

In recent years, ideas from the computational field of reinforcement learning have revolutionized the study of learning in the brain, famously providing new, precise theories of how dopamine affects learning in the basal ganglia. However, reinforcement learning algorithms are notorious for not scaling well to multidimensional environments, as is required for real-world learning. We hypothesized that the brain naturally reduces the dimensionality of real-world problems to only those dimensions that are relevant to predicting reward, and conducted an experiment to assess by what algorithms and with what neural mechanisms this "representation learning" process is realized in humans. Our results suggest that a bilateral attentional control network comprising the intraparietal sulcus, precuneus, and dorsolateral prefrontal cortex is involved in selecting what dimensions are relevant to the task at hand, effectively updating the task representation through trial and error. In this way, cortical attention mechanisms interact with learning in the basal ganglia to solve the "curse of dimensionality" in reinforcement learning.

摘要

近年来,强化学习这一计算领域的理念彻底改变了大脑学习的研究,尤其为多巴胺如何影响基底神经节中的学习提供了全新且精确的理论。然而,强化学习算法因无法很好地扩展到多维环境而声名狼藉,而现实世界中的学习恰恰需要这种多维环境。我们推测,大脑会自然地将现实世界问题的维度降低到仅与预测奖励相关的那些维度,并进行了一项实验,以评估人类通过何种算法以及何种神经机制来实现这种“表征学习”过程。我们的研究结果表明,一个由顶内沟、楔前叶和背外侧前额叶皮层组成的双侧注意力控制网络参与选择与手头任务相关的维度,通过反复试验有效地更新任务表征。通过这种方式,皮层注意力机制与基底神经节中的学习相互作用,以解决强化学习中的“维度诅咒”问题。

相似文献

3
Neural Mechanisms for Undoing the "Curse of Dimensionality".消除“维度灾难”的神经机制
J Neurosci. 2015 Sep 2;35(35):12083-4. doi: 10.1523/JNEUROSCI.2428-15.2015.

引用本文的文献

本文引用的文献

2
Neural mechanisms of object-based attention.基于对象的注意力的神经机制。
Science. 2014 Apr 25;344(6182):424-7. doi: 10.1126/science.1247003. Epub 2014 Apr 10.
4
Learning to classify integral-dimension stimuli.学习分类整维度刺激。
Psychon Bull Rev. 1996 Jun;3(2):222-6. doi: 10.3758/BF03212422.
5
Decoding the brain's algorithm for categorization from its neural implementation.从神经实现中解码大脑的分类算法。
Curr Biol. 2013 Oct 21;23(20):2023-7. doi: 10.1016/j.cub.2013.08.035. Epub 2013 Oct 3.
8
Inferring relevance in a changing world.推断变化世界中的相关性。
Front Hum Neurosci. 2012 Jan 24;5:189. doi: 10.3389/fnhum.2011.00189. eCollection 2011.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验