Niv Yael, Daniel Reka, Geana Andra, Gershman Samuel J, Leong Yuan Chang, Radulescu Angela, Wilson Robert C
Department of Psychology and Neuroscience Institute, Princeton University, Princeton, New Jersey 08540,
Department of Psychology and Neuroscience Institute, Princeton University, Princeton, New Jersey 08540.
J Neurosci. 2015 May 27;35(21):8145-57. doi: 10.1523/JNEUROSCI.2978-14.2015.
In recent years, ideas from the computational field of reinforcement learning have revolutionized the study of learning in the brain, famously providing new, precise theories of how dopamine affects learning in the basal ganglia. However, reinforcement learning algorithms are notorious for not scaling well to multidimensional environments, as is required for real-world learning. We hypothesized that the brain naturally reduces the dimensionality of real-world problems to only those dimensions that are relevant to predicting reward, and conducted an experiment to assess by what algorithms and with what neural mechanisms this "representation learning" process is realized in humans. Our results suggest that a bilateral attentional control network comprising the intraparietal sulcus, precuneus, and dorsolateral prefrontal cortex is involved in selecting what dimensions are relevant to the task at hand, effectively updating the task representation through trial and error. In this way, cortical attention mechanisms interact with learning in the basal ganglia to solve the "curse of dimensionality" in reinforcement learning.
近年来,强化学习这一计算领域的理念彻底改变了大脑学习的研究,尤其为多巴胺如何影响基底神经节中的学习提供了全新且精确的理论。然而,强化学习算法因无法很好地扩展到多维环境而声名狼藉,而现实世界中的学习恰恰需要这种多维环境。我们推测,大脑会自然地将现实世界问题的维度降低到仅与预测奖励相关的那些维度,并进行了一项实验,以评估人类通过何种算法以及何种神经机制来实现这种“表征学习”过程。我们的研究结果表明,一个由顶内沟、楔前叶和背外侧前额叶皮层组成的双侧注意力控制网络参与选择与手头任务相关的维度,通过反复试验有效地更新任务表征。通过这种方式,皮层注意力机制与基底神经节中的学习相互作用,以解决强化学习中的“维度诅咒”问题。