Dayan Peter
Gatsby Computational Neuroscience Unit, UCL London, UK.
Front Neurosci. 2008 Dec 15;2(2):255-63. doi: 10.3389/neuro.01.031.2008. eCollection 2008 Dec.
Complex cognitive tasks present a range of computational and algorithmic challenges for neural accounts of both learning and inference. In particular, it is extremely hard to solve them using the sort of simple policies that have been extensively studied as solutions to elementary Markov decision problems. There has thus been recent interest in architectures for the instantiation and even learning of policies that are formally more complicated than these, involving operations such as gated working memory. However, the focus of these ideas and methods has largely been on what might best be considered as automatized, routine or, in the sense of animal conditioning, habitual, performance. Thus, they have yet to provide a route towards understanding the workings of rule-based control, which is critical for cognitively sophisticated competence. Here, we review a recent suggestion for a uniform architecture for habitual and rule-based execution, discuss some of the habitual mechanisms that underpin the use of rules, and consider a statistical relationship between rules and habits.
复杂认知任务对学习和推理的神经模型提出了一系列计算和算法挑战。特别是,使用那些作为基本马尔可夫决策问题解决方案而被广泛研究的简单策略来解决这些任务极其困难。因此,最近人们对那些形式上比这些策略更复杂的策略的实例化甚至学习架构产生了兴趣,这些策略涉及诸如门控工作记忆等操作。然而,这些思想和方法的重点主要在于那些最适合被视为自动化、常规的,或者从动物条件作用的角度来看是习惯性的表现。因此,它们尚未提供一条理解基于规则控制运作的途径,而基于规则的控制对于认知复杂能力至关重要。在这里,我们回顾了最近关于习惯性和基于规则执行的统一架构的建议,讨论了一些支撑规则使用的习惯性机制,并考虑了规则与习惯之间的统计关系。