Chersi Fabian, Pezzulo Giovanni
Institute of Cognitive Sciences and Technologies, CNR, Via S. Martino della Battaglia 44, 00185, Rome, Italy.
Cogn Process. 2012 Aug;13 Suppl 1:S125-9. doi: 10.1007/s10339-012-0475-7.
The hippocampus plays a central role in spatial representation, declarative and episodic memory. In this area, so-called place cells possess high spatial selectivity, firing preferentially when the individual is within a small area of the environment. Interestingly, it has been found in rats that these cells can be active also when the animal is outside the location or context of their corresponding place field producing so-called "forward sweeps". These typically occur at decision points during task execution and seem to be utilized, among other things, for the evaluation of potential alternative paths. Anticipatory firing is also found in the ventral striatum, a brain area that is strongly interconnected with the hippocampus and is known to encode value and reward. In this paper, we describe a biologically based computational model of the hippocampal-ventral striatum circuit that implements a goal-directed mechanism of choice, with the hippocampus primarily involved in the mental simulation of possible navigation paths and the ventral striatum involved in the evaluation of the associated reward expectancies. The model is validated in a navigation task in which a rat is placed in a complex maze with multiple rewarding sites. We show that the rat mentally activates place cells to simulate paths, estimate their value, and make decisions, implementing two essential processes of model-based reinforcement learning algorithms of choice: look-ahead prediction and the evaluation of predicted states.
海马体在空间表征、陈述性记忆和情景记忆中起着核心作用。在这个区域,所谓的位置细胞具有很高的空间选择性,当个体处于环境中的一个小区域内时优先放电。有趣的是,在大鼠中发现,当动物处于其相应位置场的位置或背景之外时,这些细胞也可以活跃,产生所谓的“前向扫描”。这些情况通常发生在任务执行过程中的决策点,并且似乎除其他外还被用于评估潜在的替代路径。在腹侧纹状体中也发现了预期放电,腹侧纹状体是一个与海马体紧密相连的脑区,已知其编码价值和奖励。在本文中,我们描述了一种基于生物学的海马体-腹侧纹状体回路计算模型,该模型实现了一种目标导向的选择机制,其中海马体主要参与对可能的导航路径的心理模拟,而腹侧纹状体参与对相关奖励预期的评估。该模型在一个导航任务中得到验证,在该任务中,一只大鼠被放置在一个有多个奖励地点的复杂迷宫中。我们表明,大鼠在心理上激活位置细胞以模拟路径、估计其价值并做出决策,实现了基于模型的强化学习选择算法的两个基本过程:前瞻性预测和对预测状态的评估。