Mannella Francesco, Baldassarre Gianluca
Laboratory of Autonomous Robotics and Artificial Life, Istituto di Scienze e Tecnologie della Cognizione, Consiglio Nazionale delle Ricerche (LARAL-ISTC-CNR), Via San Martino della Battaglia 44, 00185 Roma, Italy.
Philos Trans R Soc Lond B Biol Sci. 2007 Mar 29;362(1479):383-401. doi: 10.1098/rstb.2006.1966.
Previous experiments have shown that when domestic chicks (Gallus gallus) are first trained to locate food elements hidden at the centre of a closed square arena and then are tested in a square arena of double the size, they search for food both at its centre and at a distance from walls similar to the distance of the centre from the walls experienced during training. This paper presents a computational model that successfully reproduces these behaviours. The model is based on a neural-network implementation of the reinforcement-learning actor - critic architecture (in this architecture the 'critic' learns to evaluate perceived states in terms of predicted future rewards, while the 'actor' learns to increase the probability of selecting the actions that lead to higher evaluations). The analysis of the model suggests which type of information and cognitive mechanisms might underlie chicks' behaviours: (i) the tendency to explore the area at a specific distance from walls might be based on the processing of the height of walls' horizontal edges, (ii) the capacity to generalize the search at the centre of square arenas independently of their size might be based on the processing of the relative position of walls' vertical edges on the horizontal plane (equalization of walls' width), and (iii) the whole behaviour exhibited in the large square arena can be reproduced by assuming the existence of an attention process that, at each time, focuses chicks' internal processing on either one of the two previously discussed information sources. The model also produces testable predictions regarding the generalization capabilities that real chicks should exhibit if trained in circular arenas of varying size. The paper also highlights the potentialities of the model to address other experiments on animals' navigation and analyses its strengths and weaknesses in comparison to other models.
先前的实验表明,当家鸡(原鸡)首先被训练去寻找隐藏在封闭方形场地中心的食物元素,然后在两倍大小的方形场地中进行测试时,它们会在场地中心以及距离墙壁一定距离处寻找食物,这个距离与训练期间场地中心到墙壁的距离相似。本文提出了一个计算模型,该模型成功地再现了这些行为。该模型基于强化学习行动者-评判者架构的神经网络实现(在这种架构中,“评判者”学会根据预测的未来奖励来评估感知到的状态,而“行动者”学会增加选择导致更高评估的行动的概率)。对该模型的分析表明,哪种类型的信息和认知机制可能是雏鸡行为的基础:(i)在距离墙壁特定距离处探索区域的倾向可能基于对墙壁水平边缘高度的处理,(ii)在方形场地中心进行搜索的能力,而不考虑场地大小,可能基于对墙壁垂直边缘在水平面上相对位置的处理(墙壁宽度的均等化),并且(iii)通过假设存在一个注意力过程,在每次将雏鸡的内部处理集中在先前讨论的两个信息源之一上,可以再现雏鸡在大方形场地中表现出的整体行为。该模型还产生了关于实际雏鸡如果在不同大小的圆形场地中训练应表现出的泛化能力的可测试预测。本文还强调了该模型在解决关于动物导航的其他实验方面的潜力,并分析了其与其他模型相比的优势和劣势。