Aguilera Miguel, Bedia Manuel G
Information and Autonomous Systems-Research Center for Life, Mind, and Society, University of the Basque Country, Donostia, Spain.
Department of Computer Science and Systems Engineering, University of Zaragoza, Zaragoza, Spain.
Front Neurorobot. 2018 Oct 2;12:55. doi: 10.3389/fnbot.2018.00055. eCollection 2018.
The activity of many biological and cognitive systems is not poised deep within a specific regime of activity. Instead, they operate near points of critical behavior located at the boundary between different phases. Certain authors link some of the properties of criticality with the ability of living systems to generate autonomous or intrinsically generated behavior. However, these claims remain highly speculative. In this paper, we intend to explore the connection between criticality and autonomous behavior through conceptual models that show how embodied agents may adapt themselves toward critical points. We propose to exploit maximum entropy models and their formal descriptions of indicators of criticality to present a learning model that drives generic agents toward critical points. Specifically, we derive such a learning model in an embodied Boltzmann machine by implementing a gradient ascent rule that maximizes the heat capacity of the controller in order to make the network maximally sensitive to external perturbations. We test and corroborate the model by implementing an embodied agent in the Mountain Car benchmark test, which is controlled by a Boltzmann machine that adjusts its weights according to the model. We find that the neural controller reaches an apparent point of criticality, which coincides with a transition point of the behavior of the agent between two regimes of behavior, maximizing the synergistic information between its sensors and the combination of hidden and motor neurons. Finally, we discuss the potential of our learning model to answer questions about the connection between criticality and the capabilities of living systems to autonomously generate intrinsic constraints on their behavior. We suggest that these "critical agents" are able to acquire flexible behavioral patterns that are useful for the development of successful strategies in different contexts.
许多生物和认知系统的活动并非处于特定活动状态的深处。相反,它们在位于不同阶段边界的临界行为点附近运行。某些作者将临界性的一些特性与生命系统产生自主或内在产生行为的能力联系起来。然而,这些说法仍然极具推测性。在本文中,我们打算通过概念模型来探索临界性与自主行为之间的联系,这些模型展示了具身智能体如何朝着临界点进行自我调整。我们建议利用最大熵模型及其对临界性指标的形式化描述,来呈现一个将通用智能体驱动至临界点的学习模型。具体而言,我们在具身玻尔兹曼机中通过实施梯度上升规则来推导这样一个学习模型,该规则使控制器的热容量最大化,以便使网络对外部扰动具有最大敏感性。我们通过在山地车基准测试中实现一个具身智能体来测试和验证该模型,该智能体由一个根据模型调整其权重的玻尔兹曼机控制。我们发现神经控制器达到了一个明显的临界状态点,这与智能体行为在两种行为模式之间的转变点相吻合,从而使传感器与隐藏神经元和运动神经元组合之间的协同信息最大化。最后,我们讨论了我们的学习模型在回答关于临界性与生命系统自主产生行为内在约束能力之间联系的问题方面的潜力。我们认为这些“临界智能体”能够获得灵活的行为模式,这对于在不同情境中制定成功策略很有用。