Institute for Theoretical Physics, University of Innsbruck, Innsbruck, Austria.
Fachbereich Philosophie, Universität Konstanz, Konstanz, Germany.
PLoS One. 2020 Dec 18;15(12):e0243628. doi: 10.1371/journal.pone.0243628. eCollection 2020.
Collective behavior, and swarm formation in particular, has been studied from several perspectives within a large variety of fields, ranging from biology to physics. In this work, we apply Projective Simulation to model each individual as an artificial learning agent that interacts with its neighbors and surroundings in order to make decisions and learn from them. Within a reinforcement learning framework, we discuss one-dimensional learning scenarios where agents need to get to food resources to be rewarded. We observe how different types of collective motion emerge depending on the distance the agents need to travel to reach the resources. For instance, strongly aligned swarms emerge when the food source is placed far away from the region where agents are situated initially. In addition, we study the properties of the individual trajectories that occur within the different types of emergent collective dynamics. Agents trained to find distant resources exhibit individual trajectories that are in most cases best fit by composite correlated random walks with features that resemble Lévy walks. This composite motion emerges from the collective behavior developed under the specific foraging selection pressures. On the other hand, agents trained to reach nearby resources predominantly exhibit Brownian trajectories.
群体行为,特别是蜂群形成,已经在多个领域从多个角度进行了研究,从生物学到物理学都有涉及。在这项工作中,我们应用投影模拟来模拟每个个体作为一个人工学习代理,与邻居和周围环境进行交互,以便做出决策并从中学习。在强化学习框架内,我们讨论了一维学习场景,其中代理需要获取食物资源以获得奖励。我们观察到不同类型的集体运动是如何根据代理到达资源所需的距离而出现的。例如,当食物源放置在远离代理最初所在区域很远的地方时,就会出现强烈对齐的蜂群。此外,我们研究了不同类型涌现的集体动力学中个体轨迹的特性。为寻找远距离资源而训练的代理表现出的个体轨迹,在大多数情况下,最适合由具有类似于 Lévy 游走特征的复合相关随机游走来拟合。这种复合运动是在特定的觅食选择压力下发展起来的集体行为产生的。另一方面,为到达附近资源而训练的代理主要表现出布朗运动轨迹。