State Key Laboratory for Turbulence and Complex Systems, College of Engineering, Peking University, Beijing, China.
PLoS One. 2011;6(7):e21787. doi: 10.1371/journal.pone.0021787. Epub 2011 Jul 7.
We study the evolution of cooperation among selfish individuals in the stochastic strategy spatial prisoner's dilemma game. We equip players with the particle swarm optimization technique, and find that it may lead to highly cooperative states even if the temptations to defect are strong. The concept of particle swarm optimization was originally introduced within a simple model of social dynamics that can describe the formation of a swarm, i.e., analogous to a swarm of bees searching for a food source. Essentially, particle swarm optimization foresees changes in the velocity profile of each player, such that the best locations are targeted and eventually occupied. In our case, each player keeps track of the highest payoff attained within a local topological neighborhood and its individual highest payoff. Thus, players make use of their own memory that keeps score of the most profitable strategy in previous actions, as well as use of the knowledge gained by the swarm as a whole, to find the best available strategy for themselves and the society. Following extensive simulations of this setup, we find a significant increase in the level of cooperation for a wide range of parameters, and also a full resolution of the prisoner's dilemma. We also demonstrate extreme efficiency of the optimization algorithm when dealing with environments that strongly favor the proliferation of defection, which in turn suggests that swarming could be an important phenomenon by means of which cooperation can be sustained even under highly unfavorable conditions. We thus present an alternative way of understanding the evolution of cooperative behavior and its ubiquitous presence in nature, and we hope that this study will be inspirational for future efforts aimed in this direction.
我们研究了自私个体在随机策略空间囚徒困境博弈中的合作演化。我们为参与者配备了粒子群优化技术,发现即使背叛的诱惑很强,它也可能导致高度合作的状态。粒子群优化的概念最初是在一个简单的社会动态模型中引入的,该模型可以描述群体的形成,即类似于蜜蜂群寻找食物源。本质上,粒子群优化预见了每个参与者的速度分布的变化,以便瞄准和最终占领最佳位置。在我们的情况下,每个参与者都跟踪在局部拓扑邻域内获得的最高回报及其个人最高回报。因此,参与者利用自己的记忆,记录之前行动中最有利可图的策略,以及利用整个群体获得的知识,为自己和社会找到最佳可用策略。在对这种设置进行广泛模拟后,我们发现合作水平在广泛的参数范围内显著提高,并且完全解决了囚徒困境。我们还展示了优化算法在处理强烈有利于背叛扩散的环境时的极端效率,这反过来表明,群体行为可能是一种重要的现象,通过这种现象,即使在非常不利的条件下,合作也可以得到维持。因此,我们提出了一种理解合作行为演化及其在自然界中普遍存在的替代方法,我们希望这项研究将为未来朝着这个方向努力提供灵感。