Pereira Marcelo Alves, Martinez Alexandre Souto
Universidade de São Paulo - Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Av. Bandeirantes, 3900, 14040-901 Ribeirão Preto, São Paulo, Brazil.
J Theor Biol. 2010 Aug 7;265(3):346-58. doi: 10.1016/j.jtbi.2010.05.022. Epub 2010 May 21.
The Prisoner's Dilemma (PD) game is applied in several research fields due to the emergence of cooperation among selfish players. In this work the PD is studied in a one-dimensional lattice, where each cell represents a player, which in turn can interact with the neighbors playing the PD (cooperate or defect). The update of states adopts the Pavlovian Evolutionary Strategy (PES) or Darwinian Evolutionary Strategy (DES). Adopting PES, if a player receives a positive payoff greater than his/her aspiration level, he/she keeps the current state, and switches otherwise. Adopting DES, player compares his/her payoff with payoff of opponents. If it is not the highest, he/she copies the state of fittest player, switching the state if it is different of his/her current state. The critical temptation values obtained analytically are reported, and the cluster patterns that emerge from the interactions among the players are shown. Also we defined analytical functions that calculate the maximum/minimum size of defective/cooperative clusters. Also, the parameter space is explored with exhaustive computational simulations, which confirm the analytical results and reinforce that Pavlovian strategy foments cooperation among players. In steady state, system can reach the cooperative or quasi-regular phases, when adopting the PES, and cooperative, defective or chaotic phases, adopting the DES. The new quasi-regular phase occurs when several players switch their states in each round, but the proportion of cooperators does not show significant variation. Additionally, the present work shows that the lowest temptation level (T=1) may be considered a trivial case only for the particular case where the players interact with only one neighbor, otherwise system presents the same features that for higher temptation values.
由于自私的参与者之间出现了合作,囚徒困境(PD)博弈被应用于多个研究领域。在这项工作中,我们在一维晶格中研究了PD博弈,其中每个单元格代表一个参与者,该参与者又可以与玩PD博弈的邻居(合作或背叛)进行互动。状态更新采用巴甫洛夫进化策略(PES)或达尔文进化策略(DES)。采用PES时,如果一个参与者获得的正收益大于其期望水平,他/她将保持当前状态,否则进行切换。采用DES时,参与者将自己的收益与对手的收益进行比较。如果不是最高的,他/她会复制最适应的参与者的状态,如果与他/她当前的状态不同则进行切换。报告了通过分析获得的临界诱惑值,并展示了参与者之间互动产生的聚类模式。我们还定义了计算缺陷/合作聚类的最大/最小规模的分析函数。此外,通过详尽的计算模拟探索了参数空间,这证实了分析结果,并进一步证明巴甫洛夫策略促进了参与者之间的合作。在稳态下,采用PES时系统可以达到合作或准规则阶段,采用DES时系统可以达到合作、缺陷或混沌阶段。新的准规则阶段出现在每个回合中有几个参与者切换其状态,但合作者的比例没有显著变化时。此外,目前的工作表明,只有在参与者仅与一个邻居互动的特定情况下,最低诱惑水平(T = 1)才可能被视为一个平凡的情况,否则系统呈现出与较高诱惑值相同的特征。