Nishimura K, Stephens D W
Nebraska Behavioural Biology Group, School of Biological Sciences, University of Nebraska, Lincoln, NE 68588, USA.
J Theor Biol. 1997 Sep 7;188(1):1-10. doi: 10.1006/jtbi.1997.0439.
The iterated Prisoner's Dilemma (IPD) is usually analysed by evaluating arithmetic mean pay-offs in an ESS analysis. We consider several points that the standard argument does not address. Finite population size and finite numbers of matches in the IPD game lead us to consider both pay-off variance and the sampling process in the evolutionary game. We provide a general form for the pay-off variance of the Markov strategist in the IPD game, and present a general analysis of the initial invasion process of an "all defection strategist" (ALLD) into a "tit-for-tat" (TFT) strategist population by considering stochastic processes. Finite population size, strategic error an the variances of pay-offs alter the prediction concerning the initial invasion of ALLD compared with the standard Evolutionarily Stable Strategy (ESS) analysis. Even though TFT gets the larger arithmetic mean, the variance of its pay-off is also larger when the expected iterations of the game are sufficiently large. Therefore, the boundary of the parameter of the probability of game continuity, w, above which ALLD does not have advantage to invade into the TFT population, becomes a bit larger than predicted by the deterministic model.
重复囚徒困境(IPD)通常在进化稳定策略(ESS)分析中通过评估算术平均收益来进行分析。我们考虑了一些标准论证未涉及的要点。IPD博弈中有限的种群规模和有限的匹配次数促使我们在进化博弈中同时考虑收益方差和抽样过程。我们给出了IPD博弈中马尔可夫策略者收益方差的一般形式,并通过考虑随机过程,对“总是背叛策略者”(ALLD)侵入“针锋相对”(TFT)策略者群体的初始入侵过程进行了一般性分析。与标准的进化稳定策略(ESS)分析相比,有限的种群规模、策略误差和收益方差改变了关于ALLD初始入侵的预测。即使TFT获得了更大的算术平均收益,但当博弈的预期迭代次数足够大时,其收益方差也更大。因此,博弈连续性概率参数w的边界,即在此边界之上ALLD侵入TFT群体没有优势,变得比确定性模型预测的略大一些。