共同作用为巴甫洛夫式策略的进化成功提供了合理依据。

Co-action provides rational basis for the evolutionary success of Pavlovian strategies.

作者信息

Sasidevan V, Sinha Sitabhra

机构信息

The Institute of Mathematical Sciences, CIT Campus, Taramani, Chennai 600113, India.

出版信息

Sci Rep. 2016 Aug 1;6:30831. doi: 10.1038/srep30831.

DOI:10.1038/srep30831

PMID:27476604

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4967866/

Abstract

Strategies incorporating direct reciprocity, e.g., Tit-for-Tat and Pavlov, have been shown to be successful for playing the Iterated Prisoners Dilemma (IPD), a paradigmatic problem for studying the evolution of cooperation among non-kin individuals. However it is an open question whether such reciprocal strategies can emerge as the rational outcome of repeated interactions between selfish agents. Here we show that adopting a co-action perspective, which takes into account the symmetry between agents - a relevant consideration in biological and social contexts - naturally leads to such a strategy. For a 2-player IPD, we show that the co-action solution corresponds to the Pavlov strategy, thereby providing a rational basis for it. For an IPD involving many players, an instance of the Public Goods game where cooperation is generally considered to be harder to achieve, we show that the cooperators always outnumber defectors in the co-action equilibrium. This can be seen as a generalization of Pavlov to contests involving many players. In general, repeated interactions allow rational agents to become aware of the inherent symmetry of their situation, enabling them to achieve robust cooperation through co-action strategies - which, in the case of IPD, is a reciprocal Pavlovian one.

摘要

包含直接互惠的策略，例如“以牙还牙”策略和“巴甫洛夫”策略，已被证明在玩重复囚徒困境（IPD）时是成功的，IPD是研究非亲属个体间合作进化的一个典型问题。然而，这样的互惠策略是否能作为自私个体间重复互动的理性结果而出现，这仍是一个悬而未决的问题。在此我们表明，采用一种共同行动视角，即考虑到个体间的对称性——这在生物学和社会背景中是一个相关考量因素——自然会导致这样一种策略。对于两人的IPD，我们表明共同行动解决方案对应于“巴甫洛夫”策略，从而为其提供了一个理性基础。对于涉及多个参与者的IPD，即公共物品博弈的一个实例，在这种情况下合作通常被认为更难实现，我们表明在共同行动均衡中，合作者的数量总是超过背叛者。这可以看作是“巴甫洛夫”策略在涉及多个参与者的竞争中的一种推广。一般来说，重复互动使理性个体能够意识到其处境的内在对称性，使他们能够通过共同行动策略实现稳健的合作——在IPD的情况下，这是一种互惠的“巴甫洛夫式”策略。