School of Engineering and Computer Science, Victoria University of Wellington, Wellington 6012, New Zealand.
School of Mathematics and Statistics, Victoria University of Wellington, Wellington 6012, New Zealand.
Proc Natl Acad Sci U S A. 2023 Jun 6;120(23):e2302107120. doi: 10.1073/pnas.2302107120. Epub 2023 May 30.
Helping strangers at a cost to oneself is a hallmark of many human interactions, but difficult to justify from the viewpoint of natural selection, particularly in anonymous one-shot interactions. Reputational scoring can provide the necessary motivation via "indirect reciprocity," but maintaining reliable scores requires close oversight to prevent cheating. We show that in the absence of such supervision, it is possible that scores might be managed by mutual consent between the agents themselves instead of by third parties. The space of possible strategies for such "consented" score changes is very large but, using a simple cooperation game, we search it, asking what kinds of agreement can i) invade a population from rare and ii) resist invasion once common. We prove mathematically and demonstrate computationally that score mediation by mutual consent does enable cooperation without oversight. Moreover, the most invasive and stable strategies belong to one family and ground the concept of value by incrementing one score at the cost of the other, thus closely resembling the token exchange that underlies money in everyday human transactions. The most successful strategy has the flavor of money except that agents without money can generate new score if they meet. This strategy is evolutionarily stable, and has higher fitness, but is not physically realizable in a decentralized way; when conservation of score is enforced more money-like strategies dominate. The equilibrium distribution of scores under any of this family of strategies is geometric, meaning that agents with score 0 are inherent to money-like strategies.
帮助陌生人而使自己付出代价是许多人类互动的特点,但从自然选择的角度来看,这很难解释,尤其是在匿名的一次性互动中。声誉评分可以通过“间接互惠”提供必要的动机,但要保持可靠的评分,需要密切监督以防止作弊。我们表明,在没有这种监督的情况下,评分可能是由代理人之间的相互同意来管理的,而不是由第三方来管理。这种“同意”评分变化的可能策略的空间非常大,但是,我们使用一个简单的合作游戏来搜索它,询问什么样的协议可以 i)从稀有中入侵种群,ii)一旦常见就抵抗入侵。我们从数学上证明并通过计算证明,相互同意的评分调解确实可以在没有监督的情况下促进合作。此外,最具侵略性和稳定性的策略属于一个家族,并且通过以牺牲另一个为代价来增加一个分数来为价值概念提供基础,因此与日常生活中人类交易中货币的基础代币交换非常相似。最成功的策略具有货币的味道,除了没有货币的代理人如果相遇可以生成新的分数。这种策略在进化上是稳定的,并且具有更高的适应性,但不能以分散的方式实现;当评分守恒被强制执行时,更类似于货币的策略占主导地位。在任何这类策略的得分均衡分布都是几何的,这意味着具有分数 0 的代理人是货币策略所固有的。