Grupo Interdisciplinar de Sistemas Complejos (GISC), Departamento de Matemáticas, Universidad Carlos III de Madrid, , 28911 Leganés, Madrid, Spain.
J R Soc Interface. 2014 Feb 19;11(94):20131186. doi: 10.1098/rsif.2013.1186. Print 2014 May 6.
Cooperative behaviour lies at the very basis of human societies, yet its evolutionary origin remains a key unsolved puzzle. Whereas reciprocity or conditional cooperation is one of the most prominent mechanisms proposed to explain the emergence of cooperation in social dilemmas, recent experimental findings on networked Prisoner's Dilemma games suggest that conditional cooperation also depends on the previous action of the player-namely on the 'mood' in which the player is currently in. Roughly, a majority of people behave as conditional cooperators if they cooperated in the past, whereas they ignore the context and free ride with high probability if they did not. However, the ultimate origin of this behaviour represents a conundrum itself. Here, we aim specifically to provide an evolutionary explanation of moody conditional cooperation (MCC). To this end, we perform an extensive analysis of different evolutionary dynamics for players' behavioural traits-ranging from standard processes used in game theory based on pay-off comparison to others that include non-economic or social factors. Our results show that only a dynamic built upon reinforcement learning is able to give rise to evolutionarily stable MCC, and at the end to reproduce the human behaviours observed in the experiments.
合作行为是人类社会的基础,但它的进化起源仍然是一个关键的未解决的难题。虽然互惠或条件合作是解释社会困境中合作出现的最突出机制之一,但最近关于网络囚徒困境游戏的实验发现表明,条件合作也取决于玩家的先前行为,即玩家当前的“情绪”。大致来说,如果一个人过去合作过,那么他们很可能会表现出条件合作,而如果他们没有合作过,那么他们很可能会忽略情境并搭便车。然而,这种行为的最终起源本身就是一个谜。在这里,我们的目的是专门为情绪化的条件合作(MCC)提供一个进化解释。为此,我们对玩家行为特征的不同进化动力学进行了广泛的分析,范围从基于收益比较的博弈论标准过程到包括非经济或社会因素的其他过程。我们的结果表明,只有基于强化学习的动态才能产生进化稳定的 MCC,并最终再现实验中观察到的人类行为。