Nax Heinrich H, Perc Matjaž
Department of Social Sciences, ETH Zürich, Clausiusstrasse 37-C3, 8092 Zurich, Switzerland.
1] Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia [2] Faculty of Natural Sciences and Mathematics, University of Maribor, Koroška cesta 160, SI-2000 Maribor, Slovenia.
Sci Rep. 2015 Jan 26;5:8010. doi: 10.1038/srep08010.
We consider an environment where players are involved in a public goods game and must decide repeatedly whether to make an individual contribution or not. However, players lack strategically relevant information about the game and about the other players in the population. The resulting behavior of players is completely uncoupled from such information, and the individual strategy adjustment dynamics are driven only by reinforcement feedbacks from each player's own past. We show that the resulting "directional learning" is sufficient to explain cooperative deviations away from the Nash equilibrium. We introduce the concept of k-strong equilibria, which nest both the Nash equilibrium and the Aumann-strong equilibrium as two special cases, and we show that, together with the parameters of the learning model, the maximal k-strength of equilibrium determines the stationary distribution. The provisioning of public goods can be secured even under adverse conditions, as long as players are sufficiently responsive to the changes in their own payoffs and adjust their actions accordingly. Substantial levels of public cooperation can thus be explained without arguments involving selflessness or social preferences, solely on the basis of uncoordinated directional (mis)learning.
参与者参与公共物品博弈,且必须反复决定是否做出个人贡献。然而,参与者缺乏关于该博弈以及群体中其他参与者的具有战略相关性的信息。参与者的最终行为与此类信息完全脱钩,个体策略调整动态仅由每个参与者自身过去的强化反馈驱动。我们表明,由此产生的“定向学习”足以解释偏离纳什均衡的合作偏差。我们引入了k强均衡的概念,它将纳什均衡和奥曼强均衡作为两种特殊情况包含在内,并且我们表明,与学习模型的参数一起,均衡的最大k强度决定了平稳分布。只要参与者对自身收益的变化有足够的反应并相应地调整其行动,即使在不利条件下也能确保公共物品的供应。因此,仅基于不协调的定向(错误)学习,就可以在不涉及无私或社会偏好的情况下解释相当程度的公共合作。