Sanders James B T, Farmer J Doyne, Galla Tobias
Theoretical Physics, School of Physics and Astronomy, The University of Manchester, Manchester, M13 9PL, United Kingdom.
Institute for New Economic Thinking at the Oxford Martin School, University of Oxford, Oxford, OX2 6ED, UK.
Sci Rep. 2018 Mar 20;8(1):4902. doi: 10.1038/s41598-018-22013-5.
We study adaptive learning in a typical p-player game. The payoffs of the games are randomly generated and then held fixed. The strategies of the players evolve through time as the players learn. The trajectories in the strategy space display a range of qualitatively different behaviours, with attractors that include unique fixed points, multiple fixed points, limit cycles and chaos. In the limit where the game is complicated, in the sense that the players can take many possible actions, we use a generating-functional approach to establish the parameter range in which learning dynamics converge to a stable fixed point. The size of this region goes to zero as the number of players goes to infinity, suggesting that complex non-equilibrium behaviour, exemplified by chaos, is the norm for complicated games with many players.
我们研究典型的p人博弈中的适应性学习。博弈的收益是随机生成的,然后保持不变。随着参与者学习,他们的策略随时间演变。策略空间中的轨迹表现出一系列性质不同的行为,吸引子包括唯一不动点、多个不动点、极限环和混沌。在博弈复杂的极限情况下,即参与者可以采取许多可能行动的情况下,我们使用生成泛函方法来确定学习动态收敛到稳定不动点的参数范围。当参与者数量趋于无穷大时,该区域的大小趋于零,这表明以混沌为代表的复杂非平衡行为是具有许多参与者的复杂博弈的常态。