使用强化学习分析公共物品博弈：群体规模增加对合作的影响。

Analysing public goods games using reinforcement learning: effect of increasing group size on cooperation.

作者信息

Tamura Kazuhiro, Morita Satoru

机构信息

Department of Environment and Energy Systems, Graduate School of Science and Technology, Shizuoka University, Hamamatsu 432-8561, Japan.

Department of Mathematical and Systems Engineering, Shizuoka University, Hamamatsu 432-8561, Japan.

出版信息

R Soc Open Sci. 2024 Dec 11;11(12):241195. doi: 10.1098/rsos.241195. eCollection 2024 Dec.

DOI:10.1098/rsos.241195

PMID:39665088

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11631413/

Abstract

Electricity competition, restrictions on carbon dioxide (CO2) emissions and arm races between nations are examples of social dilemmas within human society. In the presence of social dilemmas, rational choice in game theory leads to the avoidance of cooperative behaviour owing to its cost. However, in experiments using public goods games that simulate social dilemmas, humans have often exhibited cooperative behaviour that deviates from individual rationality. Despite extensive research, the alignment between human cooperative behaviour and game theory predictions remains inconsistent. This study proposes an alternative approach to solve this problem. We used Q-learning, a form of artificial intelligence that mimics decision-making processes of humans who do not possess the rationality assumed in game theory. This study explores the potential for cooperation by varying the number of participants in public goods games using deep Q-learning. The simulations demonstrate that agents with Q-learning can acquire cooperative behaviour similar to that of humans. Moreover, we found that cooperation is more likely to occur as the group size increases. These results support and reinforce existing experiments involving humans. In addition, they have potential applications for creating cooperation without sanctions.

摘要

电力竞争、二氧化碳（CO₂）排放限制以及国家间的军备竞赛都是人类社会中社会困境的例子。在社会困境存在的情况下，博弈论中的理性选择会导致由于合作行为的成本而避免合作。然而，在使用模拟社会困境的公共物品博弈的实验中，人类常常表现出偏离个体理性的合作行为。尽管进行了广泛的研究，但人类合作行为与博弈论预测之间的一致性仍然不一致。本研究提出了一种解决这个问题的替代方法。我们使用了Q学习，这是一种人工智能形式，它模仿了不具备博弈论中所假设的理性的人类的决策过程。本研究通过使用深度Q学习改变公共物品博弈中的参与者数量来探索合作的潜力。模拟结果表明，具有Q学习能力的智能体可以获得与人类相似的合作行为。此外，我们发现随着群体规模的增加，合作更有可能发生。这些结果支持并强化了现有的涉及人类的实验。此外，它们在无需制裁就能创造合作方面具有潜在应用价值。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

使用强化学习分析公共物品博弈：群体规模增加对合作的影响。

Analysing public goods games using reinforcement learning: effect of increasing group size on cooperation.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

使用强化学习分析公共物品博弈：群体规模增加对合作的影响。

Analysing public goods games using reinforcement learning: effect of increasing group size on cooperation.

作者信息

机构信息

出版信息

相似文献

本文引用的文献