• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用强化学习分析公共物品博弈:群体规模增加对合作的影响。

Analysing public goods games using reinforcement learning: effect of increasing group size on cooperation.

作者信息

Tamura Kazuhiro, Morita Satoru

机构信息

Department of Environment and Energy Systems, Graduate School of Science and Technology, Shizuoka University, Hamamatsu 432-8561, Japan.

Department of Mathematical and Systems Engineering, Shizuoka University, Hamamatsu 432-8561, Japan.

出版信息

R Soc Open Sci. 2024 Dec 11;11(12):241195. doi: 10.1098/rsos.241195. eCollection 2024 Dec.

DOI:10.1098/rsos.241195
PMID:39665088
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11631413/
Abstract

Electricity competition, restrictions on carbon dioxide (CO2) emissions and arm races between nations are examples of social dilemmas within human society. In the presence of social dilemmas, rational choice in game theory leads to the avoidance of cooperative behaviour owing to its cost. However, in experiments using public goods games that simulate social dilemmas, humans have often exhibited cooperative behaviour that deviates from individual rationality. Despite extensive research, the alignment between human cooperative behaviour and game theory predictions remains inconsistent. This study proposes an alternative approach to solve this problem. We used Q-learning, a form of artificial intelligence that mimics decision-making processes of humans who do not possess the rationality assumed in game theory. This study explores the potential for cooperation by varying the number of participants in public goods games using deep Q-learning. The simulations demonstrate that agents with Q-learning can acquire cooperative behaviour similar to that of humans. Moreover, we found that cooperation is more likely to occur as the group size increases. These results support and reinforce existing experiments involving humans. In addition, they have potential applications for creating cooperation without sanctions.

摘要

电力竞争、二氧化碳(CO₂)排放限制以及国家间的军备竞赛都是人类社会中社会困境的例子。在社会困境存在的情况下,博弈论中的理性选择会导致由于合作行为的成本而避免合作。然而,在使用模拟社会困境的公共物品博弈的实验中,人类常常表现出偏离个体理性的合作行为。尽管进行了广泛的研究,但人类合作行为与博弈论预测之间的一致性仍然不一致。本研究提出了一种解决这个问题的替代方法。我们使用了Q学习,这是一种人工智能形式,它模仿了不具备博弈论中所假设的理性的人类的决策过程。本研究通过使用深度Q学习改变公共物品博弈中的参与者数量来探索合作的潜力。模拟结果表明,具有Q学习能力的智能体可以获得与人类相似的合作行为。此外,我们发现随着群体规模的增加,合作更有可能发生。这些结果支持并强化了现有的涉及人类的实验。此外,它们在无需制裁就能创造合作方面具有潜在应用价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f202/11631413/841a28116fda/rsos.241195.f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f202/11631413/3fdc074c0d6a/rsos.241195.f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f202/11631413/0b992d940d82/rsos.241195.f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f202/11631413/27eeed333698/rsos.241195.f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f202/11631413/012ab3af08a3/rsos.241195.f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f202/11631413/841a28116fda/rsos.241195.f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f202/11631413/3fdc074c0d6a/rsos.241195.f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f202/11631413/0b992d940d82/rsos.241195.f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f202/11631413/27eeed333698/rsos.241195.f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f202/11631413/012ab3af08a3/rsos.241195.f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f202/11631413/841a28116fda/rsos.241195.f005.jpg

相似文献

1
Analysing public goods games using reinforcement learning: effect of increasing group size on cooperation.使用强化学习分析公共物品博弈:群体规模增加对合作的影响。
R Soc Open Sci. 2024 Dec 11;11(12):241195. doi: 10.1098/rsos.241195. eCollection 2024 Dec.
2
Generalized Social Dilemmas: The Evolution of Cooperation in Populations with Variable Group Size.广义社会困境:变群组大小的群体中合作的演化。
Bull Math Biol. 2019 Nov;81(11):4643-4674. doi: 10.1007/s11538-018-00545-1. Epub 2018 Dec 17.
3
Payoff-based learning explains the decline in cooperation in public goods games.基于回报的学习解释了公共物品博弈中合作行为的减少。
Proc Biol Sci. 2015 Feb 22;282(1801):20142678. doi: 10.1098/rspb.2014.2678.
4
Evolutionary multi-agent reinforcement learning in group social dilemmas.群体社会困境中的进化多智能体强化学习
Chaos. 2025 Feb 1;35(2). doi: 10.1063/5.0246332.
5
Review: Game theory of public goods in one-shot social dilemmas without assortment.综述:无分类一次性社会困境中的公共物品博弈论
J Theor Biol. 2012 Apr 21;299:9-20. doi: 10.1016/j.jtbi.2011.06.018. Epub 2011 Jun 24.
6
Dilemma of dilemmas: how collective and individual perspectives can clarify the size dilemma in voluntary linear public goods dilemmas.两难困境中的两难:集体视角与个体视角如何阐明自愿线性公共物品两难中的规模困境
PLoS One. 2015 Mar 23;10(3):e0120379. doi: 10.1371/journal.pone.0120379. eCollection 2015.
7
Learning enables adaptation in cooperation for multi-player stochastic games.学习能够促进多人随机博弈中的合作适应。
J R Soc Interface. 2020 Nov;17(172):20200639. doi: 10.1098/rsif.2020.0639. Epub 2020 Nov 18.
8
Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin.强化学习解释了条件性合作及其喜怒无常的同类现象。
PLoS Comput Biol. 2016 Jul 20;12(7):e1005034. doi: 10.1371/journal.pcbi.1005034. eCollection 2016 Jul.
9
Bounded learning and planning in public goods games.公共物品博弈中的有界学习和规划。
Phys Rev E. 2023 May;107(5-1):054140. doi: 10.1103/PhysRevE.107.054140.
10
Cooperation, psychological game theory, and limitations of rationality in social interaction.合作、心理博弈论与社会互动中理性的局限性
Behav Brain Sci. 2003 Apr;26(2):139-53; discussion 153-98. doi: 10.1017/s0140525x03000050.

本文引用的文献

1
The impact of generative artificial intelligence on socioeconomic inequalities and policy making.生成式人工智能对社会经济不平等和政策制定的影响。
PNAS Nexus. 2024 Jun 11;3(6):pgae191. doi: 10.1093/pnasnexus/pgae191. eCollection 2024 Jun.
2
A Turing test of whether AI chatbots are behaviorally similar to humans.人工智能聊天机器人是否在行为上与人类相似的图灵测试。
Proc Natl Acad Sci U S A. 2024 Feb 27;121(9):e2313925121. doi: 10.1073/pnas.2313925121. Epub 2024 Feb 22.
3
The emergence of economic rationality of GPT.GPT 的经济理性的出现。
Proc Natl Acad Sci U S A. 2023 Dec 19;120(51):e2316205120. doi: 10.1073/pnas.2316205120. Epub 2023 Dec 12.
4
Improving the Survival Time of Multiagents in Social Dilemmas through Neurotransmitter-Based Deep Q-Learning Model of Emotions.通过基于神经递质的情绪深度 Q 学习模型提高社会困境中多智能体的生存时间。
J Healthc Eng. 2022 Jan 25;2022:3449433. doi: 10.1155/2022/3449433. eCollection 2022.
5
Large scale and information effects on cooperation in public good games.大规模和信息效应对公共物品博弈中合作的影响。
Sci Rep. 2019 Oct 21;9(1):15023. doi: 10.1038/s41598-019-50964-w.
6
Social learning strategies regulate the wisdom and madness of interactive crowds.社会学习策略调节互动群体的智慧与疯狂。
Nat Hum Behav. 2019 Feb;3(2):183-193. doi: 10.1038/s41562-018-0518-x. Epub 2019 Jan 21.
7
Group size effects and critical mass in public goods games.群体规模效应对公共物品博弈的影响和关键质量。
Sci Rep. 2019 Apr 2;9(1):5503. doi: 10.1038/s41598-019-41988-3.
8
DeepStack: Expert-level artificial intelligence in heads-up no-limit poker.深筹码:单人无限注德州扑克中的专家级人工智能。
Science. 2017 May 5;356(6337):508-513. doi: 10.1126/science.aam6960. Epub 2017 Mar 2.
9
Group Size Effect on Cooperation in One-Shot Social Dilemmas II: Curvilinear Effect.单次社会困境中群体规模对合作的影响II:曲线效应
PLoS One. 2015 Jul 16;10(7):e0131419. doi: 10.1371/journal.pone.0131419. eCollection 2015.
10
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.