
Inferring strategies from observations in long iterated Prisoner's dilemma experiments.

Affiliations

Artificial Intelligence Laboratory, Vrije Universiteit Brussel, Brussels, 1050, Belgium.

Machine Learning Group, Université Libre de Bruxelles, Brussels, 1050, Belgium.

Publication information

Sci Rep. 2022 May 9;12(1):7589. doi: 10.1038/s41598-022-11654-2.

DOI:10.1038/s41598-022-11654-2
PMID:35534534
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9085774/
Abstract

While many theoretical studies have revealed the strategies that could lead to and maintain cooperation in the Iterated Prisoner's Dilemma, less is known about what human participants actually do in this game and how their strategies change when they are confronted with anonymous partners in each round. Previous attempts used short experiments, made different assumptions about the possible strategies, and led to very different conclusions. We present here two long treatments that differ in the partner matching strategy used, i.e. fixed or shuffled partners. We use unsupervised methods to cluster the players based on their actions and then a Hidden Markov Model to infer the memory-one strategies in each cluster. Analysis of the inferred strategies reveals that fixed partner interaction leads to behavioral self-organization. Shuffled partners generate subgroups of memory-one strategies that remain entangled, apparently blocking the self-selection process that leads to fully cooperating participants in the fixed partner treatment. Analyzing the latter in more detail shows that AllC-, AllD-, TFT- and WSLS-like behavior can be observed. This study also reveals that long treatments are needed, as experiments with fewer than 25 rounds capture mostly the learning phase participants go through in these kinds of experiments.
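As a rough illustration of what "inferring a memory-one strategy" means, the sketch below represents a strategy as four probabilities of cooperating after each previous outcome (own move, partner move) and recovers them from an observed sequence by simple frequency counting. This is a deliberately simplified stand-in and not the clustering plus Hidden Markov Model pipeline the abstract describes. The function names, the state ordering CC, CD, DC, DD, the 200-round length, and the coin-flipping partner are all assumptions made for the example.

```python
import random
from collections import Counter

# Previous-round outcomes as (own move, partner move); this ordering is an
# assumption of the sketch, not something fixed by the abstract.
STATES = ["CC", "CD", "DC", "DD"]

# Textbook memory-one strategies: probability of cooperating after each state.
CANONICAL = {
    "AllC": (1.0, 1.0, 1.0, 1.0),
    "AllD": (0.0, 0.0, 0.0, 0.0),
    "TFT":  (1.0, 0.0, 1.0, 0.0),   # copy the partner's previous move
    "WSLS": (1.0, 0.0, 0.0, 1.0),   # repeat after CC/DC payoffs, else switch
}


def play(strategy_a, strategy_b, rounds, rng, first_a="C", first_b="C"):
    """Simulate two memory-one players; return their move strings."""
    a_moves, b_moves = [first_a], [first_b]
    for _ in range(rounds - 1):
        # Each player sees the previous round from its own point of view.
        p_a = strategy_a[STATES.index(a_moves[-1] + b_moves[-1])]
        p_b = strategy_b[STATES.index(b_moves[-1] + a_moves[-1])]
        a_moves.append("C" if rng.random() < p_a else "D")
        b_moves.append("C" if rng.random() < p_b else "D")
    return "".join(a_moves), "".join(b_moves)


def estimate_memory_one(own, partner):
    """Estimate P(cooperate | previous outcome) by counting transitions.

    States that never occur in the data get None instead of a probability.
    """
    coop, total = Counter(), Counter()
    for t in range(1, len(own)):
        state = own[t - 1] + partner[t - 1]
        total[state] += 1
        if own[t] == "C":
            coop[state] += 1
    return {s: (coop[s] / total[s] if total[s] else None) for s in STATES}


def closest_canonical(estimate):
    """Name the canonical strategy nearest to the estimate (squared distance).

    Only states that were actually observed contribute to the distance.
    """
    def dist(vec):
        return sum((estimate[s] - p) ** 2
                   for s, p in zip(STATES, vec) if estimate[s] is not None)
    return min(CANONICAL, key=lambda name: dist(CANONICAL[name]))


if __name__ == "__main__":
    rng = random.Random(0)
    coin_flipper = (0.5, 0.5, 0.5, 0.5)  # hypothetical partner for the demo
    own, partner = play(CANONICAL["WSLS"], coin_flipper, 200, rng)
    print(closest_canonical(estimate_memory_one(own, partner)))
```

Counting only works when every previous outcome is visited often enough. A TFT player facing a constant defector, for instance, produces the same observations as AllD after the first round, which is one reason short experiments make strategies hard to tell apart.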


Figures

Fig. 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f2d/9085774/eb227b676264/41598_2022_11654_Fig1_HTML.jpg
Fig. 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f2d/9085774/63db96eaa479/41598_2022_11654_Fig2_HTML.jpg
Fig. 3: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f2d/9085774/853740ddcee9/41598_2022_11654_Fig3_HTML.jpg
Fig. 4: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f2d/9085774/6c0a3c4304a9/41598_2022_11654_Fig4_HTML.jpg
Fig. 5: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f2d/9085774/bd10b35564fd/41598_2022_11654_Fig5_HTML.jpg
Fig. 6: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f2d/9085774/55cd47ceb8f6/41598_2022_11654_Fig6_HTML.jpg
Fig. 7: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f2d/9085774/78bf2e2e6567/41598_2022_11654_Fig7_HTML.jpg


