• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

战争的艺术:超越记忆——群体博弈中的一种策略

The art of war: beyond memory-one strategies in population games.

作者信息

Lee Christopher, Harper Marc, Fryer Dashiell

机构信息

Institute for Genomics and Proteomics, University of California, Los Angeles, CA, USA; Dept. of Chemistry & Biochemistry, University of California, Los Angeles, CA, USA; Dept. of Computer Science, University of California, Los Angeles, CA, USA; Molecular Biology Institute, University of California, Los Angeles, CA, USA.

Institute for Genomics and Proteomics, University of California, Los Angeles, CA, USA.

出版信息

PLoS One. 2015 Mar 24;10(3):e0120625. doi: 10.1371/journal.pone.0120625. eCollection 2015.

DOI:10.1371/journal.pone.0120625
PMID:25803576
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4372540/
Abstract

We show that the history of play in a population game contains exploitable information that can be successfully used by sophisticated strategies to defeat memory-one opponents, including zero determinant strategies. The history allows a player to label opponents by their strategies, enabling a player to determine the population distribution and to act differentially based on the opponent's strategy in each pairwise interaction. For the Prisoner's Dilemma, these advantages lead to the natural formation of cooperative coalitions among similarly behaving players and eventually to unilateral defection against opposing player types. We show analytically and empirically that optimal play in population games depends strongly on the population distribution. For example, the optimal strategy for a minority player type against a resident TFT population is ALLC, while for a majority player type the optimal strategy versus TFT players is ALLD. Such behaviors are not accessible to memory-one strategies. Drawing inspiration from Sun Tzu's the Art of War, we implemented a non-memory-one strategy for population games based on techniques from machine learning and statistical inference that can exploit the history of play in this manner. Via simulation we find that this strategy is essentially uninvadable and can successfully invade (significantly more likely than a neutral mutant) essentially all known memory-one strategies for the Prisoner's Dilemma, including ALLC (always cooperate), ALLD (always defect), tit-for-tat (TFT), win-stay-lose-shift (WSLS), and zero determinant (ZD) strategies, including extortionate and generous strategies.

摘要

我们表明,群体博弈中的博弈历史包含可利用的信息,复杂策略可以成功利用这些信息来击败记忆一策略对手,包括零行列式策略。这段历史使玩家能够根据对手的策略对其进行标记,从而使玩家能够确定群体分布,并在每次两两互动中根据对手的策略采取不同行动。对于囚徒困境而言,这些优势导致行为相似的玩家之间自然形成合作联盟,并最终导致对对立玩家类型的单方面背叛。我们通过分析和实证表明,群体博弈中的最优玩法在很大程度上取决于群体分布。例如,少数玩家类型对抗常驻的以牙还牙(TFT)群体时的最优策略是始终合作(ALLC),而多数玩家类型对抗TFT玩家时的最优策略是始终背叛(ALLD)。记忆一策略无法实现此类行为。从孙子的《孙子兵法》中汲取灵感,我们基于机器学习和统计推断技术,为群体博弈实施了一种非记忆一策略,该策略能够以这种方式利用博弈历史。通过模拟我们发现,该策略基本上不可被入侵,并且能够成功入侵(比中性突变体的可能性大得多)囚徒困境中基本上所有已知的记忆一策略,包括始终合作(ALLC)、始终背叛(ALLD)、以牙还牙(TFT)、赢留输变(WSLS)以及零行列式(ZD)策略,包括敲诈勒索和慷慨策略。

相似文献

1
The art of war: beyond memory-one strategies in population games.战争的艺术:超越记忆——群体博弈中的一种策略
PLoS One. 2015 Mar 24;10(3):e0120625. doi: 10.1371/journal.pone.0120625. eCollection 2015.
2
Tit-for-tat or win-stay, lose-shift?以牙还牙还是赢则继续,输则改变?
J Theor Biol. 2007 Aug 7;247(3):574-80. doi: 10.1016/j.jtbi.2007.03.027. Epub 2007 Mar 24.
3
From extortion to generosity, evolution in the Iterated Prisoner's Dilemma.从敲诈勒索到慷慨大方,重复囚徒困境中的进化。
Proc Natl Acad Sci U S A. 2013 Sep 17;110(38):15348-53. doi: 10.1073/pnas.1306246110. Epub 2013 Sep 3.
4
Strategies that enforce linear payoff relationships under observation errors in Repeated Prisoner's Dilemma game.在重复囚徒困境博弈中观察误差下强制线性收益关系的策略。
J Theor Biol. 2019 Sep 21;477:63-76. doi: 10.1016/j.jtbi.2019.06.009. Epub 2019 Jun 12.
5
Win-stay, lose-shift strategies for repeated games-memory length, aspiration levels and noise.重复博弈中的赢留输变策略——记忆长度、期望水平与噪声
J Theor Biol. 1999 May 21;198(2):183-95. doi: 10.1006/jtbi.1999.0909.
6
No strategy is evolutionarily stable in the repeated prisoner's dilemma.在重复的囚徒困境中,没有任何策略在进化上是稳定的。
J Theor Biol. 1994 May 21;168(2):117-30. doi: 10.1006/jtbi.1994.1092.
7
Autocratic strategies for iterated games with arbitrary action spaces.具有任意行动空间的重复博弈的独裁策略。
Proc Natl Acad Sci U S A. 2016 Mar 29;113(13):3573-8. doi: 10.1073/pnas.1520163113. Epub 2016 Mar 14.
8
A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner's Dilemma game.在囚徒困境博弈中,一种“赢则继续,输则转换”的策略比针锋相对策略表现更优。
Nature. 1993 Jul 1;364(6432):56-8. doi: 10.1038/364056a0.
9
Evolutionary performance of zero-determinant strategies in multiplayer games.多人游戏中零行列式策略的进化性能
J Theor Biol. 2015 Jun 7;374:115-24. doi: 10.1016/j.jtbi.2015.03.032. Epub 2015 Apr 3.
10
Natural selection of memory-one strategies for the iterated prisoner's dilemma.重复囚徒困境中记忆一策略的自然选择
J Theor Biol. 2000 Apr 21;203(4):335-55. doi: 10.1006/jtbi.2000.1089.

引用本文的文献

1
Recognising and evaluating the effectiveness of extortion in the Iterated Prisoner's Dilemma.识别和评估迭代囚徒困境中的敲诈行为的有效性。
PLoS One. 2024 Jul 26;19(7):e0304641. doi: 10.1371/journal.pone.0304641. eCollection 2024.
2
Direct reciprocity between individuals that use different strategy spaces.个体之间使用不同策略空间的直接互惠。
PLoS Comput Biol. 2022 Jun 14;18(6):e1010149. doi: 10.1371/journal.pcbi.1010149. eCollection 2022 Jun.
3
Using a theory of mind to find best responses to memory-one strategies.运用心理理论寻找对记忆策略的最佳反应。

本文引用的文献

1
Collapse of cooperation in evolving games.进化博弈中合作的瓦解。
Proc Natl Acad Sci U S A. 2014 Dec 9;111(49):17558-63. doi: 10.1073/pnas.1408618111. Epub 2014 Nov 24.
2
Adaptive dynamics of extortion and compliance.敲诈与服从的适应动态。
PLoS One. 2013 Nov 1;8(11):e77886. doi: 10.1371/journal.pone.0077886. eCollection 2013.
3
From extortion to generosity, evolution in the Iterated Prisoner's Dilemma.从敲诈勒索到慷慨大方,重复囚徒困境中的进化。
Sci Rep. 2020 Oct 14;10(1):17287. doi: 10.1038/s41598-020-74181-y.
4
Evolution reinforces cooperation with the emergence of self-recognition mechanisms: An empirical study of strategies in the Moran process for the iterated prisoner's dilemma.进化通过自我识别机制的出现来加强合作:对迭代囚徒困境中 Moran 过程策略的实证研究。
PLoS One. 2018 Oct 25;13(10):e0204981. doi: 10.1371/journal.pone.0204981. eCollection 2018.
5
Evolutionary consequences of behavioral diversity.行为多样性的进化后果。
Proc Natl Acad Sci U S A. 2016 Nov 8;113(45):E7003-E7009. doi: 10.1073/pnas.1608990113. Epub 2016 Oct 24.
6
Small groups and long memories promote cooperation.小群体和长久的记忆促进合作。
Sci Rep. 2016 Jun 1;6:26889. doi: 10.1038/srep26889.
Proc Natl Acad Sci U S A. 2013 Sep 17;110(38):15348-53. doi: 10.1073/pnas.1306246110. Epub 2013 Sep 3.
4
Evolutionary instability of zero-determinant strategies demonstrates that winning is not everything.零行列式策略的进化不稳定性表明,获胜并非一切。
Nat Commun. 2013;4:2193. doi: 10.1038/ncomms3193.
5
Fusing enacted and expected mimicry generates a winning strategy that promotes the evolution of cooperation.融合真实和预期的模仿行为会产生一种成功的策略,从而促进合作的进化。
Proc Natl Acad Sci U S A. 2013 Jun 18;110(25):10229-33. doi: 10.1073/pnas.1308221110. Epub 2013 Jun 3.
6
Evolution of extortion in Iterated Prisoner's Dilemma games.重复囚徒困境博弈中的敲诈勒索行为的演变。
Proc Natl Acad Sci U S A. 2013 Apr 23;110(17):6913-8. doi: 10.1073/pnas.1214834110. Epub 2013 Apr 9.
7
Extortion and cooperation in the Prisoner's Dilemma.囚徒困境中的敲诈与合作。
Proc Natl Acad Sci U S A. 2012 Jun 26;109(26):10134-5. doi: 10.1073/pnas.1208087109. Epub 2012 Jun 18.
8
Direct reciprocity in structured populations.结构种群中的直接互惠。
Proc Natl Acad Sci U S A. 2012 Jun 19;109(25):9929-34. doi: 10.1073/pnas.1206694109. Epub 2012 Jun 4.
9
Iterated Prisoner's Dilemma contains strategies that dominate any evolutionary opponent.迭代囚徒困境包含了能够支配任何进化对手的策略。
Proc Natl Acad Sci U S A. 2012 Jun 26;109(26):10409-13. doi: 10.1073/pnas.1206569109. Epub 2012 May 21.
10
Tit-for-tat or win-stay, lose-shift?以牙还牙还是赢则继续,输则改变?
J Theor Biol. 2007 Aug 7;247(3):574-80. doi: 10.1016/j.jtbi.2007.03.027. Epub 2007 Mar 24.