从敲诈勒索到慷慨大方，重复囚徒困境中的进化。

From extortion to generosity, evolution in the Iterated Prisoner's Dilemma.

机构信息

Department of Biology, University of Pennsylvania, Philadelphia, PA 19104.

出版信息

Proc Natl Acad Sci U S A. 2013 Sep 17;110(38):15348-53. doi: 10.1073/pnas.1306246110. Epub 2013 Sep 3.

DOI:10.1073/pnas.1306246110

PMID:24003115

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3780848/

Abstract

Recent work has revealed a new class of "zero-determinant" (ZD) strategies for iterated, two-player games. ZD strategies allow a player to unilaterally enforce a linear relationship between her score and her opponent's score, and thus to achieve an unusual degree of control over both players' long-term payoffs. Although originally conceived in the context of classical two-player game theory, ZD strategies also have consequences in evolving populations of players. Here, we explore the evolutionary prospects for ZD strategies in the Iterated Prisoner's Dilemma (IPD). Several recent studies have focused on the evolution of "extortion strategies," a subset of ZD strategies, and have found them to be unsuccessful in populations. Nevertheless, we identify a different subset of ZD strategies, called "generous ZD strategies," that forgive defecting opponents but nonetheless dominate in evolving populations. For all but the smallest population sizes, generous ZD strategies are not only robust to being replaced by other strategies but can selectively replace any noncooperative ZD strategy. Generous strategies can be generalized beyond the space of ZD strategies, and they remain robust to invasion. When evolution occurs on the full set of all IPD strategies, selection disproportionately favors these generous strategies. In some regimes, generous strategies outperform even the most successful of the well-known IPD strategies, including win-stay-lose-shift.

摘要

最近的研究揭示了一类新的“零行列式”（ZD）策略，用于迭代的两人游戏。ZD 策略允许玩家单方面强制她的得分和她对手的得分之间存在线性关系，从而对两个玩家的长期收益实现了不同寻常的控制程度。尽管最初是在经典的两人博弈论背景下提出的，但 ZD 策略在玩家的进化群体中也有后果。在这里，我们探讨了 ZD 策略在迭代囚徒困境（IPD）中的进化前景。最近的几项研究集中在“敲诈策略”（ZD 策略的一个子集）的进化上，并发现它们在群体中不成功。然而，我们确定了 ZD 策略的另一个子集，称为“慷慨 ZD 策略”，它原谅背叛的对手，但在进化群体中占主导地位。对于除最小种群大小外的所有情况，慷慨 ZD 策略不仅对被其他策略取代具有稳健性，而且可以选择性地取代任何非合作的 ZD 策略。慷慨策略可以推广到 ZD 策略空间之外，并且对入侵具有稳健性。当进化发生在 IPD 策略的完整集合上时，选择不成比例地有利于这些慷慨的策略。在某些情况下，慷慨策略的表现甚至优于最成功的知名 IPD 策略，包括赢留输换。

相似文献

From extortion to generosity, evolution in the Iterated Prisoner's Dilemma.从敲诈勒索到慷慨大方，重复囚徒困境中的进化。

Proc Natl Acad Sci U S A. 2013 Sep 17;110(38):15348-53. doi: 10.1073/pnas.1306246110. Epub 2013 Sep 3.

Extortion can outperform generosity in the iterated prisoner's dilemma.在重复囚徒困境中，敲诈策略可能比慷慨策略表现得更好。

Nat Commun. 2016 Apr 12;7:11125. doi: 10.1038/ncomms11125.

Misperception influence on zero-determinant strategies in iterated Prisoner's Dilemma.错误感知对迭代囚徒困境中零行列式策略的影响。

Sci Rep. 2022 Mar 25;12(1):5174. doi: 10.1038/s41598-022-08750-8.

Evolutionary dynamics of zero-determinant strategies in repeated multiplayer games.重复多人博弈中零行列式策略的进化动力学。

J Theor Biol. 2022 Sep 21;549:111209. doi: 10.1016/j.jtbi.2022.111209. Epub 2022 Jun 30.

Strategies that enforce linear payoff relationships under observation errors in Repeated Prisoner's Dilemma game.在重复囚徒困境博弈中观察误差下强制线性收益关系的策略。

J Theor Biol. 2019 Sep 21;477:63-76. doi: 10.1016/j.jtbi.2019.06.009. Epub 2019 Jun 12.

Autocratic strategies for iterated games with arbitrary action spaces.具有任意行动空间的重复博弈的独裁策略。

Proc Natl Acad Sci U S A. 2016 Mar 29;113(13):3573-8. doi: 10.1073/pnas.1520163113. Epub 2016 Mar 14.

Payoff landscapes and the robustness of selfish optimization in iterated games.迭代博弈中的收益景观和自利优化的稳健性。

J Math Biol. 2022 May 12;84(6):55. doi: 10.1007/s00285-022-01758-8.

Evolution of extortion in Iterated Prisoner's Dilemma games.重复囚徒困境博弈中的敲诈勒索行为的演变。

Proc Natl Acad Sci U S A. 2013 Apr 23;110(17):6913-8. doi: 10.1073/pnas.1214834110. Epub 2013 Apr 9.

Outlearning extortioners: unbending strategies can foster reciprocal fairness and cooperation.战胜敲诈者：坚定的策略能够促进互惠公平与合作。

PNAS Nexus. 2023 May 25;2(6):pgad176. doi: 10.1093/pnasnexus/pgad176. eCollection 2023 Jun.

Linear algebraic structure of zero-determinant strategies in repeated games.重复博弈中零行列式策略的线性代数结构。

PLoS One. 2020 Apr 2;15(4):e0230973. doi: 10.1371/journal.pone.0230973. eCollection 2020.

引用本文的文献

Evolving general cooperation with a Bayesian theory of mind.与贝叶斯心理理论不断发展的一般合作。

Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2400993122. doi: 10.1073/pnas.2400993122. Epub 2025 Jun 16.

Unilateral incentive alignment in two-agent stochastic games.双智能体随机博弈中的单边激励对齐

Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2319927121. doi: 10.1073/pnas.2319927121. Epub 2025 Jun 16.

Stable strategies of direct and indirect reciprocity across all social dilemmas.适用于所有社会困境的直接和间接互惠的稳定策略。

PNAS Nexus. 2025 May 10;4(5):pgaf154. doi: 10.1093/pnasnexus/pgaf154. eCollection 2025 May.

Evolutionary dynamics of behavioral motivations for cooperation.合作行为动机的进化动态

Nat Commun. 2025 Apr 29;16(1):4023. doi: 10.1038/s41467-025-59366-1.

Indirect reciprocity in the public goods game with collective reputations.具有集体声誉的公共物品博弈中的间接互惠。

J R Soc Interface. 2025 Apr;22(225):20240827. doi: 10.1098/rsif.2024.0827. Epub 2025 Apr 2.

Repeated games with partner choice.有伙伴选择的重复博弈。

PLoS Comput Biol. 2025 Feb 4;21(2):e1012810. doi: 10.1371/journal.pcbi.1012810. eCollection 2025 Feb.

Conditional cooperation with longer memory.具有更长记忆的条件性合作。

Proc Natl Acad Sci U S A. 2024 Dec 10;121(50):e2420125121. doi: 10.1073/pnas.2420125121. Epub 2024 Dec 6.

Resolving social dilemmas with minimal reward transfer.以最小的奖励转移解决社会困境。

Auton Agent Multi Agent Syst. 2024;38(2):49. doi: 10.1007/s10458-024-09675-4. Epub 2024 Oct 12.

Evolution of reciprocity with limited payoff memory.回报有限记忆下的互惠行为演变。

Proc Biol Sci. 2024 Jun;291(2025):20232493. doi: 10.1098/rspb.2023.2493. Epub 2024 Jun 19.

The distorting effects of producer strategies: Why engagement does not reveal consumer preferences for misinformation.生产者策略的扭曲效应：为什么参与度不能揭示消费者对错误信息的偏好。

Proc Natl Acad Sci U S A. 2024 Mar 5;121(10):e2315195121. doi: 10.1073/pnas.2315195121. Epub 2024 Feb 27.

本文引用的文献

Evolutionary instability of zero-determinant strategies demonstrates that winning is not everything.零行列式策略的进化不稳定性表明，获胜并非一切。

Nat Commun. 2013;4:2193. doi: 10.1038/ncomms3193.

Evolution of extortion in Iterated Prisoner's Dilemma games.重复囚徒困境博弈中的敲诈勒索行为的演变。

Proc Natl Acad Sci U S A. 2013 Apr 23;110(17):6913-8. doi: 10.1073/pnas.1214834110. Epub 2013 Apr 9.

Extortion and cooperation in the Prisoner's Dilemma.囚徒困境中的敲诈与合作。

Proc Natl Acad Sci U S A. 2012 Jun 26;109(26):10134-5. doi: 10.1073/pnas.1208087109. Epub 2012 Jun 18.

Iterated Prisoner's Dilemma contains strategies that dominate any evolutionary opponent.迭代囚徒困境包含了能够支配任何进化对手的策略。

Proc Natl Acad Sci U S A. 2012 Jun 26;109(26):10409-13. doi: 10.1073/pnas.1206569109. Epub 2012 May 21.

Critical dynamics in the evolution of stochastic strategies for the iterated prisoner's dilemma.随机策略在迭代囚徒困境中的演化的临界动力学。

PLoS Comput Biol. 2010 Oct 7;6(10):e1000948. doi: 10.1371/journal.pcbi.1000948.

Coordinated punishment of defectors sustains cooperation and can proliferate when rare.当稀有资源发生背叛时，协调惩罚背叛者可以维持合作，并使其扩散。

Science. 2010 Apr 30;328(5978):617-20. doi: 10.1126/science.1183665.

Stochastic evolutionary dynamics of direct reciprocity.直接互惠的随机进化动力学。

Proc Biol Sci. 2010 Feb 7;277(1680):463-8. doi: 10.1098/rspb.2009.1171. Epub 2009 Oct 21.

Tit-for-tat or win-stay, lose-shift?以牙还牙还是赢则继续，输则改变？

J Theor Biol. 2007 Aug 7;247(3):574-80. doi: 10.1016/j.jtbi.2007.03.027. Epub 2007 Mar 24.

Stochastic dynamics of invasion and fixation.入侵与固定的随机动力学

Phys Rev E Stat Nonlin Soft Matter Phys. 2006 Jul;74(1 Pt 1):011909. doi: 10.1103/PhysRevE.74.011909. Epub 2006 Jul 17.

Emergence of cooperation and evolutionary stability in finite populations.有限种群中合作的出现与进化稳定性

Nature. 2004 Apr 8;428(6983):646-50. doi: 10.1038/nature02414.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验