Suppr超能文献

从敲诈勒索到慷慨大方,重复囚徒困境中的进化。

From extortion to generosity, evolution in the Iterated Prisoner's Dilemma.

机构信息

Department of Biology, University of Pennsylvania, Philadelphia, PA 19104.

出版信息

Proc Natl Acad Sci U S A. 2013 Sep 17;110(38):15348-53. doi: 10.1073/pnas.1306246110. Epub 2013 Sep 3.

Abstract

Recent work has revealed a new class of "zero-determinant" (ZD) strategies for iterated, two-player games. ZD strategies allow a player to unilaterally enforce a linear relationship between her score and her opponent's score, and thus to achieve an unusual degree of control over both players' long-term payoffs. Although originally conceived in the context of classical two-player game theory, ZD strategies also have consequences in evolving populations of players. Here, we explore the evolutionary prospects for ZD strategies in the Iterated Prisoner's Dilemma (IPD). Several recent studies have focused on the evolution of "extortion strategies," a subset of ZD strategies, and have found them to be unsuccessful in populations. Nevertheless, we identify a different subset of ZD strategies, called "generous ZD strategies," that forgive defecting opponents but nonetheless dominate in evolving populations. For all but the smallest population sizes, generous ZD strategies are not only robust to being replaced by other strategies but can selectively replace any noncooperative ZD strategy. Generous strategies can be generalized beyond the space of ZD strategies, and they remain robust to invasion. When evolution occurs on the full set of all IPD strategies, selection disproportionately favors these generous strategies. In some regimes, generous strategies outperform even the most successful of the well-known IPD strategies, including win-stay-lose-shift.

摘要

最近的研究揭示了一类新的“零行列式”(ZD)策略,用于迭代的两人游戏。ZD 策略允许玩家单方面强制她的得分和她对手的得分之间存在线性关系,从而对两个玩家的长期收益实现了不同寻常的控制程度。尽管最初是在经典的两人博弈论背景下提出的,但 ZD 策略在玩家的进化群体中也有后果。在这里,我们探讨了 ZD 策略在迭代囚徒困境(IPD)中的进化前景。最近的几项研究集中在“敲诈策略”(ZD 策略的一个子集)的进化上,并发现它们在群体中不成功。然而,我们确定了 ZD 策略的另一个子集,称为“慷慨 ZD 策略”,它原谅背叛的对手,但在进化群体中占主导地位。对于除最小种群大小外的所有情况,慷慨 ZD 策略不仅对被其他策略取代具有稳健性,而且可以选择性地取代任何非合作的 ZD 策略。慷慨策略可以推广到 ZD 策略空间之外,并且对入侵具有稳健性。当进化发生在 IPD 策略的完整集合上时,选择不成比例地有利于这些慷慨的策略。在某些情况下,慷慨策略的表现甚至优于最成功的知名 IPD 策略,包括赢留输换。

相似文献

1
From extortion to generosity, evolution in the Iterated Prisoner's Dilemma.从敲诈勒索到慷慨大方,重复囚徒困境中的进化。
Proc Natl Acad Sci U S A. 2013 Sep 17;110(38):15348-53. doi: 10.1073/pnas.1306246110. Epub 2013 Sep 3.
6
Autocratic strategies for iterated games with arbitrary action spaces.具有任意行动空间的重复博弈的独裁策略。
Proc Natl Acad Sci U S A. 2016 Mar 29;113(13):3573-8. doi: 10.1073/pnas.1520163113. Epub 2016 Mar 14.
8
Evolution of extortion in Iterated Prisoner's Dilemma games.重复囚徒困境博弈中的敲诈勒索行为的演变。
Proc Natl Acad Sci U S A. 2013 Apr 23;110(17):6913-8. doi: 10.1073/pnas.1214834110. Epub 2013 Apr 9.
10
Linear algebraic structure of zero-determinant strategies in repeated games.重复博弈中零行列式策略的线性代数结构。
PLoS One. 2020 Apr 2;15(4):e0230973. doi: 10.1371/journal.pone.0230973. eCollection 2020.

引用本文的文献

1
Evolving general cooperation with a Bayesian theory of mind.与贝叶斯心理理论不断发展的一般合作。
Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2400993122. doi: 10.1073/pnas.2400993122. Epub 2025 Jun 16.
2
Unilateral incentive alignment in two-agent stochastic games.双智能体随机博弈中的单边激励对齐
Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2319927121. doi: 10.1073/pnas.2319927121. Epub 2025 Jun 16.
4
Evolutionary dynamics of behavioral motivations for cooperation.合作行为动机的进化动态
Nat Commun. 2025 Apr 29;16(1):4023. doi: 10.1038/s41467-025-59366-1.
5
Indirect reciprocity in the public goods game with collective reputations.具有集体声誉的公共物品博弈中的间接互惠。
J R Soc Interface. 2025 Apr;22(225):20240827. doi: 10.1098/rsif.2024.0827. Epub 2025 Apr 2.
6
Repeated games with partner choice.有伙伴选择的重复博弈。
PLoS Comput Biol. 2025 Feb 4;21(2):e1012810. doi: 10.1371/journal.pcbi.1012810. eCollection 2025 Feb.
7
Conditional cooperation with longer memory.具有更长记忆的条件性合作。
Proc Natl Acad Sci U S A. 2024 Dec 10;121(50):e2420125121. doi: 10.1073/pnas.2420125121. Epub 2024 Dec 6.
8
Resolving social dilemmas with minimal reward transfer.以最小的奖励转移解决社会困境。
Auton Agent Multi Agent Syst. 2024;38(2):49. doi: 10.1007/s10458-024-09675-4. Epub 2024 Oct 12.
9
Evolution of reciprocity with limited payoff memory.回报有限记忆下的互惠行为演变。
Proc Biol Sci. 2024 Jun;291(2025):20232493. doi: 10.1098/rspb.2023.2493. Epub 2024 Jun 19.

本文引用的文献

2
Evolution of extortion in Iterated Prisoner's Dilemma games.重复囚徒困境博弈中的敲诈勒索行为的演变。
Proc Natl Acad Sci U S A. 2013 Apr 23;110(17):6913-8. doi: 10.1073/pnas.1214834110. Epub 2013 Apr 9.
3
Extortion and cooperation in the Prisoner's Dilemma.囚徒困境中的敲诈与合作。
Proc Natl Acad Sci U S A. 2012 Jun 26;109(26):10134-5. doi: 10.1073/pnas.1208087109. Epub 2012 Jun 18.
7
Stochastic evolutionary dynamics of direct reciprocity.直接互惠的随机进化动力学。
Proc Biol Sci. 2010 Feb 7;277(1680):463-8. doi: 10.1098/rspb.2009.1171. Epub 2009 Oct 21.
8
Tit-for-tat or win-stay, lose-shift?以牙还牙还是赢则继续,输则改变?
J Theor Biol. 2007 Aug 7;247(3):574-80. doi: 10.1016/j.jtbi.2007.03.027. Epub 2007 Mar 24.
9
Stochastic dynamics of invasion and fixation.入侵与固定的随机动力学
Phys Rev E Stat Nonlin Soft Matter Phys. 2006 Jul;74(1 Pt 1):011909. doi: 10.1103/PhysRevE.74.011909. Epub 2006 Jul 17.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验