• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

有限重复博弈中的零行列式策略。

Zero-determinant strategies in finitely repeated games.

作者信息

Ichinose Genki, Masuda Naoki

机构信息

Department of Mathematical and Systems Engineering, Shizuoka University, 3-5- 1 Johoku, Naka-ku, Hamamatsu, 432-8561, Japan.

Department of Engineering Mathematics, University of Bristol, Merchant Venturers Building, Woodland Road, Clifton, Bristol BS8 1UB, United Kingdom.

出版信息

J Theor Biol. 2018 Feb 7;438:61-77. doi: 10.1016/j.jtbi.2017.11.002. Epub 2017 Nov 14.

DOI:10.1016/j.jtbi.2017.11.002
PMID:29154776
Abstract

Direct reciprocity is a mechanism for sustaining mutual cooperation in repeated social dilemma games, where a player would keep cooperation to avoid being retaliated by a co-player in the future. So-called zero-determinant (ZD) strategies enable a player to unilaterally set a linear relationship between the player's own payoff and the co-player's payoff regardless of the strategy of the co-player. In the present study, we analytically study zero-determinant strategies in finitely repeated (two-person) prisoner's dilemma games with a general payoff matrix. Our results are as follows. First, we present the forms of solutions that extend the known results for infinitely repeated games (with a discount factor w of unity) to the case of finitely repeated games (0 < w < 1). Second, for the three most prominent ZD strategies, the equalizers, extortioners, and generous strategies, we derive the threshold value of w above which the ZD strategies exist. Third, we show that the only strategies that enforce a linear relationship between the two players' payoffs are either the ZD strategies or unconditional strategies, where the latter independently cooperates with a fixed probability in each round of the game, proving a conjecture previously made for infinitely repeated games.

摘要

直接互惠是在重复社会困境博弈中维持相互合作的一种机制,在这种博弈中,玩家会保持合作以避免未来被合作玩家报复。所谓的零行列式(ZD)策略使玩家能够单方面设定自己的收益与合作玩家收益之间的线性关系,而不管合作玩家的策略如何。在本研究中,我们对具有一般收益矩阵的有限重复(两人)囚徒困境博弈中的零行列式策略进行了分析研究。我们的结果如下。首先,我们给出了解的形式,将无限重复博弈(折扣因子w为1)的已知结果扩展到了有限重复博弈(0 < w < 1)的情况。其次,对于三种最突出的ZD策略,即均等者策略、敲诈者策略和慷慨策略,我们推导出了w的阈值,高于该阈值ZD策略存在。第三,我们表明,唯一能强制在两个玩家收益之间建立线性关系的策略要么是ZD策略,要么是无条件策略,其中后者在游戏的每一轮中以固定概率独立合作,这证明了之前针对无限重复博弈所做的一个猜想。

相似文献

1
Zero-determinant strategies in finitely repeated games.有限重复博弈中的零行列式策略。
J Theor Biol. 2018 Feb 7;438:61-77. doi: 10.1016/j.jtbi.2017.11.002. Epub 2017 Nov 14.
2
Strategies that enforce linear payoff relationships under observation errors in Repeated Prisoner's Dilemma game.在重复囚徒困境博弈中观察误差下强制线性收益关系的策略。
J Theor Biol. 2019 Sep 21;477:63-76. doi: 10.1016/j.jtbi.2019.06.009. Epub 2019 Jun 12.
3
Adapting paths against zero-determinant strategies in repeated prisoner's dilemma games.在重复囚徒困境博弈中适应零行列式策略的路径。
J Theor Biol. 2022 Sep 21;549:111211. doi: 10.1016/j.jtbi.2022.111211. Epub 2022 Jul 8.
4
The robustness of zero-determinant strategies in Iterated Prisoner's Dilemma games.重复囚徒困境博弈中零行列式策略的稳健性
J Theor Biol. 2014 Sep 21;357:46-54. doi: 10.1016/j.jtbi.2014.05.004. Epub 2014 May 10.
5
Zero-determinant strategies under observation errors in repeated games.重复博弈中存在观测误差时的零行列式策略。
Phys Rev E. 2020 Sep;102(3-1):032115. doi: 10.1103/PhysRevE.102.032115.
6
Payoff landscapes and the robustness of selfish optimization in iterated games.迭代博弈中的收益景观和自利优化的稳健性。
J Math Biol. 2022 May 12;84(6):55. doi: 10.1007/s00285-022-01758-8.
7
Conditions for the existence of zero-determinant strategies under observation errors in repeated games.重复博弈中存在观测误差时零行列式策略的存在条件。
J Theor Biol. 2021 Oct 7;526:110810. doi: 10.1016/j.jtbi.2021.110810. Epub 2021 Jun 10.
8
Zero-Determinant Strategies in Iterated Public Goods Game.重复公共物品博弈中的零行列式策略
Sci Rep. 2015 Aug 21;5:13096. doi: 10.1038/srep13096.
9
Linear algebraic structure of zero-determinant strategies in repeated games.重复博弈中零行列式策略的线性代数结构。
PLoS One. 2020 Apr 2;15(4):e0230973. doi: 10.1371/journal.pone.0230973. eCollection 2020.
10
Adaptive dynamics of extortion and compliance.敲诈与服从的适应动态。
PLoS One. 2013 Nov 1;8(11):e77886. doi: 10.1371/journal.pone.0077886. eCollection 2013.

引用本文的文献

1
Unilateral incentive alignment in two-agent stochastic games.双智能体随机博弈中的单边激励对齐
Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2319927121. doi: 10.1073/pnas.2319927121. Epub 2025 Jun 16.
2
Stable strategies of direct and indirect reciprocity across all social dilemmas.适用于所有社会困境的直接和间接互惠的稳定策略。
PNAS Nexus. 2025 May 10;4(5):pgaf154. doi: 10.1093/pnasnexus/pgaf154. eCollection 2025 May.
3
Dynamics of cooperation in concurrent games.并发博弈中的合作动态
Nat Commun. 2025 Feb 11;16(1):1524. doi: 10.1038/s41467-025-56083-7.
4
Recognising and evaluating the effectiveness of extortion in the Iterated Prisoner's Dilemma.识别和评估迭代囚徒困境中的敲诈行为的有效性。
PLoS One. 2024 Jul 26;19(7):e0304641. doi: 10.1371/journal.pone.0304641. eCollection 2024.
5
Individualistic attitudes in Iterated Prisoner's Dilemma undermine evolutionary fitness and may drive cooperative human players to extinction.重复囚徒困境中的个人主义态度会损害进化适应性,并可能导致合作的人类参与者灭绝。
R Soc Open Sci. 2024 Mar 20;11(3):230867. doi: 10.1098/rsos.230867. eCollection 2024 Mar.
6
Efficiency and resilience of cooperation in asymmetric social dilemmas.不对称社会困境中的合作效率和恢复力。
Proc Natl Acad Sci U S A. 2024 Mar 5;121(10):e2315558121. doi: 10.1073/pnas.2315558121. Epub 2024 Feb 26.
7
Outlearning extortioners: unbending strategies can foster reciprocal fairness and cooperation.战胜敲诈者:坚定的策略能够促进互惠公平与合作。
PNAS Nexus. 2023 May 25;2(6):pgad176. doi: 10.1093/pnasnexus/pgad176. eCollection 2023 Jun.
8
Evolution of direct reciprocity in group-structured populations.群体结构中直接互惠的进化。
Sci Rep. 2022 Nov 4;12(1):18645. doi: 10.1038/s41598-022-23467-4.
9
Direct reciprocity between individuals that use different strategy spaces.个体之间使用不同策略空间的直接互惠。
PLoS Comput Biol. 2022 Jun 14;18(6):e1010149. doi: 10.1371/journal.pcbi.1010149. eCollection 2022 Jun.
10
Cooperation in alternating interactions with memory constraints.在具有记忆约束的交替互动中进行合作。
Nat Commun. 2022 Feb 8;13(1):737. doi: 10.1038/s41467-022-28336-2.