Using a theory of mind to find best responses to memory-one strategies.

Affiliations

School of Mathematics, Cardiff University, Cardiff, CF24 4AG, UK.

Max Planck Institute for Evolutionary Biology, Plön, 24 306, Germany.

Publication information

Sci Rep. 2020 Oct 14;10(1):17287. doi: 10.1038/s41598-020-74181-y.

DOI: 10.1038/s41598-020-74181-y
PMID: 33057134
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7560663/
Abstract

Memory-one strategies are a set of Iterated Prisoner's Dilemma strategies that have been praised for their mathematical tractability and performance against single opponents. This manuscript investigates best-response memory-one strategies with a theory of mind for their opponents. The results add to the literature showing that extortionate play is not always optimal by showing that optimal play is often not extortionate. They also provide evidence that memory-one strategies suffer from their limited memory in multi-agent interactions and can be outperformed by optimised strategies with longer memory. We have developed a theory that allows the entire space of memory-one strategies to be explored. The framework presented is suitable for studying memory-one strategies not only in the Prisoner's Dilemma but also in evolutionary processes such as the Moran process. Furthermore, results on the stability of defection in populations of memory-one strategies are obtained.
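To make the notion of a best response to a memory-one strategy concrete, the following is a minimal sketch and not the authors' method. It assumes the conventional payoff values (R, S, T, P) = (3, 0, 5, 1), writes a memory-one strategy as four cooperation probabilities after the outcomes CC, CD, DC and DD (own action first), and estimates the long-run payoff by time-averaging the state distribution of the resulting four-state Markov chain from a uniform start. The function names, the uniform starting distribution and the coarse brute-force grid are all illustrative assumptions; when the chain has more than one recurrent class, the result depends on the starting distribution.

```python
from itertools import product

# Conventional Prisoner's Dilemma payoffs to the focal player in states
# CC, CD, DC, DD (focal player's action first). Assumed values, not taken
# from the abstract.
PAYOFFS = (3.0, 0.0, 5.0, 1.0)


def transition_matrix(p, q):
    """Markov chain over states CC, CD, DC, DD for memory-one players p and q.

    p[i] and q[i] are cooperation probabilities after state i, each written
    from that player's own point of view (own action first).
    """
    q_seen = (q[0], q[2], q[1], q[3])  # the opponent sees CD as DC and vice versa
    return [
        [a * b, a * (1 - b), (1 - a) * b, (1 - a) * (1 - b)]
        for a, b in zip(p, q_seen)
    ]


def long_run_payoff(p, q, steps=2000):
    """Time-averaged payoff to p against q from a uniform starting distribution."""
    m = transition_matrix(p, q)
    v = [0.25] * 4
    total = [0.0] * 4
    for _ in range(steps):
        # One step of the chain: v <- v M, then accumulate for the time average.
        v = [sum(v[i] * m[i][j] for i in range(4)) for j in range(4)]
        total = [t + x for t, x in zip(total, v)]
    return sum(pay * t / steps for pay, t in zip(PAYOFFS, total))


def grid_best_response(q, levels=(0.0, 0.5, 1.0)):
    """Brute-force search over a coarse grid of memory-one strategies (illustrative)."""
    return max(product(levels, repeat=4), key=lambda p: long_run_payoff(p, q))
```

For example, `long_run_payoff((0, 0, 0, 0), (1, 1, 1, 1))` evaluates an unconditional defector against an unconditional cooperator, and `grid_best_response((1, 1, 1, 1))` searches the grid for the strategy that earns the most against that cooperator. A finer grid or a proper optimiser would be needed for anything beyond illustration.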

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2586/7560663/3dbfa46283b9/41598_2020_74181_Fig1_HTML.jpg
Figure 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2586/7560663/4aafca84a9db/41598_2020_74181_Fig2_HTML.jpg
Figure 3: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2586/7560663/f7da86333d61/41598_2020_74181_Fig3_HTML.jpg
