• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重复博弈中行为数据的战略信息处理

Strategic Information Processing from Behavioural Data in Iterated Games.

作者信息

Harré Michael S

机构信息

Complex Systems Research Group, Faculty of Engineering and IT, The University of Sydney, Sydney 2006, Australia.

出版信息

Entropy (Basel). 2018 Jan 4;20(1):27. doi: 10.3390/e20010027.

DOI:10.3390/e20010027
PMID:33265117
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7512235/
Abstract

Iterated games are an important framework of economic theory and application, at least since the original work of Axelrod's computational tournaments of the early 80's. Recent theoretical results have shown that games (the economic context) and game theory (the decision-making process) are both formally equivalent to computational logic gates. Here these results are extended to behavioural data obtained from an experiment in which rhesus monkeys sequentially played thousands of the "matching pennies" game, an empirical example similar to Axelrod's tournaments in which algorithms played against one another. The results show that the monkeys exhibit a rich variety of behaviours, both between and within subjects when playing opponents of varying complexity. Despite earlier suggestions, there is no clear evidence that the win-stay, lose-switch strategy is used, however there is evidence of non-linear strategy-based interactions between the predictors of future choices. It is also shown that there is consistent evidence across protocols and across individuals that the monkeys extract non-markovian information, i.e., information from more than just the most recent state of the game. This work shows that the use of information theory in game theory can test important hypotheses that would otherwise be more difficult to extract using traditional statistical methods.

摘要

至少从20世纪80年代初阿克塞尔罗德的计算竞赛的开创性工作以来,重复博弈一直是经济理论与应用的一个重要框架。最近的理论结果表明,博弈(经济背景)和博弈论(决策过程)在形式上都等同于计算逻辑门。在此,这些结果被扩展到从一项实验中获得的行为数据,在该实验中,恒河猴依次进行了数千次“猜硬币”游戏,这是一个类似于阿克塞尔罗德竞赛的实证例子,在竞赛中算法相互对抗。结果表明,当与不同复杂度的对手博弈时,猴子在个体之间以及个体内部都表现出丰富多样的行为。尽管早期有相关推测,但没有明确证据表明猴子使用了“赢则继续,输则改变”策略,不过有证据表明未来选择的预测因素之间存在基于策略的非线性相互作用。研究还表明,在不同的实验方案和不同个体之间都有一致的证据表明,猴子提取了非马尔可夫信息,即不仅仅是来自博弈最近状态的信息。这项工作表明,在博弈论中使用信息论可以检验一些重要假设,否则使用传统统计方法将更难提取这些假设。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69c5/7512235/dad6fb3e1bf1/entropy-20-00027-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69c5/7512235/3ea560c06e44/entropy-20-00027-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69c5/7512235/25f16a93d00e/entropy-20-00027-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69c5/7512235/53e33be08f03/entropy-20-00027-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69c5/7512235/dcf48c0f94d3/entropy-20-00027-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69c5/7512235/dad6fb3e1bf1/entropy-20-00027-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69c5/7512235/3ea560c06e44/entropy-20-00027-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69c5/7512235/25f16a93d00e/entropy-20-00027-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69c5/7512235/53e33be08f03/entropy-20-00027-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69c5/7512235/dcf48c0f94d3/entropy-20-00027-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69c5/7512235/dad6fb3e1bf1/entropy-20-00027-g005.jpg

相似文献

1
Strategic Information Processing from Behavioural Data in Iterated Games.重复博弈中行为数据的战略信息处理
Entropy (Basel). 2018 Jan 4;20(1):27. doi: 10.3390/e20010027.
2
Is Tit-for-Tat the Answer? On the Conclusions Drawn from Axelrod's Tournaments.以牙还牙是答案吗?关于从阿克塞尔罗德竞赛得出的结论。
PLoS One. 2015 Jul 30;10(7):e0134128. doi: 10.1371/journal.pone.0134128. eCollection 2015.
3
A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner's Dilemma game.在囚徒困境博弈中,一种“赢则继续,输则转换”的策略比针锋相对策略表现更优。
Nature. 1993 Jul 1;364(6432):56-8. doi: 10.1038/364056a0.
4
Evolutionary matching-pennies game on bipartite regular networks.二分正则网络上的进化猜硬币博弈
Phys Rev E Stat Nonlin Soft Matter Phys. 2014 Apr;89(4):042820. doi: 10.1103/PhysRevE.89.042820. Epub 2014 Apr 30.
5
Working memory constrains human cooperation in the Prisoner's Dilemma.工作记忆限制了囚徒困境中人类的合作。
Proc Natl Acad Sci U S A. 1998 Nov 10;95(23):13755-8. doi: 10.1073/pnas.95.23.13755.
6
Human cooperation in the simultaneous and the alternating Prisoner's Dilemma: Pavlov versus Generous Tit-for-Tat.人类在同步和交替囚徒困境中的合作:巴甫洛夫策略与慷慨以牙还牙策略。
Proc Natl Acad Sci U S A. 1996 Apr 2;93(7):2686-9. doi: 10.1073/pnas.93.7.2686.
7
Playing Extensive Games with Learning of Opponent's Cognition.与对手认知学习博弈。
Sensors (Basel). 2024 Feb 7;24(4):1078. doi: 10.3390/s24041078.
8
Behavioural studies of strategic thinking in games.游戏中战略思维的行为研究。
Trends Cogn Sci. 2003 May;7(5):225-231. doi: 10.1016/s1364-6613(03)00094-9.
9
Cognitive Model of Trust Dynamics Predicts Human Behavior within and between Two Games of Strategic Interaction with Computerized Confederate Agents.信任动态的认知模型预测人类在与计算机化同盟代理进行的两场战略互动游戏内部及之间的行为。
Front Psychol. 2016 Feb 12;7:49. doi: 10.3389/fpsyg.2016.00049. eCollection 2016.
10
Social cycling and conditional responses in the Rock-Paper-Scissors game.石头剪刀布游戏中的社交循环与条件反应。
Sci Rep. 2014 Jul 25;4:5830. doi: 10.1038/srep05830.

引用本文的文献

1
Information Theory for Agents in Artificial Intelligence, Psychology, and Economics.人工智能、心理学和经济学中智能体的信息论
Entropy (Basel). 2021 Mar 6;23(3):310. doi: 10.3390/e23030310.
2
Information Theory in Game Theory.博弈论中的信息论
Entropy (Basel). 2018 Oct 24;20(11):817. doi: 10.3390/e20110817.
3
A Co-Opetitive Automated Negotiation Model for Vertical Allied Enterprises Teams and Stakeholders.一种面向垂直联盟企业团队和利益相关者的协同竞争自动协商模型。

本文引用的文献

1
Self-referential basis of undecidable dynamics: From the Liar paradox and the halting problem to the edge of chaos.自指基础的不可判定动力学:从说谎者悖论和停机问题到混沌边缘。
Phys Life Rev. 2019 Dec;31:134-156. doi: 10.1016/j.plrev.2018.12.003. Epub 2019 Jan 8.
2
The Umwelt of an embodied agent--a measure-theoretic definition.具身智能体的 Umwelt——一种测度论定义。
Theory Biosci. 2015 Dec;134(3-4):105-16. doi: 10.1007/s12064-015-0217-3.
3
Cortical signals for rewarded actions and strategic exploration.奖励动作和策略探索的皮层信号。
Entropy (Basel). 2018 Apr 14;20(4):286. doi: 10.3390/e20040286.
Neuron. 2013 Oct 2;80(1):223-34. doi: 10.1016/j.neuron.2013.07.040. Epub 2013 Sep 5.
4
Measuring information-transfer delays.测量信息传递延迟。
PLoS One. 2013;8(2):e55809. doi: 10.1371/journal.pone.0055809. Epub 2013 Feb 28.
5
Lateral intraparietal cortex and reinforcement learning during a mixed-strategy game.混合策略游戏中的顶内沟外侧皮质与强化学习
J Neurosci. 2009 Jun 3;29(22):7278-89. doi: 10.1523/JNEUROSCI.1479-09.2009.
6
Game theory and neural basis of social decision making.博弈论与社会决策的神经基础
Nat Neurosci. 2008 Apr;11(4):404-9. doi: 10.1038/nn2065. Epub 2008 Mar 26.
7
Reinforcement learning and decision making in monkeys during a competitive game.猴子在竞争性游戏中的强化学习与决策
Brain Res Cogn Brain Res. 2004 Dec;22(1):45-58. doi: 10.1016/j.cogbrainres.2004.07.007.
8
Prefrontal cortex and decision making in a mixed-strategy game.前额叶皮层与混合策略博弈中的决策制定
Nat Neurosci. 2004 Apr;7(4):404-10. doi: 10.1038/nn1209. Epub 2004 Mar 7.
9
Measuring information transfer.测量信息传递。
Phys Rev Lett. 2000 Jul 10;85(2):461-4. doi: 10.1103/PhysRevLett.85.461.
10
A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner's Dilemma game.在囚徒困境博弈中,一种“赢则继续,输则转换”的策略比针锋相对策略表现更优。
Nature. 1993 Jul 1;364(6432):56-8. doi: 10.1038/364056a0.