• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多个人工智能在迭代的石头剪刀布游戏中与人类竞争并获胜。

Multi-AI competing and winning against humans in iterated Rock-Paper-Scissors game.

机构信息

National Engineering Research Center for Optical Instruments, Centre for Optical and Electromagnetic Research, Zhejiang University, Hangzhou, 310058, China.

Ningbo Research Institute, Zhejiang University, Ningbo, 315100, China.

出版信息

Sci Rep. 2020 Aug 17;10(1):13873. doi: 10.1038/s41598-020-70544-7.

DOI:10.1038/s41598-020-70544-7
PMID:32807813
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7431549/
Abstract

Predicting and modeling human behavior and finding trends within human decision-making processes is a major problem of social science. Rock Paper Scissors (RPS) is the fundamental strategic question in many game theory problems and real-world competitions. Finding the right approach to beat a particular human opponent is challenging. Here we use an AI (artificial intelligence) algorithm based on Markov Models of one fixed memory length (abbreviated as "single AI") to compete against humans in an iterated RPS game. We model and predict human competition behavior by combining many Markov Models with different fixed memory lengths (abbreviated as "multi-AI"), and develop an architecture of multi-AI with changeable parameters to adapt to different competition strategies. We introduce a parameter called "focus length" (a positive number such as 5 or 10) to control the speed and sensitivity for our multi-AI to adapt to the opponent's strategy change. The focus length is the number of previous rounds that the multi-AI should look at when determining which Single-AI has the best performance and should choose to play for the next game. We experimented with 52 different people, each playing 300 rounds continuously against one specific multi-AI model, and demonstrated that our strategy could win against more than 95% of human opponents.

摘要

预测和模拟人类行为,发现人类决策过程中的趋势,是社会科学的一个主要问题。石头剪刀布(RPS)是许多博弈论问题和现实世界竞赛中的基本战略问题。找到击败特定人类对手的正确方法是具有挑战性的。在这里,我们使用基于马尔可夫模型的人工智能(AI)算法(简称“单 AI”),在迭代的 RPS 游戏中与人类竞争。我们通过结合具有不同固定记忆长度的多个马尔可夫模型来对人类竞争行为进行建模和预测(简称“多 AI”),并开发出一种具有可变参数的多 AI 架构,以适应不同的竞争策略。我们引入了一个名为“关注长度”(例如 5 或 10 这样的正整数)的参数,以控制多 AI 适应对手策略变化的速度和敏感性。关注长度是多 AI 在确定哪个单 AI 表现最佳并应选择在下一场比赛中使用时应该查看的前几轮的数量。我们对 52 名不同的人进行了实验,每个人连续与一个特定的多 AI 模型进行 300 轮比赛,结果表明,我们的策略可以击败 95%以上的人类对手。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2a2/7431549/fb62f58a94df/41598_2020_70544_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2a2/7431549/9d2ccfbaa42a/41598_2020_70544_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2a2/7431549/779cccd6f783/41598_2020_70544_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2a2/7431549/87fe03ca391b/41598_2020_70544_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2a2/7431549/fb62f58a94df/41598_2020_70544_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2a2/7431549/9d2ccfbaa42a/41598_2020_70544_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2a2/7431549/779cccd6f783/41598_2020_70544_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2a2/7431549/87fe03ca391b/41598_2020_70544_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2a2/7431549/fb62f58a94df/41598_2020_70544_Fig4_HTML.jpg

相似文献

1
Multi-AI competing and winning against humans in iterated Rock-Paper-Scissors game.多个人工智能在迭代的石头剪刀布游戏中与人类竞争并获胜。
Sci Rep. 2020 Aug 17;10(1):13873. doi: 10.1038/s41598-020-70544-7.
2
Social cycling and conditional responses in the Rock-Paper-Scissors game.石头剪刀布游戏中的社交循环与条件反应。
Sci Rep. 2014 Jul 25;4:5830. doi: 10.1038/srep05830.
3
Cyclic game dynamics driven by iterated reasoning.由迭代推理驱动的循环博弈动态。
PLoS One. 2013;8(2):e56416. doi: 10.1371/journal.pone.0056416. Epub 2013 Feb 18.
4
Pigeons (Columba livia) approach Nash equilibrium in experimental Matching Pennies competitions.在实验性的猜硬币博弈竞赛中,鸽子(家鸽)接近纳什均衡。
J Exp Anal Behav. 2009 Mar;91(2):169-83. doi: 10.1901/jeab.2009.91-169.
5
The Anterior Insula Tracks Behavioral Entropy during an Interpersonal Competitive Game.在前额叶脑岛在人际竞争游戏中追踪行为熵。
PLoS One. 2015 Jun 3;10(6):e0123329. doi: 10.1371/journal.pone.0123329. eCollection 2015.
6
Reinforcement learning and decision making in monkeys during a competitive game.猴子在竞争性游戏中的强化学习与决策
Brain Res Cogn Brain Res. 2004 Dec;22(1):45-58. doi: 10.1016/j.cogbrainres.2004.07.007.
7
Ambush strategy enhances organisms' performance in rock-paper-scissors games.埋伏策略增强了生物体在石头剪刀布游戏中的表现。
Biosystems. 2024 Jun;240:105229. doi: 10.1016/j.biosystems.2024.105229. Epub 2024 May 11.
8
Repeated rock, paper, scissors play reveals limits in adaptive sequential behavior.重复的石头剪刀布游戏揭示了自适应序列行为的局限性。
Cogn Psychol. 2024 Jun;151:101654. doi: 10.1016/j.cogpsych.2024.101654. Epub 2024 Apr 23.
9
Balancing model-based and memory-free action selection under competitive pressure.在竞争压力下平衡基于模型和无记忆的动作选择。
Elife. 2019 Oct 2;8:e48810. doi: 10.7554/eLife.48810.
10
Artificial intelligence framework for simulating clinical decision-making: a Markov decision process approach.人工智能框架模拟临床决策:马尔可夫决策过程方法。
Artif Intell Med. 2013 Jan;57(1):9-19. doi: 10.1016/j.artmed.2012.12.003. Epub 2012 Dec 31.

引用本文的文献

1
Deviation from Nash mixed equilibrium in repeated rock-scissors-paper reflect individual traits.在重复的石头剪刀布游戏中偏离纳什混合均衡反映了个体特征。
Sci Rep. 2025 Apr 29;15(1):14955. doi: 10.1038/s41598-025-95444-6.
2
Predicting rock-paper-scissors choices based on single-trial EEG signals.基于单次试验脑电图信号预测石头剪刀布的选择。
Psych J. 2024 Feb;13(1):19-30. doi: 10.1002/pchj.688. Epub 2023 Oct 31.

本文引用的文献

1
Markov models-Markov chains.马尔可夫模型——马尔可夫链。
Nat Methods. 2019 Aug;16(8):663-664. doi: 10.1038/s41592-019-0476-x.
2
Ecology: Contests between species aid biodiversity.生态学:物种间的竞争有助于生物多样性。
Nature. 2017 Aug 10;548(7666):166-167. doi: 10.1038/nature23103. Epub 2017 Jul 26.
3
Higher-order interactions stabilize dynamics in competitive network models.高阶相互作用稳定竞争网络模型中的动力学。
Nature. 2017 Aug 10;548(7666):210-213. doi: 10.1038/nature23273. Epub 2017 Jul 26.
4
Behavioural and neural modulation of win-stay but not lose-shift strategies as a function of outcome value in Rock, Paper, Scissors.在“石头、剪刀、布”游戏中,作为结果价值函数的赢则坚持而非输则改变策略的行为和神经调节。
Sci Rep. 2016 Sep 23;6:33809. doi: 10.1038/srep33809.
5
Stochastic Evolution Dynamic of the Rock-Scissors-Paper Game Based on a Quasi Birth and Death Process.基于拟生灭过程的石头剪刀布游戏的随机演化动力学。
Sci Rep. 2016 Jun 27;6:28585. doi: 10.1038/srep28585.
6
Microbiology: Taking the bad with the good.微生物学:好坏并存。
Nature. 2015 May 28;521(7553):431-2. doi: 10.1038/nature14525.
7
Social cycling and conditional responses in the Rock-Paper-Scissors game.石头剪刀布游戏中的社交循环与条件反应。
Sci Rep. 2014 Jul 25;4:5830. doi: 10.1038/srep05830.
8
A Markovian analysis of bacterial genome sequence constraints.细菌基因组序列约束的马尔可夫分析。
PeerJ. 2013 Aug 29;1:e127. doi: 10.7717/peerj.127. eCollection 2013.
9
Emerging of Stochastic Dynamical Equalities and Steady State Thermodynamics from Darwinian Dynamics.从达尔文动力学中涌现出随机动力学等式和稳态热力学。
Commun Theor Phys. 2008 May 15;49(5):1073-1090. doi: 10.1088/0253-6102/49/5/01.
10
Cyclic dominance and biodiversity in well-mixed populations.均匀混合种群中的循环优势与生物多样性
Phys Rev Lett. 2008 Feb 8;100(5):058104. doi: 10.1103/PhysRevLett.100.058104. Epub 2008 Feb 7.