Playing repeated games with large language models.

Author Information

Akata Elif, Schulz Lion, Coda-Forno Julian, Oh Seong Joon, Bethge Matthias, Schulz Eric

Affiliations

Institute for Human-Centered AI, Helmholtz Munich, Oberschleißheim, Germany.

Max Planck Institute for Biological Cybernetics, Tübingen, Germany.

Publication Information

Nat Hum Behav. 2025 May 8. doi: 10.1038/s41562-025-02172-y.

Abstract

Large language models (LLMs) are increasingly used in applications where they interact with humans and other agents. We propose to use behavioural game theory to study LLMs' cooperation and coordination behaviour. Here we let different LLMs play finitely repeated 2 × 2 games with each other, with human-like strategies and with actual human players. Our results show that LLMs perform particularly well at self-interested games such as the iterated Prisoner's Dilemma family. However, they behave suboptimally in games that require coordination, such as the Battle of the Sexes. We verify that these behavioural signatures are stable across robustness checks. We also show how GPT-4's behaviour can be modulated by providing additional information about its opponent and by using a 'social chain-of-thought' strategy. This also leads to better scores and more successful coordination when interacting with human players. These results enrich our understanding of LLMs' social behaviour and pave the way for a behavioural game theory for machines.
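To make the setup concrete, below is a minimal Python sketch of a finitely repeated 2 × 2 game of the kind the abstract describes. The payoff values and the stand-in strategies (tit-for-tat, always-defect, and a fixed-preference player) are standard textbook choices for illustration only; they are assumptions, not the payoffs, prompts, or LLM players used in the paper.

# Minimal sketch of a finitely repeated 2x2 game (illustrative only).
# Payoffs are textbook values, not necessarily those used in the paper.

# payoffs[(row_action, col_action)] = (row_payoff, col_payoff)
PRISONERS_DILEMMA = {
    ("C", "C"): (3, 3),  # mutual cooperation
    ("C", "D"): (0, 5),  # row cooperates, column defects
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),  # mutual defection
}

BATTLE_OF_THE_SEXES = {
    ("F", "F"): (3, 2),  # both choose row player's preferred option
    ("F", "B"): (0, 0),  # miscoordination yields nothing
    ("B", "F"): (0, 0),
    ("B", "B"): (2, 3),  # both choose column player's preferred option
}

def tit_for_tat(own_history, opp_history):
    """Cooperate first, then copy the opponent's previous move."""
    return "C" if not opp_history else opp_history[-1]

def always_defect(own_history, opp_history):
    """Defect in every round."""
    return "D"

def stubborn(action):
    """Return a strategy that always plays a fixed action."""
    def strategy(own_history, opp_history):
        return action
    return strategy

def play_repeated_game(payoffs, strategy_row, strategy_col, rounds=10):
    """Play a finitely repeated 2x2 game; return cumulative payoffs."""
    row_hist, col_hist = [], []
    row_score = col_score = 0
    for _ in range(rounds):
        a = strategy_row(row_hist, col_hist)  # each player sees both
        b = strategy_col(col_hist, row_hist)  # histories, own first
        pa, pb = payoffs[(a, b)]
        row_score += pa
        col_score += pb
        row_hist.append(a)
        col_hist.append(b)
    return row_score, col_score

if __name__ == "__main__":
    # Self-interested game: tit-for-tat punishes a persistent defector.
    print(play_repeated_game(PRISONERS_DILEMMA, tit_for_tat, always_defect))
    # Coordination game: two players who insist on their own preference
    # never coordinate and both score zero, the failure mode the paper
    # reports for LLMs in the Battle of the Sexes.
    print(play_repeated_game(BATTLE_OF_THE_SEXES, stubborn("F"), stubborn("B")))

In the paper's actual experiments, the strategy functions above are replaced by LLMs that receive the payoff matrix and the history of play in a prompt and output their next action each round.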
