Evolving general cooperation with a Bayesian theory of mind.

Author information

Kleiman-Weiner Max, Vientós Alejandro, Rand David G, Tenenbaum Joshua B

Affiliations

Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139.

Department of Marketing and International Business, Foster School of Business, University of Washington, Seattle, WA 98195.

Publication information

Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2400993122. doi: 10.1073/pnas.2400993122. Epub 2025 Jun 16.

DOI:10.1073/pnas.2400993122

PMID:40523189

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12207496/

Abstract

Theories of the evolution of cooperation through reciprocity explain how unrelated self-interested individuals can accomplish more together than they can on their own. The most prominent theories of reciprocity, such as tit-for-tat or win-stay-lose-shift, are inflexible automata that lack a theory of mind, the human ability to infer the hidden mental states in others' minds. Here, we develop a model of reciprocity with a theory of mind, the Bayesian Reciprocator. When making decisions, this model does not simply seek to maximize its own payoff. Instead, it also values the payoffs of others, but only to the extent it believes that those others are also cooperating in the same way. To compute its beliefs about others, the Bayesian Reciprocator uses a probabilistic and generative approach to infer the latent preferences, beliefs, and strategies of others through interaction and observation. We evaluate the Bayesian Reciprocator using a generator over games where every interaction is unique, as well as in classic environments such as the iterated prisoner's dilemma. The Bayesian Reciprocator enables the emergence of both direct-reciprocity when games are repeated and indirect-reciprocity when interactions are one-shot but observable to others. In an evolutionary competition, the Bayesian Reciprocator outcompetes existing automata strategies and sustains cooperation across a larger range of environments and noise settings than prior approaches. This work quantifies the advantage of a theory of mind for cooperation in an evolutionary game theoretic framework and suggests avenues for building artificially intelligent agents with more human-like learning mechanisms that can cooperate across many environments.
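
The two ideas in the abstract, inferring a partner's hidden type from observed actions and weighting the partner's payoff by that belief, can be illustrated with a toy sketch. This is not the authors' model: the two partner types, the cooperation probabilities in `P_COOP`, the payoff numbers in `PAYOFFS`, and all function names are assumptions made for illustration, and the real Bayesian Reciprocator infers richer latent preferences, beliefs, and strategies.

```python
# Illustrative prisoner's dilemma payoffs: (my payoff, partner payoff).
# These numbers are assumptions for the sketch, not values from the paper.
PAYOFFS = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}

# Assumed probability that each hypothetical partner type cooperates.
P_COOP = {"reciprocator": 0.95, "selfish": 0.05}


def update_belief(belief, partner_action):
    """Return P(partner is a reciprocator) after one observed action (Bayes' rule)."""
    def likelihood(kind):
        p = P_COOP[kind]
        return p if partner_action == "C" else 1.0 - p

    numerator = likelihood("reciprocator") * belief
    evidence = numerator + likelihood("selfish") * (1.0 - belief)
    return numerator / evidence


def expected_utility(my_action, belief):
    """Own payoff plus the partner's payoff, weighted by the belief in the partner."""
    # Predicted chance that the partner cooperates, marginalized over types.
    p_c = belief * P_COOP["reciprocator"] + (1.0 - belief) * P_COOP["selfish"]
    total = 0.0
    for partner_action, prob in (("C", p_c), ("D", 1.0 - p_c)):
        mine, theirs = PAYOFFS[(my_action, partner_action)]
        total += prob * (mine + belief * theirs)
    return total


def choose_action(belief):
    """Pick the action with the higher belief-weighted expected utility."""
    return max(("C", "D"), key=lambda a: expected_utility(a, belief))


if __name__ == "__main__":
    belief = 0.5  # uninformed prior over the partner's type
    for observed in ["C", "C", "D", "D", "D"]:
        belief = update_belief(belief, observed)
        print(observed, round(belief, 3), choose_action(belief))
```

In this sketch the agent cooperates while it believes the partner is a reciprocator and stops once repeated defections push that belief down, which is the qualitative behavior the abstract describes. Because the belief is updated from any observed action, the same update could in principle be fed by watching a partner's interactions with third parties, the setting the abstract calls indirect reciprocity.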

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2857/12207496/04dc5d943da7/pnas.2400993122fig01.jpg
