Suppr超能文献

与贝叶斯心理理论不断发展的一般合作。

Evolving general cooperation with a Bayesian theory of mind.

作者信息

Kleiman-Weiner Max, Vientós Alejandro, Rand David G, Tenenbaum Joshua B

机构信息

Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139.

Department of Marketing and International Business, Foster School of Business, University of Washington, Seattle, WA 98195.

出版信息

Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2400993122. doi: 10.1073/pnas.2400993122. Epub 2025 Jun 16.

Abstract

Theories of the evolution of cooperation through reciprocity explain how unrelated self-interested individuals can accomplish more together than they can on their own. The most prominent theories of reciprocity, such as tit-for-tat or win-stay-lose-shift, are inflexible automata that lack a theory of mind-the human ability to infer the hidden mental states in others' minds. Here, we develop a model of reciprocity with a theory of mind, the Bayesian Reciprocator. When making decisions, this model does not simply seek to maximize its own payoff. Instead, it also values the payoffs of others-but only to the extent it believes that those others are also cooperating in the same way. To compute its beliefs about others, the Bayesian Reciprocator uses a probabilistic and generative approach to infer the latent preferences, beliefs, and strategies of others through interaction and observation. We evaluate the Bayesian Reciprocator using a generator over games where every interaction is unique, as well as in classic environments such as the iterated prisoner's dilemma. The Bayesian Reciprocator enables the emergence of both direct-reciprocity when games are repeated and indirect-reciprocity when interactions are one-shot but observable to others. In an evolutionary competition, the Bayesian Reciprocator outcompetes existing automata strategies and sustains cooperation across a larger range of environments and noise settings than prior approaches. This work quantifies the advantage of a theory of mind for cooperation in an evolutionary game theoretic framework and suggests avenues for building artificially intelligent agents with more human-like learning mechanisms that can cooperate across many environments.

摘要

通过互惠实现合作的进化理论解释了不相关的自利个体如何共同完成比独自完成更多的事情。最著名的互惠理论,如以牙还牙或赢则继续输则改变,都是缺乏心智理论的僵化自动机制——人类推断他人头脑中隐藏心理状态的能力。在这里,我们开发了一种具有心智理论的互惠模型,即贝叶斯互惠者。在做决策时,这个模型不仅仅寻求最大化自身收益。相反,它也重视他人的收益——但仅限于它认为这些他人也以同样方式合作的程度。为了计算其对他人的信念,贝叶斯互惠者使用一种概率性的生成方法,通过互动和观察来推断他人潜在的偏好、信念和策略。我们在一个游戏生成器中评估贝叶斯互惠者,其中每一次互动都是独特的,同时也在经典环境中进行评估,比如重复囚徒困境。贝叶斯互惠者在游戏重复时能促成直接互惠,在互动是一次性但他人可观察到时能促成间接互惠。在一场进化竞争中,贝叶斯互惠者胜过现有的自动机制策略,并且在比先前方法更大范围的环境和噪声设置中维持合作。这项工作量化了在进化博弈论框架中,心智理论对合作的优势,并为构建具有更类似人类学习机制、能够在多种环境中合作的人工智能主体指明了方向。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2857/12207496/04dc5d943da7/pnas.2400993122fig01.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验