• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

太多厨子:用于协调多主体协作的贝叶斯推断。

Too Many Cooks: Bayesian Inference for Coordinating Multi-Agent Collaboration.

机构信息

Department of Brain & Cognitive Sciences, Massachusetts Institute of Technology.

Department of Electrical Engineering & Computer Science, Massachusetts Institute of Technology.

出版信息

Top Cogn Sci. 2021 Apr;13(2):414-432. doi: 10.1111/tops.12525. Epub 2021 Apr 7.

DOI:10.1111/tops.12525
PMID:33829670
Abstract

Collaboration requires agents to coordinate their behavior on the fly, sometimes cooperating to solve a single task together and other times dividing it up into sub-tasks to work on in parallel. Underlying the human ability to collaborate is theory-of-mind (ToM), the ability to infer the hidden mental states that drive others to act. Here, we develop Bayesian Delegation, a decentralized multi-agent learning mechanism with these abilities. Bayesian Delegation enables agents to rapidly infer the hidden intentions of others by inverse planning. We test Bayesian Delegation in a suite of multi-agent Markov decision processes inspired by cooking problems. On these tasks, agents with Bayesian Delegation coordinate both their high-level plans (e.g., what sub-task they should work on) and their low-level actions (e.g., avoiding getting in each other's way). When matched with partners that act using the same algorithm, Bayesian Delegation outperforms alternatives. Bayesian Delegation is also a capable ad hoc collaborator and successfully coordinates with other agent types even in the absence of prior experience. Finally, in a behavioral experiment, we show that Bayesian Delegation makes inferences similar to human observers about the intent of others. Together, these results argue for the centrality of ToM for successful decentralized multi-agent collaboration.

摘要

协作要求代理在执行过程中协调其行为,有时共同合作完成单个任务,有时将其划分为子任务并行处理。人类协作的基础是心理理论(ToM),即推断驱使他人行动的隐藏心理状态的能力。在这里,我们开发了贝叶斯委托(Bayesian Delegation),这是一种具有这些能力的去中心化多代理学习机制。贝叶斯委托通过逆规划使代理能够快速推断他人的隐藏意图。我们在一系列受烹饪问题启发的多代理马尔可夫决策过程中测试了贝叶斯委托。在这些任务中,具有贝叶斯委托的代理协调他们的高级计划(例如,他们应该从事哪个子任务)和他们的低级行动(例如,避免相互妨碍)。当与使用相同算法的合作伙伴进行匹配时,贝叶斯委托的表现优于替代方案。贝叶斯委托也是一个有能力的临时协作者,即使在没有先前经验的情况下,也可以成功地与其他代理类型进行协调。最后,在行为实验中,我们表明贝叶斯委托对他人意图的推断与人类观察者相似。总之,这些结果表明心理理论对于成功的去中心化多代理协作至关重要。

相似文献

1
Too Many Cooks: Bayesian Inference for Coordinating Multi-Agent Collaboration.太多厨子:用于协调多主体协作的贝叶斯推断。
Top Cogn Sci. 2021 Apr;13(2):414-432. doi: 10.1111/tops.12525. Epub 2021 Apr 7.
2
Coordination as inference in multi-agent reinforcement learning.多智能体强化学习中的协调作为推理。
Neural Netw. 2024 Apr;172:106101. doi: 10.1016/j.neunet.2024.106101. Epub 2024 Jan 11.
3
Inferring User Intent using Bayesian Theory of Mind in Shared Avatar-Agent Virtual Environments.在共享化身-代理虚拟环境中使用贝叶斯心理理论推断用户意图。
IEEE Trans Vis Comput Graph. 2019 May;25(5):2113-2122. doi: 10.1109/TVCG.2019.2898800. Epub 2019 Feb 14.
4
The developmental origins of naïve psychology in infancy.婴儿期朴素心理学的发展起源。
Adv Child Dev Behav. 2009;37:55-104. doi: 10.1016/s0065-2407(09)03702-1.
5
Introducing tomsup: Theory of mind simulations using Python.介绍 tomsup:使用 Python 进行心理理论模拟。
Behav Res Methods. 2023 Aug;55(5):2197-2231. doi: 10.3758/s13428-022-01827-2. Epub 2022 Aug 11.
6
Win-Stay, Lose-Sample: a simple sequential algorithm for approximating Bayesian inference.赢则保留,输则抽样:一种用于近似贝叶斯推断的简单序贯算法。
Cogn Psychol. 2014 Nov;74:35-65. doi: 10.1016/j.cogpsych.2014.06.003. Epub 2014 Aug 1.
7
Deconstructing Theory-of-Mind Impairment in High-Functioning Adults with Autism.自闭症高功能成人心理理论损伤的解构。
Curr Biol. 2019 Feb 4;29(3):513-519.e6. doi: 10.1016/j.cub.2018.12.039. Epub 2019 Jan 24.
8
Reading people's minds from emotion expressions in interdependent decision making.从相互依存决策中的情绪表达中读取人们的想法。
J Pers Soc Psychol. 2014 Jan;106(1):73-88. doi: 10.1037/a0034251. Epub 2013 Sep 30.
9
Exploring theory of mind after severe traumatic brain injury.探究严重创伤性脑损伤后的心理理论。
Cortex. 2010 Oct;46(9):1088-99. doi: 10.1016/j.cortex.2009.08.014. Epub 2009 Sep 15.
10
On computational models of theory of mind and the imitative reinforcement learning in spiking neural networks.关于心理理论的计算模型和尖峰神经网络中的模仿强化学习。
Sci Rep. 2024 Jan 23;14(1):1945. doi: 10.1038/s41598-024-52299-7.

引用本文的文献

1
Towards fluid human-agent collaboration: From dynamic collaboration patterns to models of theory of mind reasoning.迈向灵活的人机协作:从动态协作模式到心理理论推理模型。
Front Robot AI. 2025 Aug 1;12:1532693. doi: 10.3389/frobt.2025.1532693. eCollection 2025.
2
Human intergroup coordination in a hierarchical multi-agent sensorimotor task arises from concurrent co-optimization.在分层多智能体感觉运动任务中,人类群体间协调源于并发协同优化。
Sci Rep. 2025 Apr 28;15(1):14849. doi: 10.1038/s41598-025-97574-3.
3
The SocialAI school: a framework leveraging developmental psychology toward artificial socio-cultural agents.
社会人工智能学派:一个利用发展心理学构建人工社会文化智能体的框架。
Front Neurorobot. 2024 Oct 9;18:1396359. doi: 10.3389/fnbot.2024.1396359. eCollection 2024.
4
Building machines that learn and think with people.与人类一起学习和思考的机器。
Nat Hum Behav. 2024 Oct;8(10):1851-1863. doi: 10.1038/s41562-024-01991-9. Epub 2024 Oct 22.
5
Group Coordination Catalyzes Individual and Cultural Intelligence.团队协作催化个人与文化智慧。
Open Mind (Camb). 2024 Aug 31;8:1037-1057. doi: 10.1162/opmi_a_00155. eCollection 2024.
6
Enhancement of joint flanker effect in intergroup competition.群体间竞争中联合侧翼效应的增强。
Psych J. 2025 Feb;14(1):94-102. doi: 10.1002/pchj.796. Epub 2024 Aug 21.
7
Using games to understand the mind.用游戏了解心智。
Nat Hum Behav. 2024 Jun;8(6):1035-1043. doi: 10.1038/s41562-024-01878-9. Epub 2024 Jun 21.
8
Emotion prediction as computation over a generative theory of mind.情绪预测作为一种生成心智理论的计算。
Philos Trans A Math Phys Eng Sci. 2023 Jul 24;381(2251):20220047. doi: 10.1098/rsta.2022.0047. Epub 2023 Jun 5.
9
Visuo-motor interference is modulated by task interactivity: A kinematic study.视动干扰受任务交互性的调节:一项运动学研究。
Psychon Bull Rev. 2023 Oct;30(5):1788-1801. doi: 10.3758/s13423-023-02297-z. Epub 2023 May 1.
10
Collaborative decision making is grounded in representations of other people's competence and effort.协作决策基于对他人能力和努力的表现。
J Exp Psychol Gen. 2023 Jun;152(6):1565-1579. doi: 10.1037/xge0001336. Epub 2023 Mar 6.