• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

以最小的奖励转移解决社会困境。

Resolving social dilemmas with minimal reward transfer.

作者信息

Willis Richard, Du Yali, Leibo Joel Z, Luck Michael

机构信息

King's College London, London, UK.

Google DeepMind, London, UK.

出版信息

Auton Agent Multi Agent Syst. 2024;38(2):49. doi: 10.1007/s10458-024-09675-4. Epub 2024 Oct 12.

DOI:10.1007/s10458-024-09675-4
PMID:39398194
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11469975/
Abstract

Social dilemmas present a significant challenge in multi-agent cooperation because individuals are incentivised to behave in ways that undermine socially optimal outcomes. Consequently, self-interested agents often avoid collective behaviour. In response, we formalise social dilemmas and introduce a novel metric, the , to quantify the disparity between individual and group rationality in such scenarios. This metric represents the maximum proportion of their individual rewards that agents can retain while ensuring that a social welfare optimum becomes a dominant strategy. Our approach diverges from traditional concepts of altruism, instead focusing on strategic reward redistribution. By transferring rewards among agents in a manner that aligns individual and group incentives, rational agents will maximise collective welfare while pursuing their own interests. We provide an algorithm to compute efficient transfer structures for an arbitrary number of agents, and introduce novel multi-player social dilemma games to illustrate the effectiveness of our method. This work provides both a descriptive tool for analysing social dilemmas and a prescriptive solution for resolving them via efficient reward transfer contracts. Applications include mechanism design, where we can assess the impact on collaborative behaviour of modifications to models of environments.

摘要

社会困境在多智能体合作中构成了重大挑战,因为个体受激励采取的行为方式会破坏社会最优结果。因此,自利的智能体常常回避集体行为。作为回应,我们将社会困境形式化,并引入一种新颖的度量标准—— ,以量化此类场景中个体理性与群体理性之间的差异。该度量标准表示智能体在确保社会福利最优成为主导策略的同时,能够保留的个体奖励的最大比例。我们的方法有别于传统的利他主义概念,而是专注于策略性奖励再分配。通过以一种使个体激励与群体激励相一致的方式在智能体之间转移奖励,理性智能体在追求自身利益的同时将使集体福利最大化。我们提供了一种算法来为任意数量的智能体计算有效的转移结构,并引入新颖的多人社会困境博弈来阐释我们方法的有效性。这项工作既提供了一种用于分析社会困境的描述性工具,也提供了一种通过高效奖励转移合同来解决这些困境的规范性解决方案。其应用包括机制设计,在此我们可以评估对环境模型进行修改对协作行为的影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611d/11469975/0f9ee7f36b0e/10458_2024_9675_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611d/11469975/866c02d7dc1e/10458_2024_9675_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611d/11469975/44c3d27940c1/10458_2024_9675_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611d/11469975/0f9ee7f36b0e/10458_2024_9675_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611d/11469975/866c02d7dc1e/10458_2024_9675_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611d/11469975/44c3d27940c1/10458_2024_9675_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611d/11469975/0f9ee7f36b0e/10458_2024_9675_Fig2_HTML.jpg

相似文献

1
Resolving social dilemmas with minimal reward transfer.以最小的奖励转移解决社会困境。
Auton Agent Multi Agent Syst. 2024;38(2):49. doi: 10.1007/s10458-024-09675-4. Epub 2024 Oct 12.
2
Valuing rewards to others in a prisoner's dilemma game.在囚徒困境博弈中重视给予他人的奖励。
Behav Processes. 2013 Oct;99:145-9. doi: 10.1016/j.beproc.2013.07.008. Epub 2013 Jul 19.
3
Reward, punishment, and cooperation: a meta-analysis.奖励、惩罚与合作:一项元分析。
Psychol Bull. 2011 Jul;137(4):594-615. doi: 10.1037/a0023489.
4
Components of reward in social dilemmas.
J Pers Soc Psychol. 1985 Feb;48(2):364-73. doi: 10.1037//0022-3514.48.2.364.
5
Antisocial pool rewarding does not deter public cooperation.反社会群体奖励并不能阻止公众合作。
Proc Biol Sci. 2015 Oct 7;282(1816):20151975. doi: 10.1098/rspb.2015.1975.
6
Generalized Social Dilemmas: The Evolution of Cooperation in Populations with Variable Group Size.广义社会困境:变群组大小的群体中合作的演化。
Bull Math Biol. 2019 Nov;81(11):4643-4674. doi: 10.1007/s11538-018-00545-1. Epub 2018 Dec 17.
7
Symmetry warrants rational cooperation by co-action in Social Dilemmas.在社会困境中,对称性保证了通过共同行动进行合理合作。
Sci Rep. 2015 Aug 12;5:13071. doi: 10.1038/srep13071.
8
The evolution of anti-social rewarding and its countermeasures in public goods games.公共物品博弈中反社会奖励行为的演变及其对策
Proc Biol Sci. 2015 Jan 7;282(1798):20141994. doi: 10.1098/rspb.2014.1994.
9
Exit, punishment and rewards in commons dilemmas: an experimental study.公共困境中的退出、惩罚与奖励:一项实验研究。
PLoS One. 2013 Aug 1;8(8):e69871. doi: 10.1371/journal.pone.0069871. Print 2013.
10
Dilemma of dilemmas: how collective and individual perspectives can clarify the size dilemma in voluntary linear public goods dilemmas.两难困境中的两难:集体视角与个体视角如何阐明自愿线性公共物品两难中的规模困境
PLoS One. 2015 Mar 23;10(3):e0120379. doi: 10.1371/journal.pone.0120379. eCollection 2015.

本文引用的文献

1
Institutional incentives for the evolution of committed cooperation: ensuring participation is as important as enhancing compliance.制度激励对于坚定合作的演变:确保参与与增强合规性同样重要。
J R Soc Interface. 2022 Mar;19(188):20220036. doi: 10.1098/rsif.2022.0036. Epub 2022 Mar 23.
2
From extortion to generosity, evolution in the Iterated Prisoner's Dilemma.从敲诈勒索到慷慨大方,重复囚徒困境中的进化。
Proc Natl Acad Sci U S A. 2013 Sep 17;110(38):15348-53. doi: 10.1073/pnas.1306246110. Epub 2013 Sep 3.
3
Evolution of extortion in Iterated Prisoner's Dilemma games.
重复囚徒困境博弈中的敲诈勒索行为的演变。
Proc Natl Acad Sci U S A. 2013 Apr 23;110(17):6913-8. doi: 10.1073/pnas.1214834110. Epub 2013 Apr 9.
4
Iterated Prisoner's Dilemma contains strategies that dominate any evolutionary opponent.迭代囚徒困境包含了能够支配任何进化对手的策略。
Proc Natl Acad Sci U S A. 2012 Jun 26;109(26):10409-13. doi: 10.1073/pnas.1206569109. Epub 2012 May 21.
5
The evolution of antisocial punishment in optional public goods games.可选公共物品博弈中反社会惩罚的演变。
Nat Commun. 2011 Aug 16;2:434. doi: 10.1038/ncomms1442.
6
Five rules for the evolution of cooperation.合作进化的五条规则。
Science. 2006 Dec 8;314(5805):1560-3. doi: 10.1126/science.1133755.
7
The leading eight: social norms that can maintain cooperation by indirect reciprocity.排名前八位的:能够通过间接互惠维持合作的社会规范。
J Theor Biol. 2006 Apr 21;239(4):435-44. doi: 10.1016/j.jtbi.2005.08.008. Epub 2005 Sep 19.
8
Evolutionary dynamics of biological games.生物博弈的进化动力学
Science. 2004 Feb 6;303(5659):793-9. doi: 10.1126/science.1093411.
9
The struggle to govern the commons.对公地的治理斗争。
Science. 2003 Dec 12;302(5652):1907-12. doi: 10.1126/science.1091015.
10
Learning dynamics in social dilemmas.社会困境中的学习动态
Proc Natl Acad Sci U S A. 2002 May 14;99 Suppl 3(Suppl 3):7229-36. doi: 10.1073/pnas.092080099.