• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

网络互惠性的强化学习解释

Reinforcement learning account of network reciprocity.

作者信息

Ezaki Takahiro, Masuda Naoki

机构信息

PRESTO, Japan Science and Technology Agency, 4-1-8 Honcho, Kawaguchi, Saitama, Japan.

Department of Engineering Mathematics, University of Bristol, Clifton, Bristol, United Kingdom.

出版信息

PLoS One. 2017 Dec 8;12(12):e0189220. doi: 10.1371/journal.pone.0189220. eCollection 2017.

DOI:10.1371/journal.pone.0189220
PMID:29220413
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5722284/
Abstract

Evolutionary game theory predicts that cooperation in social dilemma games is promoted when agents are connected as a network. However, when networks are fixed over time, humans do not necessarily show enhanced mutual cooperation. Here we show that reinforcement learning (specifically, the so-called Bush-Mosteller model) approximately explains the experimentally observed network reciprocity and the lack thereof in a parameter region spanned by the benefit-to-cost ratio and the node's degree. Thus, we significantly extend previously obtained numerical results.

摘要

进化博弈论预测,当参与者以网络形式相连时,社会困境博弈中的合作会得到促进。然而,当网络随时间固定不变时,人类不一定会表现出更强的相互合作。在此我们表明,强化学习(具体而言,即所谓的布什-莫斯特勒模型)在由收益成本比和节点度所跨越的参数区域内,近似地解释了实验观察到的网络互惠现象及其缺失情况。因此,我们显著扩展了先前获得的数值结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf04/5722284/004c72d72104/pone.0189220.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf04/5722284/c43b1a4e7a54/pone.0189220.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf04/5722284/0a1433ac674f/pone.0189220.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf04/5722284/004c72d72104/pone.0189220.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf04/5722284/c43b1a4e7a54/pone.0189220.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf04/5722284/0a1433ac674f/pone.0189220.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf04/5722284/004c72d72104/pone.0189220.g003.jpg

相似文献

1
Reinforcement learning account of network reciprocity.网络互惠性的强化学习解释
PLoS One. 2017 Dec 8;12(12):e0189220. doi: 10.1371/journal.pone.0189220. eCollection 2017.
2
Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin.强化学习解释了条件性合作及其喜怒无常的同类现象。
PLoS Comput Biol. 2016 Jul 20;12(7):e1005034. doi: 10.1371/journal.pcbi.1005034. eCollection 2016 Jul.
3
A theoretical analysis of temporal difference learning in the iterated prisoner's dilemma game.在迭代囚徒困境博弈中对时间差分学习的理论分析。
Bull Math Biol. 2009 Nov;71(8):1818-50. doi: 10.1007/s11538-009-9424-8. Epub 2009 May 29.
4
Universal scaling for the dilemma strength in evolutionary games.普适标度下进化博弈中的困境强度。
Phys Life Rev. 2015 Sep;14:1-30. doi: 10.1016/j.plrev.2015.04.033. Epub 2015 May 5.
5
A simple scaling of the effectiveness of supporting mutual cooperation in donor-recipient games by various reciprocity mechanisms.通过各种互惠机制对捐赠者-接受者博弈中支持相互合作有效性的一种简单量化。
Biosystems. 2009 Apr;96(1):29-34. doi: 10.1016/j.biosystems.2008.11.004. Epub 2008 Nov 19.
6
Numerical analysis of a reinforcement learning model with the dynamic aspiration level in the iterated Prisoner's dilemma.具有迭代囚徒困境中动态期望水平的强化学习模型的数值分析。
J Theor Biol. 2011 Jun 7;278(1):55-62. doi: 10.1016/j.jtbi.2011.03.005. Epub 2011 Mar 29.
7
Evolutionary dynamics of the traveler's dilemma and minimum-effort coordination games on complex networks.复杂网络上旅行者困境与最小努力协调博弈的演化动力学
Phys Rev E Stat Nonlin Soft Matter Phys. 2014 Oct;90(4):042134. doi: 10.1103/PhysRevE.90.042134. Epub 2014 Oct 22.
8
Evolutionary prisoner's dilemma game on graphs and social networks with external constraint.具有外部约束的图和社交网络上的进化囚徒困境博弈
J Theor Biol. 2014 Oct 7;358:122-31. doi: 10.1016/j.jtbi.2014.05.038. Epub 2014 Jun 5.
9
Difference of reciprocity effect in two coevolutionary models of presumed two-player and multiplayer games.假定的两人及多人博弈的两种协同进化模型中互惠效应的差异。
Phys Rev E Stat Nonlin Soft Matter Phys. 2013 Jun;87(6):062136. doi: 10.1103/PhysRevE.87.062136. Epub 2013 Jun 25.
10
Payoff-based learning explains the decline in cooperation in public goods games.基于回报的学习解释了公共物品博弈中合作行为的减少。
Proc Biol Sci. 2015 Feb 22;282(1801):20142678. doi: 10.1098/rspb.2014.2678.

引用本文的文献

1
Aspiration dynamics generate robust predictions in heterogeneous populations.吸气动力学在异质人群中产生稳健的预测。
Nat Commun. 2021 May 31;12(1):3250. doi: 10.1038/s41467-021-23548-4.
2
Competing for congestible goods: experimental evidence on parking choice.竞争可耗散商品:停车选择的实验证据。
Sci Rep. 2020 Nov 30;10(1):20803. doi: 10.1038/s41598-020-77711-w.
3
An experimental study of network effects on coordination in asymmetric games.网络效应对非对称博弈协调的实验研究。

本文引用的文献

1
Reinforcement learning accounts for moody conditional cooperation behavior: experimental results.强化学习解释了情绪多变的条件合作行为:实验结果。
Sci Rep. 2017 Jan 10;7:39275. doi: 10.1038/srep39275.
2
Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin.强化学习解释了条件性合作及其喜怒无常的同类现象。
PLoS Comput Biol. 2016 Jul 20;12(7):e1005034. doi: 10.1371/journal.pcbi.1005034. eCollection 2016 Jul.
3
Aspiration dynamics in structured population acts as if in a well-mixed one.结构化种群中的抽吸动态表现得如同在充分混合的种群中一样。
Sci Rep. 2019 May 2;9(1):6842. doi: 10.1038/s41598-019-43260-0.
Sci Rep. 2015 Jan 26;5:8014. doi: 10.1038/srep08014.
4
Static network structure can stabilize human cooperation.静态网络结构能够稳定人类合作。
Proc Natl Acad Sci U S A. 2014 Dec 2;111(48):17093-8. doi: 10.1073/pnas.1400406111. Epub 2014 Nov 17.
5
Aspiration dynamics of multi-player games in finite populations.有限种群中多人游戏的动机动态。
J R Soc Interface. 2014 Mar 5;11(94):20140077. doi: 10.1098/rsif.2014.0077. Print 2014 May 6.
6
Learning dynamics explains human behaviour in prisoner's dilemma on networks.学习动力学解释了网络囚徒困境中的人类行为。
J R Soc Interface. 2014 Feb 19;11(94):20131186. doi: 10.1098/rsif.2013.1186. Print 2014 May 6.
7
Quality versus quantity of social ties in experimental cooperative networks.实验合作网络中社会关系的质量与数量。
Nat Commun. 2013;4:2814. doi: 10.1038/ncomms3814.
8
Human cooperation.人类合作。
Trends Cogn Sci. 2013 Aug;17(8):413-25. doi: 10.1016/j.tics.2013.06.003. Epub 2013 Jul 13.
9
Contagion of Cooperation in Static and Fluid Social Networks.静态与动态社会网络中的合作传播
PLoS One. 2013 Jun 19;8(6):e66199. doi: 10.1371/journal.pone.0066199. Print 2013.
10
Evolutionary dynamics of group interactions on structured populations: a review.结构种群中群体相互作用的进化动态:综述。
J R Soc Interface. 2013 Jan 9;10(80):20120997. doi: 10.1098/rsif.2012.0997. Print 2013 Mar 6.