网络互惠性的强化学习解释

Reinforcement learning account of network reciprocity.

作者信息

Ezaki Takahiro, Masuda Naoki

机构信息

PRESTO, Japan Science and Technology Agency, 4-1-8 Honcho, Kawaguchi, Saitama, Japan.

Department of Engineering Mathematics, University of Bristol, Clifton, Bristol, United Kingdom.

出版信息

PLoS One. 2017 Dec 8;12(12):e0189220. doi: 10.1371/journal.pone.0189220. eCollection 2017.

DOI:10.1371/journal.pone.0189220

PMID:29220413

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5722284/

Abstract

Evolutionary game theory predicts that cooperation in social dilemma games is promoted when agents are connected as a network. However, when networks are fixed over time, humans do not necessarily show enhanced mutual cooperation. Here we show that reinforcement learning (specifically, the so-called Bush-Mosteller model) approximately explains the experimentally observed network reciprocity and the lack thereof in a parameter region spanned by the benefit-to-cost ratio and the node's degree. Thus, we significantly extend previously obtained numerical results.

摘要

进化博弈论预测，当参与者以网络形式相连时，社会困境博弈中的合作会得到促进。然而，当网络随时间固定不变时，人类不一定会表现出更强的相互合作。在此我们表明，强化学习（具体而言，即所谓的布什-莫斯特勒模型）在由收益成本比和节点度所跨越的参数区域内，近似地解释了实验观察到的网络互惠现象及其缺失情况。因此，我们显著扩展了先前获得的数值结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf04/5722284/c43b1a4e7a54/pone.0189220.g001.jpg

相似文献

Reinforcement learning account of network reciprocity.

PLoS One. 2017 Dec 8;12(12):e0189220. doi: 10.1371/journal.pone.0189220. eCollection 2017.

Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin.

PLoS Comput Biol. 2016 Jul 20;12(7):e1005034. doi: 10.1371/journal.pcbi.1005034. eCollection 2016 Jul.

A theoretical analysis of temporal difference learning in the iterated prisoner's dilemma game.

Bull Math Biol. 2009 Nov;71(8):1818-50. doi: 10.1007/s11538-009-9424-8. Epub 2009 May 29.

Universal scaling for the dilemma strength in evolutionary games.

Phys Life Rev. 2015 Sep;14:1-30. doi: 10.1016/j.plrev.2015.04.033. Epub 2015 May 5.

A simple scaling of the effectiveness of supporting mutual cooperation in donor-recipient games by various reciprocity mechanisms.

Biosystems. 2009 Apr;96(1):29-34. doi: 10.1016/j.biosystems.2008.11.004. Epub 2008 Nov 19.

Numerical analysis of a reinforcement learning model with the dynamic aspiration level in the iterated Prisoner's dilemma.

J Theor Biol. 2011 Jun 7;278(1):55-62. doi: 10.1016/j.jtbi.2011.03.005. Epub 2011 Mar 29.

Evolutionary dynamics of the traveler's dilemma and minimum-effort coordination games on complex networks.

Phys Rev E Stat Nonlin Soft Matter Phys. 2014 Oct;90(4):042134. doi: 10.1103/PhysRevE.90.042134. Epub 2014 Oct 22.

Evolutionary prisoner's dilemma game on graphs and social networks with external constraint.

J Theor Biol. 2014 Oct 7;358:122-31. doi: 10.1016/j.jtbi.2014.05.038. Epub 2014 Jun 5.

Difference of reciprocity effect in two coevolutionary models of presumed two-player and multiplayer games.

Phys Rev E Stat Nonlin Soft Matter Phys. 2013 Jun;87(6):062136. doi: 10.1103/PhysRevE.87.062136. Epub 2013 Jun 25.

Payoff-based learning explains the decline in cooperation in public goods games.

Proc Biol Sci. 2015 Feb 22;282(1801):20142678. doi: 10.1098/rspb.2014.2678.

引用本文的文献

Aspiration dynamics generate robust predictions in heterogeneous populations.

Nat Commun. 2021 May 31;12(1):3250. doi: 10.1038/s41467-021-23548-4.

Competing for congestible goods: experimental evidence on parking choice.

Sci Rep. 2020 Nov 30;10(1):20803. doi: 10.1038/s41598-020-77711-w.

An experimental study of network effects on coordination in asymmetric games.

Sci Rep. 2019 May 2;9(1):6842. doi: 10.1038/s41598-019-43260-0.

本文引用的文献

Reinforcement learning accounts for moody conditional cooperation behavior: experimental results.

Sci Rep. 2017 Jan 10;7:39275. doi: 10.1038/srep39275.

Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin.

PLoS Comput Biol. 2016 Jul 20;12(7):e1005034. doi: 10.1371/journal.pcbi.1005034. eCollection 2016 Jul.

Aspiration dynamics in structured population acts as if in a well-mixed one.

Sci Rep. 2015 Jan 26;5:8014. doi: 10.1038/srep08014.

Static network structure can stabilize human cooperation.

Proc Natl Acad Sci U S A. 2014 Dec 2;111(48):17093-8. doi: 10.1073/pnas.1400406111. Epub 2014 Nov 17.

Aspiration dynamics of multi-player games in finite populations.

J R Soc Interface. 2014 Mar 5;11(94):20140077. doi: 10.1098/rsif.2014.0077. Print 2014 May 6.

Learning dynamics explains human behaviour in prisoner's dilemma on networks.

J R Soc Interface. 2014 Feb 19;11(94):20131186. doi: 10.1098/rsif.2013.1186. Print 2014 May 6.

Quality versus quantity of social ties in experimental cooperative networks.

Nat Commun. 2013;4:2814. doi: 10.1038/ncomms3814.

Human cooperation.

Trends Cogn Sci. 2013 Aug;17(8):413-25. doi: 10.1016/j.tics.2013.06.003. Epub 2013 Jul 13.

Contagion of Cooperation in Static and Fluid Social Networks.

PLoS One. 2013 Jun 19;8(6):e66199. doi: 10.1371/journal.pone.0066199. Print 2013.

Evolutionary dynamics of group interactions on structured populations: a review.

J R Soc Interface. 2013 Jan 9;10(80):20120997. doi: 10.1098/rsif.2012.0997. Print 2013 Mar 6.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

网络互惠性的强化学习解释

Reinforcement learning account of network reciprocity.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献