Reinforcement learning accounts for moody conditional cooperation behavior: experimental results.

Affiliations

National Institute of Informatics, 2-1-2 Hitotsubashi, Chiyoda-ku, Tokyo 101-8430, Japan.

JST, ERATO, Kawarabayashi large graph project, c/o Global Research Center for Big Data Mathematics, NII, 2-1-2 Hitotsubashi, Chiyoda-ku, Tokyo 101-8430, Japan.

Publication information

Sci Rep. 2017 Jan 10;7:39275. doi: 10.1038/srep39275.

DOI:10.1038/srep39275
PMID:28071646
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5223288/
Abstract

In social dilemma games, human participants often show conditional cooperation (CC) behavior or its variant called moody conditional cooperation (MCC), in which they tend to cooperate when many of their peers have previously cooperated. Recent computational studies showed that CC and MCC behavioral patterns could be explained by reinforcement learning. In the present study, we use a repeated multiplayer prisoner's dilemma game and the repeated public goods game played by human participants to examine whether MCC is observed across different types of games and whether reinforcement learning can explain the observed behavior. We observed MCC behavior in both games, but the MCC that we observed was different from that observed in past experiments. In the present study, whether or not a focal participant cooperated previously affected the overall level of cooperation, instead of changing the tendency to cooperate in response to the cooperation of other participants in the previous time step. We found that, across different conditions, reinforcement learning models were approximately as accurate as an MCC model in describing the experimental results. Consistent with previous computational studies, the present results suggest that reinforcement learning may be a major proximate mechanism governing MCC behavior.
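
The distinction drawn in the abstract, between a participant's previous action shifting the overall level of cooperation and changing the response to peers' cooperation, can be illustrated with a minimal sketch. It assumes a linear response function, which the abstract does not specify; the function name and all parameter values are hypothetical and chosen only for illustration, not taken from the paper.

```python
def mcc_cooperation_probability(prev_cooperated, frac_others_cooperated,
                                intercept_after_c=0.6,
                                intercept_after_d=0.3,
                                slope=0.2):
    """Hypothetical linear MCC response (illustrative values only).

    prev_cooperated: whether the focal participant cooperated last round.
    frac_others_cooperated: fraction of peers (0 to 1) who cooperated last round.
    The previous action selects the intercept (overall level of cooperation);
    the slope (response to peers) is the same in both cases.
    """
    intercept = intercept_after_c if prev_cooperated else intercept_after_d
    p = intercept + slope * frac_others_cooperated
    # Clip so the result is always a valid probability.
    return min(1.0, max(0.0, p))


if __name__ == "__main__":
    for frac in (0.0, 0.5, 1.0):
        after_c = mcc_cooperation_probability(True, frac)
        after_d = mcc_cooperation_probability(False, frac)
        print(f"peers cooperating: {frac:.1f}  "
              f"after C: {after_c:.2f}  after D: {after_d:.2f}")
```

In this sketch the two response lines are parallel: the gap between "after cooperation" and "after defection" stays the same regardless of how many peers cooperated. The pattern reported in earlier experiments would instead correspond to different slopes for the two cases.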

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/563d/5223288/57cc61cd705e/srep39275-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/563d/5223288/9a9703fb3136/srep39275-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/563d/5223288/91c128075c61/srep39275-f2.jpg
