• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

亲社会学习:基于模型还是无模型?

Prosocial learning: Model-based or model-free?

作者信息

Navidi Parisa, Saeedpour Sepehr, Ershadmanesh Sara, Hossein Mostafa Miandari, Bahrami Bahador

机构信息

Department of Cognitive Psychology, Institute for Cognitive Science Studies, Tehran, Iran.

Department of Electrical and Computer Engineering, University of Tehran, Tehran, Iran.

出版信息

PLoS One. 2023 Jun 23;18(6):e0287563. doi: 10.1371/journal.pone.0287563. eCollection 2023.

DOI:10.1371/journal.pone.0287563
PMID:37352225
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10289351/
Abstract

Prosocial learning involves the acquisition of knowledge and skills necessary for making decisions that benefit others. We asked if, in the context of value-based decision-making, there is any difference between learning strategies for oneself vs. for others. We implemented a 2-step reinforcement learning paradigm in which participants learned, in separate blocks, to make decisions for themselves or for a present other confederate who evaluated their performance. We replicated the canonical features of the model-based and model-free reinforcement learning in our results. The behaviour of the majority of participants was best explained by a mixture of the model-based and model-free control, while most participants relied more heavily on MB control, and this strategy enhanced their learning success. Regarding our key self-other hypothesis, we did not find any significant difference between the behavioural performances nor in the model-based parameters of learning when comparing self and other conditions.

摘要

亲社会学习涉及获取做出有利于他人的决策所需的知识和技能。我们询问,在基于价值的决策背景下,为自己学习与为他人学习的策略之间是否存在差异。我们实施了一个两步强化学习范式,参与者在不同的模块中分别学习为自己或为评价其表现的当前其他同盟者做出决策。我们在结果中重现了基于模型和无模型强化学习的典型特征。大多数参与者的行为最好用基于模型和无模型控制的混合来解释,而大多数参与者更依赖基于模型的控制,并且这种策略提高了他们的学习成功率。关于我们关键的自我与他人假设,在比较自我和他人条件时,我们没有发现行为表现或基于模型的学习参数之间存在任何显著差异。

相似文献

1
Prosocial learning: Model-based or model-free?亲社会学习:基于模型还是无模型?
PLoS One. 2023 Jun 23;18(6):e0287563. doi: 10.1371/journal.pone.0287563. eCollection 2023.
2
l-DOPA and oxytocin influence the neurocomputational mechanisms of self-benefitting and prosocial reinforcement learning.l-多巴和催产素影响自我获益和亲社会强化学习的神经计算机制。
Neuroimage. 2023 Apr 15;270:119983. doi: 10.1016/j.neuroimage.2023.119983. Epub 2023 Feb 26.
3
Ageing is associated with disrupted reinforcement learning whilst learning to help others is preserved.随着年龄的增长,强化学习会受到干扰,而学习帮助他人的能力则得以保留。
Nat Commun. 2021 Jul 21;12(1):4440. doi: 10.1038/s41467-021-24576-w.
4
Oxytocin modulates neurocomputational mechanisms underlying prosocial reinforcement learning.催产素调节亲社会强化学习背后的神经计算机制。
Prog Neurobiol. 2022 Jun;213:102253. doi: 10.1016/j.pneurobio.2022.102253. Epub 2022 Mar 3.
5
Multi-task reinforcement learning in humans.人类的多任务强化学习。
Nat Hum Behav. 2021 Jun;5(6):764-773. doi: 10.1038/s41562-020-01035-y. Epub 2021 Jan 28.
6
Multiple memory systems as substrates for multiple decision systems.多种记忆系统作为多种决策系统的基础。
Neurobiol Learn Mem. 2015 Jan;117:4-13. doi: 10.1016/j.nlm.2014.04.014. Epub 2014 May 15.
7
Increased Ventromedial Prefrontal Cortex Activity in Adolescence Benefits Prosocial Reinforcement Learning.青少年时期腹内侧前额叶皮质活动增加有利于亲社会强化学习。
Dev Cogn Neurosci. 2021 Dec;52:101018. doi: 10.1016/j.dcn.2021.101018. Epub 2021 Oct 2.
8
(Reinforcement?) Learning to forage optimally.(强化?)学习最优觅食。
Curr Opin Neurobiol. 2017 Oct;46:162-169. doi: 10.1016/j.conb.2017.08.008. Epub 2017 Sep 15.
9
Hunger improves reinforcement-driven but not planned action.饥饿改善了强化驱动但不是计划好的行为。
Cogn Affect Behav Neurosci. 2021 Dec;21(6):1196-1206. doi: 10.3758/s13415-021-00921-w. Epub 2021 Oct 15.
10
Conformist social learning leads to self-organised prevention against adverse bias in risky decision making.从众的社会学习导致了自我组织的预防措施,以避免在风险决策中出现不利偏见。
Elife. 2022 May 10;11:e75308. doi: 10.7554/eLife.75308.

本文引用的文献

1
Neurocomputational mechanisms underlying the subjective value of information.信息主观价值的神经计算机制。
Commun Biol. 2021 Dec 13;4(1):1346. doi: 10.1038/s42003-021-02850-3.
2
Domain specificity versus process specificity: The "social brain" during strategic interaction.领域特殊性与过程特殊性:战略互动中的“社会大脑”。
Neuron. 2021 Oct 20;109(20):3236-3238. doi: 10.1016/j.neuron.2021.09.035.
3
The role of anticipated regret in choosing for others.预期后悔在代际决策中的作用。
Sci Rep. 2021 Jun 15;11(1):12557. doi: 10.1038/s41598-021-91635-z.
4
The actions of others act as a pseudo-reward to drive imitation in the context of social reinforcement learning.他人的行为在社会强化学习的背景下充当了一种虚假奖励,驱动着模仿。
PLoS Biol. 2020 Dec 8;18(12):e3001028. doi: 10.1371/journal.pbio.3001028. eCollection 2020 Dec.
5
Model-based decision making and model-free learning.基于模型的决策制定和无模型学习。
Curr Biol. 2020 Aug 3;30(15):R860-R865. doi: 10.1016/j.cub.2020.06.051.
6
Reinforcement learning across development: What insights can we draw from a decade of research?发展中的强化学习:我们能从十年的研究中得到哪些启示?
Dev Cogn Neurosci. 2019 Dec;40:100733. doi: 10.1016/j.dcn.2019.100733. Epub 2019 Nov 6.
7
Mental labour.脑力劳动。
Nat Hum Behav. 2018 Dec;2(12):899-908. doi: 10.1038/s41562-018-0401-9. Epub 2018 Sep 3.
8
Stress-induced reliance on habitual behavior is moderated by cortisol reactivity.应激引起的对习惯行为的依赖受皮质醇反应性的调节。
Brain Cogn. 2019 Jul;133:60-71. doi: 10.1016/j.bandc.2018.05.005. Epub 2018 May 25.
9
A Dual-Self Model of Impulse Control.冲动控制的双重自我模型。
Am Econ Rev. 2006 Dec;96(5):1449-76. doi: 10.1257/aer.96.5.1449.
10
The role of empathy in experiencing vicarious anxiety.共情在体验替代性焦虑中的作用。
J Exp Psychol Gen. 2017 Aug;146(8):1164-1188. doi: 10.1037/xge0000335. Epub 2017 Jun 19.