• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Vicarious reinforcement learning signals when instructing others.在指导他人时的替代性强化学习信号。
J Neurosci. 2015 Feb 18;35(7):2904-13. doi: 10.1523/JNEUROSCI.3669-14.2015.
2
Reinforcement learning signals in the anterior cingulate cortex code for others' false beliefs.前扣带皮层的强化学习信号为他人的错误信念编码。
Neuroimage. 2013 Jan 1;64:1-9. doi: 10.1016/j.neuroimage.2012.09.010. Epub 2012 Sep 13.
3
The anterior cingulate cortex: monitoring the outcomes of others' decisions.前扣带皮层:监测他人决策的结果。
Soc Neurosci. 2012 Jul;7(4):424-35. doi: 10.1080/17470919.2011.638799. Epub 2011 Nov 25.
4
Encoding of Vicarious Reward Prediction in Anterior Cingulate Cortex and Relationship with Trait Empathy.前扣带回皮质中替代性奖励预测的编码及其与特质共情的关系。
J Neurosci. 2015 Oct 7;35(40):13720-7. doi: 10.1523/JNEUROSCI.1703-15.2015.
5
Processing of action- but not stimulus-related prediction errors differs between active and observational feedback learning.在主动反馈学习和观察性反馈学习中,与动作相关而非与刺激相关的预测误差的处理方式有所不同。
Neuropsychologia. 2015 Jan;66:75-87. doi: 10.1016/j.neuropsychologia.2014.10.036. Epub 2014 Nov 7.
6
The anterior cingulate gyrus signals the net value of others' rewards.前扣带回发出关于他人奖励净值的信号。
J Neurosci. 2014 Apr 30;34(18):6190-200. doi: 10.1523/JNEUROSCI.2701-13.2014.
7
Stimulus-outcome learnability differentially activates anterior cingulate and hippocampus at feedback processing.刺激-结果的可学习性在反馈处理时对前扣带回和海马体产生不同程度的激活。
Learn Mem. 2009 Apr 29;16(5):324-31. doi: 10.1101/lm.1191609. Print 2009 May.
8
Learned predictions of error likelihood in the anterior cingulate cortex.前扣带回皮质中对错误可能性的习得性预测。
Science. 2005 Feb 18;307(5712):1118-21. doi: 10.1126/science.1105783.
9
Reduced error-related activation in two anterior cingulate circuits is related to impaired performance in schizophrenia.两个前扣带回回路中与错误相关的激活减少与精神分裂症患者的表现受损有关。
Brain. 2008 Apr;131(Pt 4):971-86. doi: 10.1093/brain/awm307. Epub 2007 Dec 24.
10
Value and prediction error estimation account for volatility effects in ACC: a model-based fMRI study.价值和预测误差估计解释了 ACC 中的波动效应:基于模型的 fMRI 研究。
Cortex. 2013 Jun;49(6):1627-35. doi: 10.1016/j.cortex.2012.05.008. Epub 2012 May 26.

引用本文的文献

1
The Effects of Teacher Rewards and Their Types on Preschool Children's Selective Trust.教师奖励及其类型对学龄前儿童选择性信任的影响。
Behav Sci (Basel). 2025 Jun 12;15(6):804. doi: 10.3390/bs15060804.
2
Social Risk Coding by Amygdala Activity and Connectivity with the Dorsal Anterior Cingulate Cortex.杏仁核活动以及与背侧前扣带回皮质的连接所进行的社会风险编码
J Neurosci. 2025 Jan 29;45(5):e1149242024. doi: 10.1523/JNEUROSCI.1149-24.2024.
3
Observational reinforcement learning in children and young adults.儿童和青少年的观察性强化学习
NPJ Sci Learn. 2024 Mar 13;9(1):18. doi: 10.1038/s41539-024-00227-9.
4
Expecting the Unexpected: Infants Use Others' Surprise to Revise Their Own Expectations.意料之外的预期:婴儿利用他人的惊讶来修正自己的预期。
Open Mind (Camb). 2024 Mar 1;8:67-83. doi: 10.1162/opmi_a_00117. eCollection 2024.
5
The cultural evolution of teaching.教学的文化演变
Evol Hum Sci. 2023 May 12;5:e14. doi: 10.1017/ehs.2023.14. eCollection 2023.
6
Dissociation of vicarious and experienced rewards by coupling frequency within the same neural pathway.在相同神经通路上通过耦合频率来分离替代性体验奖赏。
Neuron. 2023 Aug 16;111(16):2513-2522.e4. doi: 10.1016/j.neuron.2023.05.020. Epub 2023 Jun 21.
7
How we learn social norms: a three-stage model for social norm learning.我们如何学习社会规范:一个社会规范学习的三阶段模型。
Front Psychol. 2023 Jun 2;14:1153809. doi: 10.3389/fpsyg.2023.1153809. eCollection 2023.
8
Teachers recruit mentalizing regions to represent learners' beliefs.教师招募心理理论区域来代表学习者的信念。
Proc Natl Acad Sci U S A. 2023 May 30;120(22):e2215015120. doi: 10.1073/pnas.2215015120. Epub 2023 May 22.
9
Neural implementation of computational mechanisms underlying the continuous trade-off between cooperation and competition.合作与竞争之间持续权衡背后计算机制的神经实现
Nat Commun. 2022 Nov 11;13(1):6873. doi: 10.1038/s41467-022-34509-w.
10
Distinct neural representations for prosocial and self-benefiting effort.亲社会和利己努力的独特神经表现。
Curr Biol. 2022 Oct 10;32(19):4172-4185.e7. doi: 10.1016/j.cub.2022.08.010. Epub 2022 Aug 26.

本文引用的文献

1
The Ultimatum Game and the brain: a meta-analysis of neuroimaging studies. ultimatum 游戏与大脑:神经影像学研究的荟萃分析。
Neurosci Biobehav Rev. 2014 Nov;47:549-58. doi: 10.1016/j.neubiorev.2014.10.014.
2
The neurobiology of rewards and values in social decision making.社会决策中的奖励和价值的神经生物学。
Nat Rev Neurosci. 2014 Aug;15(8):549-62. doi: 10.1038/nrn3776. Epub 2014 Jul 2.
3
The anterior cingulate gyrus signals the net value of others' rewards.前扣带回发出关于他人奖励净值的信号。
J Neurosci. 2014 Apr 30;34(18):6190-200. doi: 10.1523/JNEUROSCI.2701-13.2014.
4
Social learning in humans and other animals.人类和其他动物的社会学习。
Front Neurosci. 2014 Mar 31;8:58. doi: 10.3389/fnins.2014.00058. eCollection 2014.
5
The role of the midcingulate cortex in monitoring others' decisions.扣带前回在监控他人决策中的作用。
Front Neurosci. 2013 Dec 20;7:251. doi: 10.3389/fnins.2013.00251. eCollection 2013.
6
The behavioral and neural mechanisms underlying the tracking of expertise.专长追踪的行为和神经机制。
Neuron. 2013 Dec 18;80(6):1558-71. doi: 10.1016/j.neuron.2013.10.024.
7
The role of the striatum in social behavior.纹状体在社会行为中的作用。
Front Neurosci. 2013 Dec 10;7:233. doi: 10.3389/fnins.2013.00233.
8
From conflict management to reward-based decision making: actors and critics in primate medial frontal cortex.从冲突管理到基于奖励的决策:灵长类动物内侧前额叶皮层的作用者和批评者。
Neurosci Biobehav Rev. 2014 Oct;46 Pt 1:44-57. doi: 10.1016/j.neubiorev.2013.11.003. Epub 2013 Nov 15.
9
Toward a neural basis for social behavior.朝向社会行为的神经基础。
Neuron. 2013 Oct 30;80(3):816-26. doi: 10.1016/j.neuron.2013.10.038.
10
Activity of striatal neurons reflects social action and own reward.纹状体神经元的活动反映了社会行为和自身奖励。
Proc Natl Acad Sci U S A. 2013 Oct 8;110(41):16634-9. doi: 10.1073/pnas.1211342110. Epub 2013 Sep 23.

在指导他人时的替代性强化学习信号。

Vicarious reinforcement learning signals when instructing others.

作者信息

Apps Matthew A J, Lesage Elise, Ramnani Narender

机构信息

Nuffield Department of Clinical Neuroscience, University of Oxford, Oxford OX1 9DU, United Kingdom, Department of Experimental Psychology, University of Oxford, Oxford OX1 2JD, United Kingdom, Department of Psychology, Royal Holloway, University of London, Surrey TW20 0EX, United Kingdom, and

Department of Psychology, Royal Holloway, University of London, Surrey TW20 0EX, United Kingdom, and Neuroimaging Research Branch, Intramural Research Program, National Institute on Drug Abuse, National Institutes of Health, Baltimore, Maryland 21224.

出版信息

J Neurosci. 2015 Feb 18;35(7):2904-13. doi: 10.1523/JNEUROSCI.3669-14.2015.

DOI:10.1523/JNEUROSCI.3669-14.2015
PMID:25698730
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4331622/
Abstract

Reinforcement learning (RL) theory posits that learning is driven by discrepancies between the predicted and actual outcomes of actions (prediction errors [PEs]). In social environments, learning is often guided by similar RL mechanisms. For example, teachers monitor the actions of students and provide feedback to them. This feedback evokes PEs in students that guide their learning. We report the first study that investigates the neural mechanisms that underpin RL signals in the brain of a teacher. Neurons in the anterior cingulate cortex (ACC) signal PEs when learning from the outcomes of one's own actions but also signal information when outcomes are received by others. Does a teacher's ACC signal PEs when monitoring a student's learning? Using fMRI, we studied brain activity in human subjects (teachers) as they taught a confederate (student) action-outcome associations by providing positive or negative feedback. We examined activity time-locked to the students' responses, when teachers infer student predictions and know actual outcomes. We fitted a RL-based computational model to the behavior of the student to characterize their learning, and examined whether a teacher's ACC signals when a student's predictions are wrong. In line with our hypothesis, activity in the teacher's ACC covaried with the PE values in the model. Additionally, activity in the teacher's insula and ventromedial prefrontal cortex covaried with the predicted value according to the student. Our findings highlight that the ACC signals PEs vicariously for others' erroneous predictions, when monitoring and instructing their learning. These results suggest that RL mechanisms, processed vicariously, may underpin and facilitate teaching behaviors.

摘要

强化学习(RL)理论认为,学习是由行动的预测结果与实际结果之间的差异(预测误差[PEs])驱动的。在社会环境中,学习通常由类似的强化学习机制引导。例如,教师会监控学生的行为并给予他们反馈。这种反馈会在学生中引发预测误差,从而引导他们的学习。我们报告了第一项研究,该研究调查了教师大脑中强化学习信号背后的神经机制。前扣带回皮质(ACC)中的神经元在从自身行动结果中学习时会发出预测误差信号,但在他人接收结果时也会发出信息信号。当教师监控学生的学习时,其ACC会发出预测误差信号吗?我们使用功能磁共振成像(fMRI)研究了人类受试者(教师)在通过提供正面或负面反馈来教授一名同伙(学生)行动-结果关联时的大脑活动。我们检查了与学生反应时间锁定的活动,此时教师推断学生的预测并知道实际结果。我们将基于强化学习的计算模型应用于学生的行为,以表征他们的学习情况,并检查当学生的预测错误时教师的ACC是否发出信号。与我们的假设一致,教师ACC中的活动与模型中的预测误差值相关。此外,教师脑岛和腹内侧前额叶皮质的活动与根据学生情况预测的值相关。我们的研究结果表明,在监控和指导他人学习时,ACC会替代他人的错误预测发出预测误差信号。这些结果表明,通过替代方式处理的强化学习机制可能是教学行为的基础并促进教学行为。