社交具有特殊性：一个关于利用评价性反馈进行教学和从中学习的规范框架。

Social is special: A normative framework for teaching with and learning from evaluative feedback.

作者信息

Ho Mark K, MacGlashan James, Littman Michael L, Cushman Fiery

机构信息

Department of Cognitive, Linguistic & Psychological Sciences, Brown University, Box 1821, Providence, RI 02912, United States.

Department of Computer Science, Brown University, 115 Waterman St, Providence, RI 02906, United States.

出版信息

Cognition. 2017 Oct;167:91-106. doi: 10.1016/j.cognition.2017.03.006. Epub 2017 Mar 22.

DOI:10.1016/j.cognition.2017.03.006

PMID:28341268

Abstract

Humans often attempt to influence one another's behavior using rewards and punishments. How does this work? Psychologists have often assumed that "evaluative feedback" influences behavior via standard learning mechanisms that learn from environmental contingencies. On this view, teaching with evaluative feedback involves leveraging learning systems designed to maximize an organism's positive outcomes. Yet, despite its parsimony, programs of research predicated on this assumption, such as ones in developmental psychology, animal behavior, and human-robot interaction, have had limited success. We offer an explanation by analyzing the logic of evaluative feedback and show that specialized learning mechanisms are uniquely favored in the case of evaluative feedback from a social partner. Specifically, evaluative feedback works best when it is treated as communicating information about the value of an action rather than as a form of reward to be maximized. This account suggests that human learning from evaluative feedback depends on inferences about communicative intent, goals and other mental states-much like learning from other sources, such as demonstration, observation and instruction. Because these abilities are especially developed in humans, the present account also explains why evaluative feedback is far more widespread in humans than non-human animals.

摘要

人类常常试图通过奖励和惩罚来影响彼此的行为。这是如何起作用的呢？心理学家们常常假定“评价性反馈”是通过从环境偶然性中学习的标准学习机制来影响行为的。按照这种观点，利用评价性反馈进行教学涉及利用旨在使生物体的积极结果最大化的学习系统。然而，尽管这种观点简洁明了，但基于这一假设的研究项目，比如发展心理学、动物行为学以及人机交互方面的研究，取得的成功却很有限。我们通过分析评价性反馈的逻辑给出了一种解释，并表明在来自社会伙伴的评价性反馈的情况下，专门的学习机制具有独特的优势。具体而言，当评价性反馈被视为传达有关一种行为的价值的信息，而不是被视为一种要最大化的奖励形式时，它的效果最佳。这种解释表明，人类从评价性反馈中学习依赖于对交际意图、目标和其他心理状态的推断——这与从其他来源（如示范、观察和指导）学习非常相似。由于这些能力在人类中尤其发达，所以目前的解释也说明了为什么评价性反馈在人类中比在非人类动物中更为普遍。

相似文献

Social is special: A normative framework for teaching with and learning from evaluative feedback.

Cognition. 2017 Oct;167:91-106. doi: 10.1016/j.cognition.2017.03.006. Epub 2017 Mar 22.

People teach with rewards and punishments as communication, not reinforcements.

J Exp Psychol Gen. 2019 Mar;148(3):520-549. doi: 10.1037/xge0000569.

Social stress reactivity alters reward and punishment learning.

Soc Cogn Affect Neurosci. 2011 Jun;6(3):311-20. doi: 10.1093/scan/nsq041. Epub 2010 May 7.

Punishment is Organized around Principles of Communicative Inference.

Cognition. 2021 Mar;208:104544. doi: 10.1016/j.cognition.2020.104544. Epub 2020 Dec 28.

Effects of reward and punishment on learning from errors in smokers.

Drug Alcohol Depend. 2018 Jul 1;188:32-38. doi: 10.1016/j.drugalcdep.2018.03.028. Epub 2018 Apr 30.

Learning from social rewards predicts individual differences in self-reported social ability.

J Exp Psychol Gen. 2014 Feb;143(1):332-9. doi: 10.1037/a0031511. Epub 2013 Jan 21.

Modulation of cognitive flexibility by reward and punishment in BALB/cJ and BALB/cByJ mice.

Behav Brain Res. 2020 Jan 27;378:112294. doi: 10.1016/j.bbr.2019.112294. Epub 2019 Oct 15.

How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.

J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.

Integrating temporal difference methods and self-organizing neural networks for reinforcement learning with delayed evaluative feedback.

IEEE Trans Neural Netw. 2008 Feb;19(2):230-44. doi: 10.1109/TNN.2007.905839.

Drift diffusion model of reward and punishment learning in schizophrenia: Modeling and experimental data.

Behav Brain Res. 2015 Sep 15;291:147-154. doi: 10.1016/j.bbr.2015.05.024. Epub 2015 May 22.

引用本文的文献

The influence of social feedback on reward learning in the Iowa gambling task.

Front Psychol. 2024 May 2;15:1292808. doi: 10.3389/fpsyg.2024.1292808. eCollection 2024.

Human and nonhuman norms: a dimensional framework.

Philos Trans R Soc Lond B Biol Sci. 2024 Mar 11;379(1897):20230026. doi: 10.1098/rstb.2023.0026. Epub 2024 Jan 22.

Machiavellian strategist or cultural learner? Mentalizing and learning over development in a resource-sharing game.

Evol Hum Sci. 2021 Mar 10;3:e14. doi: 10.1017/ehs.2021.11. eCollection 2021.

The cultural evolution of teaching.

Evol Hum Sci. 2023 May 12;5:e14. doi: 10.1017/ehs.2023.14. eCollection 2023.

Rethinking Norm Psychology.

Perspect Psychol Sci. 2024 Jan;19(1):12-38. doi: 10.1177/17456916221112075. Epub 2023 Jul 13.

Entering into a self-regulated learning mode prevents detrimental effects of feedback removal on memory.

NPJ Sci Learn. 2023 Jan 6;8(1):2. doi: 10.1038/s41539-022-00150-x.

Learning from other minds: An optimistic critique of reinforcement learning models of social learning.

Curr Opin Behav Sci. 2021 Apr;38:110-115. doi: 10.1016/j.cobeha.2021.01.006. Epub 2021 Mar 23.

Leveraging artificial intelligence to improve people's planning strategies.

Proc Natl Acad Sci U S A. 2022 Mar 22;119(12):e2117432119. doi: 10.1073/pnas.2117432119. Epub 2022 Mar 16.

Emotion prediction errors guide socially adaptive behaviour.

Nat Hum Behav. 2021 Oct;5(10):1391-1401. doi: 10.1038/s41562-021-01213-6. Epub 2021 Oct 19.

Trusting and learning from others: immediate and long-term effects of learning from observation and advice.

Proc Biol Sci. 2021 Oct 27;288(1961):20211414. doi: 10.1098/rspb.2021.1414. Epub 2021 Oct 20.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

社交具有特殊性：一个关于利用评价性反馈进行教学和从中学习的规范框架。

Social is special: A normative framework for teaching with and learning from evaluative feedback.

作者信息

Ho Mark K, MacGlashan James, Littman Michael L, Cushman Fiery

机构信息

Department of Cognitive, Linguistic & Psychological Sciences, Brown University, Box 1821, Providence, RI 02912, United States.

Department of Computer Science, Brown University, 115 Waterman St, Providence, RI 02906, United States.

出版信息

Cognition. 2017 Oct;167:91-106. doi: 10.1016/j.cognition.2017.03.006. Epub 2017 Mar 22.

DOI:10.1016/j.cognition.2017.03.006

PMID:28341268

Abstract

摘要

社交具有特殊性：一个关于利用评价性反馈进行教学和从中学习的规范框架。

Social is special: A normative framework for teaching with and learning from evaluative feedback.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

社交具有特殊性：一个关于利用评价性反馈进行教学和从中学习的规范框架。

Social is special: A normative framework for teaching with and learning from evaluative feedback.

作者信息

机构信息

出版信息

相似文献

引用本文的文献