观察性学习中价值的乐观偏见。

Optimistic biases in observational learning of value.

机构信息

Wellcome Trust Centre for Neuroimaging at UCL, 12 Queen Square, London WC1N 3BG, UK.

出版信息

Cognition. 2011 Jun;119(3):394-402. doi: 10.1016/j.cognition.2011.02.004. Epub 2011 Feb 26.

DOI:10.1016/j.cognition.2011.02.004

PMID:21354558

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3081069/

Abstract

Action-outcome contingencies can be learnt either by active trial-and-error, or vicariously, by observing the outcomes of actions performed by others. The extant literature is ambiguous as to which of these modes of learning is more effective, as controlled comparisons of operant and observational learning are rare. Here, we contrasted human operant and observational value learning, assessing implicit and explicit measures of learning from positive and negative reinforcement. Compared to direct operant learning, we show observational learning is associated with an optimistic over-valuation of low-value options, a pattern apparent both in participants' choice preferences and their explicit post-hoc estimates of value. Learning of higher value options showed no such bias. We suggest that such a bias can be explained as a tendency for optimistic underestimation of the chance of experiencing negative events, an optimism repressed when information is gathered through direct operant learning.

摘要

行为-结果关联既可以通过主动试错，也可以通过观察他人的行为及其结果来间接习得。然而，关于哪种学习模式更为有效，现有文献的结论并不明确，因为很少有对操作性学习和观察性学习的对照研究。在这里，我们比较了人类的操作性学习和观察性学习，评估了从正强化和负强化中学习的内隐和外显测量。与直接操作性学习相比，我们发现观察性学习与对低价值选项的过度乐观高估有关，这种模式在参与者的选择偏好和他们对价值的明确事后估计中都很明显。对于高价值选项的学习则没有这种偏见。我们认为，这种偏见可以解释为对经历负面事件的可能性的乐观低估，当通过直接操作性学习收集信息时，这种乐观主义会被压抑。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2233/3081069/dc7a152df469/gr1.jpg

相似文献

Optimistic biases in observational learning of value.观察性学习中价值的乐观偏见。

Cognition. 2011 Jun;119(3):394-402. doi: 10.1016/j.cognition.2011.02.004. Epub 2011 Feb 26.

A reinforcement learning mechanism responsible for the valuation of free choice.一种负责自由选择估值的强化学习机制。

Neuron. 2014 Aug 6;83(3):551-7. doi: 10.1016/j.neuron.2014.06.035. Epub 2014 Jul 24.

Try and try again: Post-error boost of an implicit measure of agency.不断尝试：错误后对能动性内隐测量的促进作用。

Q J Exp Psychol (Hove). 2018 Jul;71(7):1584-1595. doi: 10.1080/17470218.2017.1350871. Epub 2018 Jan 1.

Neural correlates of the divergence of instrumental probability distributions.神经关联的工具概率分布的发散。

J Neurosci. 2013 Jul 24;33(30):12519-27. doi: 10.1523/JNEUROSCI.1353-13.2013.

The involvement of model-based but not model-free learning signals during observational reward learning in the absence of choice.在无选择情况下观察性奖励学习过程中基于模型而非无模型学习信号的参与。

J Neurophysiol. 2016 Jun 1;115(6):3195-203. doi: 10.1152/jn.00046.2016. Epub 2016 Apr 6.

Implicit Valuation of the Near-Miss is Dependent on Outcome Context.近错失值的内隐评估取决于结果情境。

J Gambl Stud. 2018 Mar;34(1):181-197. doi: 10.1007/s10899-017-9705-3.

The neural coding of expected and unexpected monetary performance outcomes: dissociations between active and observational learning.预期和意外货币绩效结果的神经编码：主动学习和观察学习之间的分离。

Behav Brain Res. 2012 Feb 1;227(1):241-51. doi: 10.1016/j.bbr.2011.10.042. Epub 2011 Nov 6.

Information about action outcomes differentially affects learning from self-determined versus imposed choices.关于行动结果的信息会对自主选择和强制选择的学习产生不同的影响。

Nat Hum Behav. 2020 Oct;4(10):1067-1079. doi: 10.1038/s41562-020-0919-5. Epub 2020 Aug 3.

Decision-making impairments in the context of intact reward sensitivity in schizophrenia.精神分裂症中奖励敏感性正常情况下的决策障碍

Biol Psychiatry. 2008 Jul 1;64(1):62-9. doi: 10.1016/j.biopsych.2008.02.015. Epub 2008 Apr 2.

Credit Assignment in a Motor Decision Making Task Is Influenced by Agency and Not Sensory Prediction Errors.在一项运动决策任务中，信用分配受机构影响，而不受感官预测误差影响。

J Neurosci. 2018 May 9;38(19):4521-4530. doi: 10.1523/JNEUROSCI.3601-17.2018. Epub 2018 Apr 12.

引用本文的文献

Asymmetric coupling of action and outcome valence in active and observational feedback learning.在主动和观察反馈学习中，动作和结果效价的非对称耦合。

Psychol Res. 2021 Jun;85(4):1553-1566. doi: 10.1007/s00426-020-01340-1. Epub 2020 Apr 22.

Other People's Money: The Role of Reciprocity and Social Uncertainty in Decisions for Others.他人的金钱：互惠与社会不确定性在为他人做决策中的作用。

J Neurosci Psychol Econ. 2017 Jun-Sep;10(2-3):59-80. doi: 10.1037/npe0000063.

Vicarious neural processing of outcomes during observational learning.观察学习过程中结果的替代性神经加工。

PLoS One. 2013 Sep 5;8(9):e73879. doi: 10.1371/journal.pone.0073879. eCollection 2013.

Social learning as a way to overcome choice-induced preferences? Insights from humans and rhesus macaques.社会学习作为克服选择诱导偏好的一种方式？来自人类和恒河猴的见解。

Front Neurosci. 2012 Sep 3;6:127. doi: 10.3389/fnins.2012.00127. eCollection 2012.

本文引用的文献

A key role for similarity in vicarious reward.替代性奖励中相似性的关键作用。

Science. 2009 May 15;324(5929):900. doi: 10.1126/science.1170539.

Medial orbitofrontal cortex codes relative rather than absolute value of financial rewards in humans.内侧眶额皮质编码人类金融奖励的相对价值而非绝对价值。

Eur J Neurosci. 2008 May;27(9):2213-8. doi: 10.1111/j.1460-9568.2008.06202.x.

Motor-cortical beta oscillations are modulated by correctness of observed action.运动皮层β振荡受观察到的动作正确性的调节。

Neuroimage. 2008 Apr 1;40(2):767-775. doi: 10.1016/j.neuroimage.2007.12.018. Epub 2007 Dec 23.

Judgment under Uncertainty: Heuristics and Biases.《不确定性下的判断：启发式与偏差》

Science. 1974 Sep 27;185(4157):1124-31. doi: 10.1126/science.185.4157.1124.

Orbitofrontal cortex mediates outcome encoding in Pavlovian but not instrumental conditioning.眶额叶皮质在巴甫洛夫条件反射而非工具性条件反射中介导结果编码。

J Neurosci. 2007 May 2;27(18):4819-25. doi: 10.1523/JNEUROSCI.5443-06.2007.

Brain responses to outcomes of one's own and other's performance in a gambling task.在赌博任务中大脑对自身和他人表现结果的反应。

Neuroreport. 2006 Nov 6;17(16):1747-51. doi: 10.1097/01.wnr.0000239960.98813.50.

Dynamic response-by-response models of matching behavior in rhesus monkeys.恒河猴匹配行为中逐个反应的动态模型。

J Exp Anal Behav. 2005 Nov;84(3):555-79. doi: 10.1901/jeab.2005.110-04.

On adaptation, maximization, and reinforcement learning among cognitive strategies.论认知策略中的适应性、最大化与强化学习。

Psychol Rev. 2005 Oct;112(4):912-931. doi: 10.1037/0033-295X.112.4.912.

The dark side of emotion in decision-making: when individuals with decreased emotional reactions make more advantageous decisions.决策中情绪的阴暗面：情绪反应减弱的个体如何做出更有利的决策。

Brain Res Cogn Brain Res. 2005 Apr;23(1):85-92. doi: 10.1016/j.cogbrainres.2005.01.006.

Decisions from experience and the effect of rare events in risky choice.基于经验的决策以及罕见事件在风险选择中的影响。

Psychol Sci. 2004 Aug;15(8):534-9. doi: 10.1111/j.0956-7976.2004.00715.x.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

观察性学习中价值的乐观偏见。

Optimistic biases in observational learning of value.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献