Optimistic biases in observational learning of value.

Affiliation

Wellcome Trust Centre for Neuroimaging at UCL, 12 Queen Square, London WC1N 3BG, UK.

Publication information

Cognition. 2011 Jun;119(3):394-402. doi: 10.1016/j.cognition.2011.02.004. Epub 2011 Feb 26.

Abstract

Action-outcome contingencies can be learnt either by active trial-and-error, or vicariously, by observing the outcomes of actions performed by others. The extant literature is ambiguous as to which of these modes of learning is more effective, as controlled comparisons of operant and observational learning are rare. Here, we contrasted human operant and observational value learning, assessing implicit and explicit measures of learning from positive and negative reinforcement. Compared to direct operant learning, we show observational learning is associated with an optimistic over-valuation of low-value options, a pattern apparent both in participants' choice preferences and their explicit post-hoc estimates of value. Learning of higher value options showed no such bias. We suggest that such a bias can be explained as a tendency for optimistic underestimation of the chance of experiencing negative events, an optimism repressed when information is gathered through direct operant learning.
