Suppr超能文献

刺激辨别能力可能会使基于价值的概率学习产生偏差。

Stimulus discriminability may bias value-based probabilistic learning.

作者信息

Schutte Iris, Slagter Heleen A, Collins Anne G E, Frank Michael J, Kenemans J Leon

机构信息

Department of Experimental Psychology and Psychopharmacology, Helmholtz Institute, Utrecht University, Utrecht, The Netherlands.

Department of Psychology and ABC, University of Amsterdam, Amsterdam, The Netherlands.

出版信息

PLoS One. 2017 May 8;12(5):e0176205. doi: 10.1371/journal.pone.0176205. eCollection 2017.

Abstract

Reinforcement learning tasks are often used to assess participants' tendency to learn more from the positive or more from the negative consequences of one's action. However, this assessment often requires comparison in learning performance across different task conditions, which may differ in the relative salience or discriminability of the stimuli associated with more and less rewarding outcomes, respectively. To address this issue, in a first set of studies, participants were subjected to two versions of a common probabilistic learning task. The two versions differed with respect to the stimulus (Hiragana) characters associated with reward probability. The assignment of character to reward probability was fixed within version but reversed between versions. We found that performance was highly influenced by task version, which could be explained by the relative perceptual discriminability of characters assigned to high or low reward probabilities, as assessed by a separate discrimination experiment. Participants were more reliable in selecting rewarding characters that were more discriminable, leading to differences in learning curves and their sensitivity to reward probability. This difference in experienced reinforcement history was accompanied by performance biases in a test phase assessing ability to learn from positive vs. negative outcomes. In a subsequent large-scale web-based experiment, this impact of task version on learning and test measures was replicated and extended. Collectively, these findings imply a key role for perceptual factors in guiding reward learning and underscore the need to control stimulus discriminability when making inferences about individual differences in reinforcement learning.

摘要

强化学习任务通常用于评估参与者从自身行为的积极后果或消极后果中学习更多的倾向。然而,这种评估通常需要比较不同任务条件下的学习表现,而这些条件可能在与奖励程度或多或少相关的刺激的相对显著性或可辨别性方面存在差异。为了解决这个问题,在第一组研究中,参与者接受了一个常见概率学习任务的两个版本。这两个版本在与奖励概率相关的刺激(平假名)字符方面有所不同。字符与奖励概率的分配在版本内是固定的,但在不同版本之间是相反的。我们发现表现受到任务版本的高度影响,这可以通过在一个单独的辨别实验中评估的、分配给高或低奖励概率的字符的相对感知可辨别性来解释。参与者在选择更具可辨别性的奖励字符时更可靠,这导致了学习曲线及其对奖励概率的敏感性存在差异。这种经历的强化历史差异在评估从积极与消极结果中学习能力的测试阶段伴随着表现偏差。在随后的大规模基于网络的实验中,任务版本对学习和测试指标的这种影响得到了重复和扩展。总体而言,这些发现暗示了感知因素在指导奖励学习中的关键作用,并强调在推断强化学习中的个体差异时控制刺激可辨别性的必要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a1c/5421749/ba9e2a42213d/pone.0176205.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验