Norton Elyse H, Fleming Stephen M, Daw Nathaniel D, Landy Michael S
Department of Psychology, New York University, New York, New York, United States of America.
Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom.
PLoS Comput Biol. 2017 Jan 3;13(1):e1005304. doi: 10.1371/journal.pcbi.1005304. eCollection 2017 Jan.
Humans often make decisions based on uncertain sensory information. Signal detection theory (SDT) describes detection and discrimination decisions as a comparison of stimulus "strength" to a fixed decision criterion. However, recent research suggests that current responses depend on the recent history of stimuli and previous responses, suggesting that the decision criterion is updated trial-by-trial. The mechanisms underpinning criterion setting remain unknown. Here, we examine how observers learn to set a decision criterion in an orientation-discrimination task under both static and dynamic conditions. To investigate mechanisms underlying trial-by-trial criterion placement, we introduce a novel task in which participants explicitly set the criterion, and compare it to a more traditional discrimination task, allowing us to model this explicit indication of criterion dynamics. In each task, stimuli were ellipses with principal orientations drawn from two categories: Gaussian distributions with different means and equal variance. In the covert-criterion task, observers categorized a displayed ellipse. In the overt-criterion task, observers adjusted the orientation of a line that served as the discrimination criterion for a subsequently presented ellipse. We compared performance to the ideal Bayesian learner and several suboptimal models that varied in both computational and memory demands. Under static and dynamic conditions, we found that, in both tasks, observers used suboptimal learning rules. In most conditions, a model in which the recent history of past samples determines a belief about category means fit the data best for most observers and on average. Our results reveal dynamic adjustment of discrimination criterion, even after prolonged training, and indicate how decision criteria are updated over time.
人类常常基于不确定的感官信息做出决策。信号检测理论(SDT)将检测和辨别决策描述为刺激“强度”与固定决策标准的比较。然而,最近的研究表明,当前的反应取决于刺激的近期历史和先前的反应,这表明决策标准是逐次试验更新的。支撑标准设定的机制仍然未知。在这里,我们研究观察者如何在静态和动态条件下的方向辨别任务中学习设定决策标准。为了研究逐次试验标准设定的潜在机制,我们引入了一项新颖的任务,让参与者明确设定标准,并将其与更传统的辨别任务进行比较,从而使我们能够对这种标准动态的明确指示进行建模。在每个任务中,刺激都是椭圆,其主方向来自两类:具有不同均值和相等方差的高斯分布。在隐蔽标准任务中,观察者对显示的椭圆进行分类。在公开标准任务中,观察者调整一条线的方向,该线作为随后呈现的椭圆的辨别标准。我们将表现与理想贝叶斯学习者以及在计算和记忆需求方面各不相同的几个次优模型进行了比较。在静态和动态条件下,我们发现,在这两个任务中,观察者都使用了次优学习规则。在大多数情况下,一个模型,即过去样本的近期历史决定对类别均值的信念,最能拟合大多数观察者的数据以及平均数据。我们的结果揭示了即使经过长时间训练后辨别标准的动态调整,并指出了决策标准如何随时间更新。