Institute of Pathophysiology, University Medical Center of the Johannes Gutenberg University Mainz, Germany.
Centre for Cognitive Science, Institute of Psychology, Technical University of Darmstadt, Germany.
J Exp Anal Behav. 2024 May;121(3):294-313. doi: 10.1002/jeab.908. Epub 2024 Mar 1.
Discrimination performance in perceptual choice tasks is known to reflect both sensory discriminability and nonsensory response bias. In the framework of signal detection theory, these aspects of discrimination performance are quantified through separate measures, sensitivity (d') for sensory discriminability and decision criterion (c) for response bias. However, it is unknown how response bias (i.e., criterion) changes at the single-trial level as a consequence of reinforcement history. We subjected rats to a two-stimulus two-response conditional discrimination task with auditory stimuli and induced response bias through unequal reinforcement probabilities for the two responses. We compared three signal-detection-theory-based criterion learning models with respect to their ability to fit experimentally observed fluctuations of response bias on a trial-by-trial level. These models shift the criterion by a fixed step (1) after each reinforced response or (2) after each nonreinforced response or (3) after both. We find that all three models fail to capture essential aspects of the data. Prompted by the observation that steady-state criterion values conformed well to a behavioral model of signal detection based on the generalized matching law, we constructed a trial-based version of this model and find that it provides a superior account of response bias fluctuations under changing reinforcement contingencies.
在感知选择任务中,辨别性能被认为反映了感觉辨别力和非感觉反应偏差。在信号检测理论的框架内,通过单独的测量来量化辨别性能的这些方面,即感觉辨别力的敏感性 (d') 和反应偏差的决策标准 (c)。然而,不知道在强化历史的影响下,反应偏差(即标准)如何在单次试验水平上发生变化。我们让大鼠接受了一个具有听觉刺激的两刺激两反应条件辨别任务,并通过对两种反应的不等强化概率来诱导反应偏差。我们比较了三种基于信号检测理论的决策标准学习模型,以确定它们在单次试验水平上拟合实验观察到的反应偏差波动的能力。这些模型在每次强化反应后 (1) 或每次非强化反应后 (2) 或两者后 (3) 以固定的步长移动标准。我们发现所有三种模型都无法捕捉数据的重要方面。由于观察到稳态标准值与基于广义匹配律的信号检测行为模型很好地吻合,我们构建了这个模型的基于试验的版本,并发现它在变化的强化条件下提供了对反应偏差波动的更好解释。