Department of Biopsychology, Institute of Cognitive Neuroscience, Faculty of Psychology, GAFO 05/620, University of Bochum, Bochum, Germany.
J Exp Anal Behav. 2011 Sep;96(2):155-76. doi: 10.1901/jeab.2011.96-155.
Pigeons responded in a perceptual categorization task with six different stimuli (shades of gray), three of which were to be classified as "light" or "dark", respectively. Reinforcement probability for correct responses was varied from 0.2 to 0.6 across blocks of sessions and was unequal for correct light and dark responses. Introduction of a new reinforcement contingency resulted in a biphasic process of adjustment: First, choices were strongly biased towards the favored alternative, which was followed by a shift of preference back towards unbiased choice allocation. The data are well described by a signal detection model in which adjustment to a change in reinforcement contingency is modeled as the change of a criterion along a decision axis with fixed stimulus distributions. Moreover, the model shows that pigeons, after an initial overadjustment, distribute their responses almost optimally, although the overall benefit from doing so is extremely small. The strong and swift effect of minute changes in overall reinforcement probability precludes a choice strategy directly maximizing expected value, contrary to the assumption of signal detection theory. Instead, the rapid adjustments observed can be explained by a model in which reinforcement probabilities for each action, contingent on perceived stimulus intensity, determine choice allocation.
鸽子在一个感知分类任务中对六种不同的刺激(灰度)做出反应,其中三种被分别归类为“亮”或“暗”。在整个会话块中,正确反应的强化概率从 0.2 到 0.6 不等,而且正确的亮反应和暗反应的强化概率也不相等。引入新的强化条件会导致调整出现双相过程:首先,选择强烈偏向于有利的选择,然后偏好又回到无偏差的选择分配。数据可以很好地用信号检测模型来描述,其中对强化条件变化的调整被建模为沿着决策轴改变准则,而刺激分布是固定的。此外,该模型表明,鸽子在最初的过度调整后,几乎可以最优地分配它们的反应,尽管这样做的总体收益极小。整体强化概率的微小变化会产生强烈而迅速的影响,这与信号检测理论的假设相反,即不能直接采用最大化预期值的选择策略。相反,观察到的快速调整可以用一个模型来解释,其中每个动作的强化概率取决于感知到的刺激强度,决定了选择分配。