Urcuioli Peter J
Department of Psychological Sciences, Purdue University, 703 Third Street, West Lafayette, IN 47907-2081, USA.
Learn Behav. 2005 Feb;33(1):1-21. doi: 10.3758/bf03196047.
The role of the reinforcer in instrumental discriminations has often been viewed as that of facilitating associative learning between a reinforced response and the discriminative stimulus that occasions it. The differential-outcome paradigm introduced by Trapold (1970), however, has provided compelling evidence that reinforcers are also part of what is learned in discrimination tasks. Specifically, when the availability of different reinforcing outcomes is signaled by different discriminative stimuli, the conditioned anticipation of those outcomes can provide another source of stimulus control over responding. This article reviews how such control develops and how it can be revealed, its impact on behavior, and different possible mechanisms that could mediate the behavioral effects. The main conclusion is that differential-outcome effects are almost entirely explicable in terms of the cue properties of outcome expectancies-namely, that conditioned expectancies acquire discriminative control just like any other discriminative or conditional stimulus in instrumental learning.
在工具性辨别中,强化物的作用通常被视为促进强化反应与引发该反应的辨别性刺激之间的联想学习。然而,特拉波尔德(1970年)引入的差异结果范式提供了令人信服的证据,表明强化物也是辨别任务中所学内容的一部分。具体而言,当不同的辨别性刺激预示着不同强化结果的可得性时,对这些结果的条件预期可以为反应提供另一种刺激控制源。本文回顾了这种控制是如何发展的、如何被揭示、其对行为的影响以及可能介导行为效应的不同机制。主要结论是,差异结果效应几乎完全可以根据结果预期的线索特性来解释——也就是说,条件预期在工具性学习中获得辨别控制,就像任何其他辨别性或条件性刺激一样。