Elliffe Douglas, Davison Michael
The University of Auckland, New Zealand.
Behav Processes. 2010 May;84(1):381-9. doi: 10.1016/j.beproc.2009.11.009. Epub 2009 Nov 24.
Six pigeons responded on a four-key concurrent variable-interval schedule in which a 27:9:3:1 distribution of reinforcers between the keys changed every 10 reinforcers. Their behaviour quickly came under the control of this changing four-way reinforcer ratio. However, preference between a pair of keys depended not only on the relative reinforcer rates on those keys, but also on the absolute levels of those rates. This contradicts the constant-ratio rule that underpins the matching approach to choice, but is predicted by a contingency-discriminability model that assumes that organisms may occasionally misattribute reinforcers to a response that did not produce them. Reinforcers produced strong preference pulses, or transient increases in responding on the just-reinforced key. Despite accurate tracking of the reinforcer ratio, reinforcers obtained late in components and from leaner keys still produced strong pulses, suggesting both extended and local control of behaviour. Patterns of switching between keys were graded and similarly controlled by the reinforcer rates on each key. Whether considered in terms of switching, local preference pulses, or extended preference, behaviour was controlled by a rapidly changing four-way reinforcer ratio in a graduated, continuous manner that is unlikely to be explained by a simple heuristic such as fix-and-sample.
六只鸽子在一个四键并发可变间隔程序中做出反应,其中按键之间强化物的分配比例为27:9:3:1,每10次强化物发放后会发生变化。它们的行为很快受到这种不断变化的四路强化物比例的控制。然而,一对按键之间的偏好不仅取决于这些按键上相对的强化物发放率,还取决于这些发放率的绝对水平。这与支撑选择匹配方法的恒定比例规则相矛盾,但由一种偶然辨别模型预测,该模型假设生物体可能偶尔会将强化物错误地归因于并未产生强化物的反应。强化物产生强烈的偏好脉冲,即在刚得到强化的按键上反应出现短暂增加。尽管准确跟踪了强化物比例,但在各成分后期以及从较少强化物的按键获得的强化物仍会产生强烈脉冲,这表明行为受到长期和局部的控制。按键之间的切换模式是分级的,并且同样受到每个按键上强化物发放率的控制。无论是从切换、局部偏好脉冲还是长期偏好的角度来看,行为都以一种渐进、连续的方式受到快速变化的四路强化物比例的控制,这种方式不太可能用诸如固定和采样这样的简单启发式方法来解释。