Shimp C P
J Exp Anal Behav. 1966 Jul;9(4):443-55. doi: 10.1901/jeab.1966.9-443.
A single principle, "momentary maximizing", may account for much of a pigeon's steady-state behavior in both probability learning and concurrent variable interval experiments. The principle states that a pigeon tends to choose the alternative that momentarily has the higher probability of reinforcement. A successive discrimination procedure, which produced matching in an earlier experiment, produced here a tendency to maximize if training were adequately extended. Maximizing was produced also by other procedures, in which no reinforcing event was presented on some trials: one procedure did and two did not provide a bird with information about the availability of reinforcement on a key after an unreinforced response on the other key. The latter two procedures were analogous to concurrent variable interval schedules in two respects: the reinforcement probability on one key increased while a bird responded on the other key; and they produced matching. But sequential statistics suggested that matching resulted from momentary maximizing. Depending on the procedure, the tendency to maximize produced different relative frequencies of pecking a key for a fixed relative frequency of reinforcement. Computer simulation of maximizing behavior in several concurrent variable interval schedules produced matching and sequential statistics similar to those produced by a real bird.
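The simulation mentioned at the end of the abstract can be illustrated with a minimal sketch. This is not the program used in the paper. It is a discrete-trial approximation under stated assumptions: on every trial each key independently becomes "armed" with a fixed probability, an armed reinforcer is held until the next peck on that key, and the simulated bird always pecks the key whose momentary probability of being armed is higher. The function name, the arming probabilities, and the tie-breaking rule (ties go to the left key) are all hypothetical choices made for illustration.

```python
import random


def simulate_momentary_maximizing(p_left, p_right, trials=20000, seed=0):
    """Discrete-trial sketch of momentary maximizing on two concurrent schedules.

    p_left, p_right: assumed per-trial probabilities that an unarmed key
    becomes armed (a stand-in for two variable-interval schedules).
    """
    rng = random.Random(seed)
    p = (p_left, p_right)
    armed = [False, False]   # is a reinforcer currently waiting on each key?
    since = [1, 1]           # arming opportunities since each key was last pecked
    choices = [0, 0]
    reinforcers = [0, 0]

    for _ in range(trials):
        # Each unarmed key gets one chance to arm on this trial.
        for k in (0, 1):
            if not armed[k] and rng.random() < p[k]:
                armed[k] = True

        # Momentary probability that each key is armed, given the time
        # since it was last pecked: 1 - (1 - p)^n.
        prob = [1.0 - (1.0 - p[k]) ** since[k] for k in (0, 1)]

        # Momentary maximizing: peck the key with the higher probability
        # (assumption: ties go to the left key).
        choice = 0 if prob[0] >= prob[1] else 1
        choices[choice] += 1

        # Collect the reinforcer if one is waiting; either way the pecked
        # key is now known to be unarmed.
        if armed[choice]:
            reinforcers[choice] += 1
            armed[choice] = False
        since[choice] = 1
        since[1 - choice] += 1

    total_reinforcers = sum(reinforcers)
    return {
        "choice_share_left": choices[0] / trials,
        "reinforcer_share_left": (
            reinforcers[0] / total_reinforcers if total_reinforcers else float("nan")
        ),
    }


if __name__ == "__main__":
    result = simulate_momentary_maximizing(0.15, 0.05)
    print(result)
```

Comparing the two returned shares shows whether the relative frequency of pecking a key approximates the relative frequency of reinforcement on that key, which is the sense of "matching" used in the abstract. Because the simulated bird follows a deterministic rule, its choice sequence also has a fixed sequential structure that can be inspected directly by logging `choice` on each trial.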