Williams A M, Lattal K A
West Virginia University, USA.
J Exp Anal Behav. 1999 Mar;71(2):187-94. doi: 10.1901/jeab.1999.71-187.
The role of the response-reinforcer relation in maintaining operant behavior under conditions of delayed reinforcement was investigated by using a two-operandum (i.e., two-key) procedure with pigeons. Responding on one key was reinforced under a tandem variable-interval differential-reinforcement-of-other-behavior (tandem VI DRO) schedule. The schedule defined a resetting unsignaled delay-of-reinforcement procedure in that a response was required when the interfood interval of the VI schedule lapsed, but further responding during the DRO component on either key reset the time interval. This ensured a fixed delay duration between any response and reinforcement. Responding on another key, physically identical to the first one except for spatial location, otherwise was without consequence. The location of the key correlated with the delay-of-reinforcement procedure varied between sessions according to a semirandom sequence. Differences in response rates between the two keys were greater, with proportionally higher rates on the key correlated with the delay-of-reinforcement procedure, the longer the delay-of-reinforcement procedure remained correlated with the same key. Differences in responding on the two keys also increased within individual sessions. These results suggest that the response-reinforcer relation is the primary determinant of responding when responding is acquired and maintained with delayed reinforcement.
通过对鸽子采用双操作柄(即双按键)程序,研究了反应 - 强化物关系在延迟强化条件下维持操作性行为中的作用。在一个串联可变间隔其他行为差异强化(串联 VI DRO)时间表下,对一个按键的反应会得到强化。该时间表定义了一种重置无信号强化延迟程序,即当 VI 时间表的食物间隔结束时需要做出反应,但在 DRO 成分期间对任何一个按键的进一步反应都会重置时间间隔。这确保了任何反应与强化之间有固定的延迟持续时间。对另一个按键的反应,除了空间位置外与第一个按键在物理上相同,否则没有结果。与强化延迟程序相关的按键位置在各实验环节之间根据半随机序列变化。两个按键之间的反应速率差异越大,与强化延迟程序相关的按键上的比例速率越高,强化延迟程序与同一按键保持相关的时间就越长。在单个实验环节中,两个按键上的反应差异也会增加。这些结果表明,当通过延迟强化获得并维持反应时,反应 - 强化物关系是反应的主要决定因素。