Dragoi V, Staddon J E
Department of Psychology, Duke University, USA.
Psychol Rev. 1999 Jan;106(1):20-61. doi: 10.1037/0033-295x.106.1.20.
Existing models of operant learning are relatively insensitive to historical properties of behavior and applicable to only limited data sets. This article proposes a minimal set of principles based on short-term and long-term memory mechanisms that can explain the major static and dynamic properties of operant behavior in both single-choice and multiresponse situations. The critical features of the theory are as follows: (a) The key property of conditioning is assessment of the degree of association between responses and reinforcement and between stimuli and reinforcement; (b) the contingent reinforcement is represented by learning expectancy, which is the combined prediction of response-reinforcement and stimulus-reinforcement associations; (c) the operant response is controlled by the interplay between facilitatory and suppressive variables that integrate differences between expected (long-term) and experienced (short-term) events; and (d) very-long-term effects are encoded by a consolidated memory that is sensitive to the entire reinforcement history. The model predicts the major qualitative features of operant phenomena and then suggests an experimental test of theoretical predictions about the joint effects of reinforcement probability and amount of training on operant choice. We hypothesize that the set of elementary principles that we propose may help resolve the long-standing debate about the fundamental variables controlling operant conditioning.