Modelling Animal Decisions (MAD) Group, School of Biological Sciences, University of Bristol, England.
J Exp Anal Behav. 2012 Nov;98(3):355-67. doi: 10.1901/jeab.2012.98-355.
The relationship between positive and negative reinforcement and the symmetry of Thorndike's law of effect are unresolved issues in operant psychology. Here we show that, for a given pattern of responding on variable interval (VI) schedules with the same programmed rate of food rewards (positive reinforcement VI) or electric shocks (negative reinforcement VI), there is a fundamental mathematical equivalence between reward gain and shock reduction. We also provide the first normative account of how animals should respond on a negative VI schedule, showing that it is better to space responses evenly than to respond with a variable interresponse time (IRT). Published data from rats, however, indicate that these animals respond irregularly, often with a burst of activity immediately following a shock. While this is irrational in the experimental setting, it may represent an appropriate response to the heterogeneity of stimuli commonly encountered in natural environments. We discuss the broader implications of our analysis for understanding how animals evaluate appetitive and aversive stimuli.
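The two claims in the abstract (equivalence of reward gain and shock reduction, and the advantage of evenly spaced responding) can be illustrated with a small simulation. This is only a sketch under assumptions that the abstract does not spell out: programmed events arrive as a Poisson process; on the positive VI a response collects a reward if at least one reward was set up since the previous response (at most one is held); on the negative VI a programmed shock is cancelled if at least one response occurred since the previous programmed shock. The function names and parameter values are hypothetical and are not taken from the paper.

```python
import bisect
import random


def poisson_times(rate, horizon, rng):
    """Event times of a Poisson process with the given rate on (0, horizon)."""
    times = []
    t = rng.expovariate(rate)
    while t < horizon:
        times.append(t)
        t += rng.expovariate(rate)
    return times


def even_times(irt, horizon):
    """Evenly spaced response times irt, 2*irt, ... strictly below horizon."""
    times = []
    k = 1
    while k * irt < horizon:
        times.append(k * irt)
        k += 1
    return times


def intervals_hit(boundaries, points):
    """Count intervals (previous boundary, boundary] that hold at least one point.

    Both lists must be sorted; the first interval starts at time 0.
    """
    count, previous = 0, 0.0
    for b in boundaries:
        if bisect.bisect_right(points, b) > bisect.bisect_right(points, previous):
            count += 1
        previous = b
    return count


def simulate(even_spacing, mean_irt=1.0, rate=1.0, horizon=10_000.0, seed=0):
    """Return (rewards_gained, shocks_avoided) for one response pattern.

    The same programmed event times are used for both schedules (assumption).
    """
    rng = random.Random(seed)
    programmed = poisson_times(rate, horizon, rng)
    if even_spacing:
        responses = even_times(mean_irt, horizon)
    else:
        # Variable IRTs: exponentially distributed with the same mean (assumption).
        responses = poisson_times(1.0 / mean_irt, horizon, rng)
    # Positive VI: a response is rewarded if a reward was set up since the last response.
    rewards_gained = intervals_hit(responses, programmed)
    # Negative VI: a shock is cancelled if a response occurred since the last programmed shock.
    shocks_avoided = intervals_hit(programmed, responses)
    return rewards_gained, shocks_avoided
```

Calling `simulate(even_spacing=True)` and `simulate(even_spacing=False)` compares the two response patterns at the same mean IRT. Under these assumed rules the two counts in each pair can differ by at most one, because rewarded responses and cancelled shocks alternate along the timeline. With a programmed rate of 1 and a mean IRT of 1, the expected gain per unit time under these assumptions is about 0.63 for even spacing and 0.5 for exponentially distributed IRTs.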