University of Otago, Dunedin, New Zealand.
J Exp Anal Behav. 2009 Jul;92(1):17-39. doi: 10.1901/jeab.2009.92-17.
Three experiments using human participants varied the distribution of point-gain reinforcers or point-loss punishers in two-alternative signal-detection procedures. Experiment 1 varied the distribution of point-gain reinforcers for correct responses (Group A) and point-loss punishers for errors (Group B) across conditions. Response bias varied systematically as a function of the relative reinforcer or punisher frequencies. Experiment 2 arranged two conditions - one where an unequal ratio of reinforcement (5ratio1 or 1ratio5) was presented without punishment (R-only), and another where the same reinforcer ratio was presented with an equal distribution of point-loss punishers (R+P). Response bias was significantly greater in the R-only condition than the R+P condition, supporting a subtractive model of punishment. Experiment 3 varied the distribution of point-gain reinforcers for correct responses across four unequal reinforcer ratios (5ratio1, 2ratio1, 1ratio2, 1ratio5) both without (R-only) and with (R+P) an equal distribution of point-loss punishers for errors. Response bias varied systematically with changes in relative reinforcer frequency for both R-only and R+P conditions, with 5 out of 8 participants showing increases in sensitivity estimates from R-only to R+P conditions. Overall, the results indicated that punishers have similar but opposite effects to reinforcers in detection procedures and that combined reinforcer and punisher effects might be better modeled by a subtractive punishment model than an additive punishment model, consistent with research using concurrent-schedule choice procedures.
三个使用人类参与者的实验在两种替代信号检测程序中改变了点增益强化物或点损失惩罚的分布。实验 1 在条件之间改变了正确反应的点增益强化物(A 组)和错误的点损失惩罚(B 组)的分布。反应偏差系统地随相对强化物或惩罚物频率的变化而变化。实验 2 安排了两种条件——一种是没有惩罚(仅 R)时呈现不等的强化比(5 比 1 或 1 比 5),另一种是在呈现相同的强化比时呈现等分布的点损失惩罚(R+P)。在仅 R 条件下的反应偏差明显大于 R+P 条件,支持惩罚的减法模型。实验 3 在四个不等的强化比(5 比 1、2 比 1、1 比 2、1 比 5)中改变了正确反应的点增益强化物的分布,同时在错误时也呈现了等分布的点损失惩罚(仅 R 和 R+P)。在仅 R 和 R+P 条件下,反应偏差都随相对强化物频率的变化而系统地变化,8 名参与者中有 5 名从仅 R 条件到 R+P 条件的敏感性估计值增加。总的来说,结果表明惩罚在检测程序中具有与强化物相似但相反的效果,并且强化和惩罚的综合效果可能通过减法惩罚模型而不是加法惩罚模型更好地建模,这与使用并发计划选择程序的研究一致。