Thrailkill Eric A, Trask Sydney, Vidal Pedro, Alcalá José A, Bouton Mark E
Department of Psychological Science, University of Vermont.
J Exp Psychol Anim Learn Cogn. 2018 Oct;44(4):370-384. doi: 10.1037/xan0000188.
Goal-directed actions are instrumental behaviors whose performance depends on the organism's knowledge of the reinforcing outcome's value. In contrast, habits are instrumental behaviors that are insensitive to the outcome's current value. Although habits in everyday life are typically controlled by stimuli that occasion them, most research has studied habits using free-operant procedures in which no discrete stimuli are present to occasion the response. We therefore studied habit learning when rats were reinforced for lever pressing on a random-interval 30-s schedule in the presence of a discriminative stimulus (S) but not in its absence. In Experiment 1, devaluing the reinforcer with taste aversion conditioning weakened instrumental responding in a 30-s S after 4, 22, and 66 sessions of instrumental training. Even extensive practice thus produced goal-directed action, not habit. By contrast, Experiments 2 and 3 found habit when the duration of S was increased from 30 s to 8 min. Experiment 4 then found habit with the 30-s S when it always contained a reinforcer; goal-directed action was maintained when reinforcers were earned at the same rate but occurred in only 50% of Ss (as in the previous experiments). The results challenge the view that habits are an inevitable consequence of repeated reinforcement (as in the law of effect) and instead suggest that discriminated habits develop when the reinforcer becomes predictable. Under those conditions, organisms may pay less attention to their behavior, much as they pay less attention to signals associated with predicted reinforcers in Pavlovian conditioning. (PsycINFO Database Record (c) 2018 APA, all rights reserved).