Balleine B W, Dickinson A
Department of Psychology, UCLA, Los Angeles, CA 90095-1563, USA.
Neuropharmacology. 1998 Apr-May;37(4-5):407-19. doi: 10.1016/s0028-3908(98)00033-1.
Instrumental behaviour is controlled by two systems: a stimulus-response habit mechanism and a goal-directed process that involves two forms of learning. The first is learning about the instrumental contingency between the response and reward, whereas the second consists of the acquisition of incentive value by the reward. Evidence for contingency learning comes from studies of reward devaluation and from demonstrations that instrumental performance is sensitive not only to the probability of contiguous reward but also to the probability of unpaired rewards. The process of incentive learning is evident in the acquisition of control over performance by primary motivational states. Preliminary lesion studies of the rat suggest that the prelimbic area of prefrontal cortex plays a role in contingency learning, whereas incentive learning for food rewards involves the insular cortex.