Skelin Ivan, Hakstol Rhys, VanOyen Jenn, Mudiayi Dominic, Molina Leonardo A, Holec Victoria, Hong Nancy S, Euston David R, McDonald Robert J, Gruber Aaron J
Department of Neuroscience, Canadian Centre for Behavioural Neuroscience, University of Lethbridge, 4401 University Dr. W., T1K 3M4, Lethbridge, AB, Canada.
Eur J Neurosci. 2014 May;39(10):1655-63. doi: 10.1111/ejn.12518. Epub 2014 Mar 6.
We used focal brain lesions in rats to examine how dorsomedial (DMS) and dorsolateral (DLS) regions of the striatum differently contribute to response adaptation driven by the delivery or omission of rewards. Rats performed a binary choice task under two modes: one in which responses were rewarded on half of the trials regardless of choice; and another 'competitive' one in which only unpredictable choices were rewarded. In both modes, control animals were more likely to use a predictable lose-switch strategy than animals with lesions of either DMS or DLS. Animals with lesions of DMS presumably relied more on DLS for behavioural control, and generated repetitive responses in the first mode. These animals then shifted to a random response strategy in the competitive mode, thereby performing better than controls or animals with DLS lesions. Analysis using computational models of reinforcement learning indicated that animals with striatal lesions, particularly of the DLS, had blunted reward sensitivity and less stochasticity in the choice mechanism. These results provide further evidence that the rodent DLS is involved in rapid response adaptation that is more sophisticated than that embodied by the classic notion of habit formation driven by gradual stimulus-response learning.
我们利用大鼠的局灶性脑损伤来研究纹状体的背内侧(DMS)和背外侧(DLS)区域如何以不同方式促进由奖励的给予或缺失所驱动的反应适应。大鼠在两种模式下执行二元选择任务:一种模式是无论选择如何,在一半的试验中做出反应会得到奖励;另一种“竞争性”模式是只有不可预测的选择才会得到奖励。在这两种模式下,与DMS或DLS损伤的动物相比,对照动物更有可能采用可预测的输-换策略。DMS损伤的动物可能更多地依赖DLS进行行为控制,并在第一种模式下产生重复性反应。然后这些动物在竞争性模式下转向随机反应策略,从而比对照动物或DLS损伤的动物表现更好。使用强化学习计算模型进行的分析表明,纹状体损伤的动物,特别是DLS损伤的动物,奖励敏感性降低,选择机制中的随机性也较小。这些结果进一步证明,啮齿动物的DLS参与了比由渐进性刺激-反应学习驱动的经典习惯形成概念所体现的更为复杂的快速反应适应。