Beeler Jeff A, Daw Nathaniel, Frazier Cristianne R M, Zhuang Xiaoxi
Department of Neurobiology, University of Chicago, Chicago, IL, USA.
Front Behav Neurosci. 2010 Nov 4;4:170. doi: 10.3389/fnbeh.2010.00170. eCollection 2010.
The impact of dopamine on adaptive behavior in a naturalistic environment is largely unexamined. Experimental work suggests that phasic dopamine is central to reinforcement learning whereas tonic dopamine may modulate performance without altering learning per se; however, this idea has not been developed formally or integrated with computational models of dopamine function. We quantitatively evaluate the role of tonic dopamine in these functions by studying the behavior of hyperdopaminergic DAT knockdown mice in an instrumental task in a semi-naturalistic homecage environment. In this "closed economy" paradigm, subjects earn all of their food by pressing either of two levers, but the relative cost for food on each lever shifts frequently. Compared to wild-type mice, hyperdopaminergic mice allocate more lever presses on high-cost levers, thus working harder to earn a given amount of food and maintain their body weight. However, both groups show a similarly quick reaction to shifts in lever cost, suggesting that the hyperdominergic mice are not slower at detecting changes, as with a learning deficit. We fit the lever choice data using reinforcement learning models to assess the distinction between acquisition and expression the models formalize. In these analyses, hyperdopaminergic mice displayed normal learning from recent reward history but diminished capacity to exploit this learning: a reduced coupling between choice and reward history. These data suggest that dopamine modulates the degree to which prior learning biases action selection and consequently alters the expression of learned, motivated behavior.
多巴胺在自然环境中对适应性行为的影响在很大程度上尚未得到研究。实验工作表明,相位性多巴胺对强化学习至关重要,而持续性多巴胺可能在不改变学习本身的情况下调节行为表现;然而,这一观点尚未得到正式阐述,也未与多巴胺功能的计算模型相结合。我们通过研究超多巴胺能多巴胺转运体(DAT)基因敲除小鼠在半自然的笼内环境中的一项操作性任务中的行为,定量评估持续性多巴胺在这些功能中的作用。在这种“封闭经济”范式中,实验对象通过按压两个杠杆中的任何一个来获取所有食物,但每个杠杆上食物的相对成本会频繁变化。与野生型小鼠相比,超多巴胺能小鼠在高成本杠杆上分配了更多的杠杆按压次数,因此为获取一定量的食物并维持体重而更加努力。然而,两组小鼠对杠杆成本变化的反应同样迅速,这表明超多巴胺能小鼠在检测变化方面并不比学习缺陷型小鼠慢。我们使用强化学习模型拟合杠杆选择数据,以评估模型所形式化的习得与表现之间的区别。在这些分析中,超多巴胺能小鼠从近期奖励历史中表现出正常的学习能力,但利用这种学习的能力有所下降:选择与奖励历史之间的耦合减少。这些数据表明,多巴胺调节先前学习对行动选择产生偏差的程度,从而改变习得的、有动机行为的表现。