Department of Psychology, University of Cambridge, Downing Street, Cambridge, CB2 3EB, UK.
Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge, CB2 3EB, UK.
Neuropsychopharmacology. 2019 Dec;44(13):2163-2173. doi: 10.1038/s41386-019-0381-0. Epub 2019 Apr 6.
Addiction is regarded as a disorder of inflexible choice with behavior dominated by immediate positive rewards over longer-term negative outcomes. However, the psychological mechanisms underlying the effects of self-administered drugs on behavioral flexibility are not well understood. To investigate whether drug exposure causes asymmetric effects on positive and negative outcomes we used a reversal learning procedure to assess how reward contingencies are utilized to guide behavior in rats previously exposed to intravenous cocaine self-administration (SA). Twenty-four rats were screened for anxiety in an open field prior to acquisition of cocaine SA over six daily sessions with subsequent long-access cocaine SA for 7 days. Control rats (n = 24) were trained to lever-press for food under a yoked schedule of reinforcement. Higher rates of cocaine SA were predicted by increased anxiety and preceded impaired reversal learning, expressed by a decrease in lose-shift as opposed to win-stay probability. A model-free reinforcement learning algorithm revealed that rats with high, but not low cocaine escalation failed to exploit previous reward learning and were more likely to repeat the same response as the previous trial. Eight-day withdrawal from high cocaine escalation was associated, respectively, with increased and decreased dopamine receptor D2 (DRD2) and serotonin receptor 2C (HTR2C) expression in the ventral striatum compared with controls. Dopamine receptor D1 (DRD1) expression was also significantly reduced in the orbitofrontal cortex of high cocaine-escalating rats. These findings indicate that withdrawal from escalated cocaine SA disrupts how negative feedback is used to guide goal-directed behavior for natural reinforcers and that trait anxiety may be a latent variable underlying this interaction.
成瘾被认为是一种行为灵活性障碍,其行为主要由即时的正奖励主导,而不是长期的负后果。然而,自我给药对行为灵活性影响的心理机制尚不清楚。为了研究药物暴露是否对正性和负性结果产生不对称影响,我们使用反转学习程序来评估先前接受过静脉内可卡因自我给药(SA)的大鼠如何利用奖励关联来指导行为。24 只大鼠在进行可卡因 SA 六天的日常训练之前,在一个开阔场中进行了焦虑筛查,随后进行了 7 天的长时间可卡因 SA。对照大鼠(n=24)被训练在强化的束缚日程下按压杠杆以获取食物。较高的可卡因 SA 率由焦虑增加预测,并先于反转学习受损,表现为与赢留相比,失移概率降低。无模型强化学习算法表明,高可卡因上升而非低可卡因上升的大鼠未能利用先前的奖励学习,并且更有可能重复与前一次试验相同的反应。与对照组相比,高可卡因上升大鼠的腹侧纹状体中多巴胺受体 D2(DRD2)和 5-羟色胺受体 2C(HTR2C)表达分别增加和减少,而眶额皮质中的多巴胺受体 D1(DRD1)表达也显著降低。这些发现表明,从可卡因 SA 上升中戒断会干扰负反馈如何用于指导自然强化物的目标导向行为,而特质焦虑可能是这种相互作用的潜在变量。