Center for Molecular and Behavioral Neuroscience, Rutgers University, Newark, NJ.
Department of Neurology and Neurosurgery, Montreal Neurological Institute, McGill University, Montreal, Canada.
Nicotine Tob Res. 2020 Feb 6;22(2):164-171. doi: 10.1093/ntr/nty136.
Alterations in dopamine signaling play a key role in reinforcement learning and nicotine addiction, but the relationship between these two processes has not been well characterized. We investigated this relationship in young adult smokers using a combination of behavioral and computational measures of reinforcement learning.
We asked moderately dependent smokers to engage in a reinforcement learning task three times: smoking as usual, smoking abstinence, and cigarette consumption. Participants' trial-to-trial training choices were modeled using a reinforcement learning model that calculates separate learning rates associated with positive and negative prediction errors.
We found that learning from positive prediction error signals is reduced during smoking abstinence and enhanced following cigarette consumption. By contrast, learning from negative prediction error signals was enhanced during smoking abstinence and reduced following cigarette consumption. Finally, when tested with novel pairs of stimuli, participants were relatively better at selecting the positive feedback predicting stimuli than avoiding the negative feedback predicting stimuli during the smoking as usual session, a pattern that reversed following cigarette consumption.
These findings provide a specific computational account of altered reinforcement learning induced by smoking state (abstinence and consumption) and may represent a unique target for treatment of nicotine addiction.
This study illustrates the potential of computational psychiatry for understanding reinforcement learning deficits associated with substance use disorders in general and nicotine addiction in particular. We found that learning from positive prediction error signals is reduced during smoking abstinence and enhanced following cigarette consumption. By contrast, learning from negative prediction error signals was enhanced during smoking abstinence and reduced following cigarette consumption. By highlighting important computational differences between three states of smoking, these findings hold out promise for integrating experimental, computational, and theoretical analyses of decision-making function together with research on addiction-related disorders.
多巴胺信号的改变在强化学习和尼古丁成瘾中起着关键作用,但这两个过程之间的关系尚未得到很好的描述。我们使用强化学习的行为和计算测量相结合的方法,研究了年轻成年吸烟者的这种关系。
我们要求中度依赖的吸烟者进行三次强化学习任务:照常吸烟、戒烟和吸烟。使用一种强化学习模型对参与者的试验到试验训练选择进行建模,该模型计算与正和负预测误差相关的单独学习率。
我们发现,在戒烟期间,从正预测误差信号中学习的能力降低,而在吸烟后增强。相比之下,在戒烟期间,从负预测误差信号中学习的能力增强,而在吸烟后降低。最后,当用新的刺激对进行测试时,参与者在通常吸烟的情况下,相对于避免负反馈预测刺激,更善于选择预测正反馈的刺激,而在吸烟后,这种模式发生逆转。
这些发现为吸烟状态(戒烟和吸烟)引起的强化学习改变提供了特定的计算解释,可能代表治疗尼古丁成瘾的独特目标。
这项研究说明了计算精神病学在理解一般物质使用障碍和尼古丁成瘾相关的强化学习缺陷方面的潜力。我们发现,在戒烟期间,从正预测误差信号中学习的能力降低,而在吸烟后增强。相比之下,在戒烟期间,从负预测误差信号中学习的能力增强,而在吸烟后降低。通过突出吸烟三种状态之间的重要计算差异,这些发现为整合决策功能的实验、计算和理论分析与成瘾相关障碍的研究提供了希望。