Wiehler Antonius, Peters Jan
Department of Systems Neuroscience, University Medical Centre Hamburg-Eppendorf, Hamburg, Germany.
Institut du Cerveau et de la Moelle épinière (ICM), INSERM U 1127, CNRS UMR 7225, Sorbonne Universités Paris, France.
Comput Psychiatr. 2024 Mar 20;8(1):23-45. doi: 10.5334/cpsy.104. eCollection 2024.
Gambling disorder is associated with deficits in reward-based learning, but the underlying computational mechanisms are still poorly understood. Here, we examined this issue using a stationary reinforcement learning task in combination with computational modeling and functional resonance imaging (fMRI) in individuals that regular participate in gambling (n = 23, seven fulfilled one to three DSM 5 criteria for gambling disorder, sixteen fulfilled four or more) and matched controls (n = 23). As predicted, the gambling group exhibited substantially reduced accuracy, whereas overall response times (RTs) were not reliably different between groups. We then used comprehensive modeling using reinforcement learning drift diffusion models (RLDDMs) in combination with hierarchical Bayesian parameter estimation to shed light on the computational underpinnings of this performance deficit. In both groups, an RLDDM in which both non-decision time and decision threshold (boundary separation) changed over the course of the experiment accounted for the data best. The model showed good parameter and model recovery, and posterior predictive checks revealed that, in both groups, the model accurately reproduced the evolution of accuracies and RTs over time. Modeling revealed that, compared to controls, the learning impairment in the gambling group was linked to a more rapid reduction in decision thresholds over time, and a reduced impact of value-differences on the drift rate. The gambling group also showed shorter non-decision times. FMRI analyses replicated effects of prediction error coding in the ventral striatum and value coding in the ventro-medial prefrontal cortex, but there was no credible evidence for group differences in these effects. Taken together, our findings show that reinforcement learning impairments in disordered gambling are linked to both maladaptive decision threshold adjustments and a reduced consideration of option values in the choice process.
赌博障碍与基于奖励的学习缺陷有关,但潜在的计算机制仍知之甚少。在此,我们通过一项静态强化学习任务,结合计算建模和功能磁共振成像(fMRI),对这一问题进行了研究,研究对象为经常参与赌博的个体(n = 23,其中7人符合1至3条《精神疾病诊断与统计手册》第5版(DSM 5)的赌博障碍标准,16人符合4条或更多标准)以及匹配的对照组(n = 23)。正如预期的那样,赌博组的准确率大幅降低,而两组之间的总体反应时间(RTs)没有可靠的差异。然后,我们使用强化学习漂移扩散模型(RLDDMs)结合分层贝叶斯参数估计进行综合建模,以阐明这种性能缺陷的计算基础。在两组中,一个在实验过程中非决策时间和决策阈值(边界分离)都发生变化的RLDDM对数据的拟合效果最佳。该模型显示出良好的参数和模型恢复能力,后验预测检验表明,在两组中,该模型都准确地再现了准确率和反应时间随时间的变化。建模结果显示,与对照组相比,赌博组的学习障碍与决策阈值随时间更快的降低以及价值差异对漂移率的影响减小有关。赌博组的非决策时间也更短。功能磁共振成像分析重复了腹侧纹状体中预测误差编码和腹内侧前额叶皮质中价值编码的效应,但没有可靠证据表明这些效应存在组间差异。综上所述,我们的研究结果表明,无序赌博中的强化学习障碍与适应不良的决策阈值调整以及选择过程中对选项价值的考虑减少有关。