Centre for Brain, Mind and Markets, The University of Melbourne, Carlton, Australia.
Center for the Promotion of Social Data Science Education and Research, Hitotsubashi University, Tokyo, Japan.
PLoS Biol. 2023 Mar 14;21(3):e3002031. doi: 10.1371/journal.pbio.3002031. eCollection 2023 Mar.
Obsessive-compulsive disorder (OCD) and pathological gambling (PG) are accompanied by deficits in behavioural flexibility. In reinforcement learning, this inflexibility can reflect asymmetric learning from outcomes above and below expectations. In alternative frameworks, it reflects perseveration independent of learning. Here, we examine evidence for asymmetric reward-learning in OCD and PG by leveraging model-based functional magnetic resonance imaging (fMRI). Compared with healthy controls (HC), OCD patients exhibited a lower learning rate for worse-than-expected outcomes, which was associated with the attenuated encoding of negative reward prediction errors in the dorsomedial prefrontal cortex and the dorsal striatum. PG patients showed higher and lower learning rates for better- and worse-than-expected outcomes, respectively, accompanied by higher encoding of positive reward prediction errors in the anterior insula than HC. Perseveration did not differ considerably between the patient groups and HC. These findings elucidate the neural computations of reward-learning that are altered in OCD and PG, providing a potential account of behavioural inflexibility in those mental disorders.
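The two accounts contrasted above can be sketched in a few lines. This is a minimal illustration and not the authors' fitted model: it assumes a Rescorla-Wagner-style value update with separate learning rates for positive and negative reward prediction errors, and a softmax choice rule with an additive bonus for repeating the previous choice. All names (`update_value`, `choice_probabilities`, `alpha_pos`, `alpha_neg`, `beta`, `perseveration`) and all parameter values are hypothetical.

```python
import math


def update_value(value, reward, alpha_pos, alpha_neg):
    """One asymmetric value update; returns (new_value, prediction_error).

    Assumption: better-than-expected outcomes (delta >= 0) use alpha_pos,
    worse-than-expected outcomes (delta < 0) use alpha_neg.
    """
    delta = reward - value  # reward prediction error
    alpha = alpha_pos if delta >= 0 else alpha_neg
    return value + alpha * delta, delta


def run_updates(rewards, alpha_pos, alpha_neg, initial_value=0.5):
    """Apply the asymmetric update to a sequence of rewards for one option."""
    value = initial_value
    trajectory = []
    for reward in rewards:
        value, _ = update_value(value, reward, alpha_pos, alpha_neg)
        trajectory.append(value)
    return trajectory


def choice_probabilities(values, previous_choice, beta, perseveration):
    """Softmax over option values with a learning-independent repeat bonus.

    Assumption: perseveration is added to the logit of the option chosen on
    the previous trial; previous_choice=None means no bonus is applied.
    """
    logits = [
        beta * v + (perseveration if i == previous_choice else 0.0)
        for i, v in enumerate(values)
    ]
    max_logit = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(logit - max_logit) for logit in logits]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    # A low alpha_neg means losses barely lower the learned value.
    print(run_updates([1.0, 0.0, 0.0], alpha_pos=0.4, alpha_neg=0.1))
    # Equal values, but the previously chosen option 0 is favoured.
    print(choice_probabilities([0.5, 0.5], previous_choice=0, beta=3.0, perseveration=1.0))
```

In this sketch, a low `alpha_neg` keeps the value of a previously rewarded option high after losses, while a positive `perseveration` term favours repeating a choice regardless of its learned value. Both can produce inflexible behaviour, which is why the two mechanisms need to be separated when fitting choice data.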