Suppr超能文献

使用强化学习模型和威斯康星卡片分类测验模拟法分离额叶病变引起的规则价值学习缺陷

Dissociating Frontal Lobe Lesion Induced Deficits in Rule Value Learning Using Reinforcement Learning Models and a WCST Analog.

作者信息

Capkova Lucie, Ainsworth Matthew, Mansouri Farshad A, Buckley Mark J

机构信息

Department of Experimental Psychology, University of Oxford, Oxford OX1 3SR, United Kingdom

Cognitive Neuroscience Laboratory, Department of Physiology, Monash Biomedicine Discovery Institute, Monash University, Melbourne, Victoria 3800, Australia.

出版信息

eNeuro. 2025 May 20;12(5). doi: 10.1523/ENEURO.0117-25.2025. Print 2025 May.

Abstract

Distinct frontal regions make dissociable contributions to rule-guided decision-making, including the ability to learn and exploit associations between abstract rules and reward value, maintain those rules in memory, and evaluate choice outcomes. Value-based learning can be quantified using reinforcement learning (RL) models predicting optimal trial-wise choices and estimating learning rates, which can then be related to the intact functioning of specific brain areas by combining a modeling approach with lesion-behavioral data. We applied a three-parameter feedback-dependent RL model to behavioral data obtained from macaques with circumscribed lesions to the principal sulcus (PS), anterior cingulate cortex (ACC), orbitofrontal cortex (OFC), superior dorsolateral prefrontal cortex (sdlPFC), and frontopolar cortex (FPC) performing a Wisconsin card sorting task (WCST) analog. Our modeling-based approach identified distinct lesion effects on component cognitive mechanisms contributing to WCST performance. OFC lesions decreased the rate of rule value updating following both positive and negative feedback. In contrast, we found no deficit in rule value updating following PS lesions, which instead made monkeys less likely to repeat correct choices when rule values were well established, suggesting a crucial role of the PS in the working memory maintenance of rule representations. Finally, ACC lesions produced a specific deficit in learning from negative feedback, as well as impaired the ability to repeat choices following highly surprising reward, supporting a proposed role for ACC in flexibly switching between a trial-and-error mode and a working memory mode in response to increased error likelihood.

摘要

不同的额叶区域对规则引导的决策有不同的贡献,包括学习和利用抽象规则与奖励价值之间关联的能力、在记忆中维持这些规则的能力以及评估选择结果的能力。基于价值的学习可以使用强化学习(RL)模型进行量化,该模型预测最优的逐次试验选择并估计学习率,然后通过将建模方法与损伤行为数据相结合,将这些学习率与特定脑区的完整功能联系起来。我们将一个三参数反馈依赖的RL模型应用于从患有局限于中央沟(PS)、前扣带回皮质(ACC)、眶额皮质(OFC)、背外侧前额叶皮质(sdlPFC)和额极皮质(FPC)损伤的猕猴获得的行为数据,这些猕猴执行了威斯康星卡片分类任务(WCST)模拟实验。我们基于建模的方法确定了对WCST表现有贡献的组成认知机制的不同损伤效应。OFC损伤降低了正负反馈后规则价值更新的速率。相比之下,我们发现PS损伤后规则价值更新没有缺陷,反而在规则价值确立后使猴子重复正确选择的可能性降低,这表明PS在规则表征的工作记忆维持中起关键作用。最后,ACC损伤在从负反馈学习方面产生了特定缺陷,并且在获得高度意外奖励后重复选择的能力受损,支持了ACC在响应错误可能性增加时在试错模式和工作记忆模式之间灵活切换的提议作用。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验