使用强化学习模型和威斯康星卡片分类测验模拟法分离额叶病变引起的规则价值学习缺陷

Dissociating Frontal Lobe Lesion Induced Deficits in Rule Value Learning Using Reinforcement Learning Models and a WCST Analog.

作者信息

Capkova Lucie, Ainsworth Matthew, Mansouri Farshad A, Buckley Mark J

机构信息

Department of Experimental Psychology, University of Oxford, Oxford OX1 3SR, United Kingdom

Cognitive Neuroscience Laboratory, Department of Physiology, Monash Biomedicine Discovery Institute, Monash University, Melbourne, Victoria 3800, Australia.

出版信息

eNeuro. 2025 May 20;12(5). doi: 10.1523/ENEURO.0117-25.2025. Print 2025 May.

DOI:10.1523/ENEURO.0117-25.2025

PMID:40393730

Abstract

Distinct frontal regions make dissociable contributions to rule-guided decision-making, including the ability to learn and exploit associations between abstract rules and reward value, maintain those rules in memory, and evaluate choice outcomes. Value-based learning can be quantified using reinforcement learning (RL) models predicting optimal trial-wise choices and estimating learning rates, which can then be related to the intact functioning of specific brain areas by combining a modeling approach with lesion-behavioral data. We applied a three-parameter feedback-dependent RL model to behavioral data obtained from macaques with circumscribed lesions to the principal sulcus (PS), anterior cingulate cortex (ACC), orbitofrontal cortex (OFC), superior dorsolateral prefrontal cortex (sdlPFC), and frontopolar cortex (FPC) performing a Wisconsin card sorting task (WCST) analog. Our modeling-based approach identified distinct lesion effects on component cognitive mechanisms contributing to WCST performance. OFC lesions decreased the rate of rule value updating following both positive and negative feedback. In contrast, we found no deficit in rule value updating following PS lesions, which instead made monkeys less likely to repeat correct choices when rule values were well established, suggesting a crucial role of the PS in the working memory maintenance of rule representations. Finally, ACC lesions produced a specific deficit in learning from negative feedback, as well as impaired the ability to repeat choices following highly surprising reward, supporting a proposed role for ACC in flexibly switching between a trial-and-error mode and a working memory mode in response to increased error likelihood.

摘要

不同的额叶区域对规则引导的决策有不同的贡献，包括学习和利用抽象规则与奖励价值之间关联的能力、在记忆中维持这些规则的能力以及评估选择结果的能力。基于价值的学习可以使用强化学习（RL）模型进行量化，该模型预测最优的逐次试验选择并估计学习率，然后通过将建模方法与损伤行为数据相结合，将这些学习率与特定脑区的完整功能联系起来。我们将一个三参数反馈依赖的RL模型应用于从患有局限于中央沟（PS）、前扣带回皮质（ACC）、眶额皮质（OFC）、背外侧前额叶皮质（sdlPFC）和额极皮质（FPC）损伤的猕猴获得的行为数据，这些猕猴执行了威斯康星卡片分类任务（WCST）模拟实验。我们基于建模的方法确定了对WCST表现有贡献的组成认知机制的不同损伤效应。OFC损伤降低了正负反馈后规则价值更新的速率。相比之下，我们发现PS损伤后规则价值更新没有缺陷，反而在规则价值确立后使猴子重复正确选择的可能性降低，这表明PS在规则表征的工作记忆维持中起关键作用。最后，ACC损伤在从负反馈学习方面产生了特定缺陷，并且在获得高度意外奖励后重复选择的能力受损，支持了ACC在响应错误可能性增加时在试错模式和工作记忆模式之间灵活切换的提议作用。

相似文献

Dissociating Frontal Lobe Lesion Induced Deficits in Rule Value Learning Using Reinforcement Learning Models and a WCST Analog.

eNeuro. 2025 May 20;12(5). doi: 10.1523/ENEURO.0117-25.2025. Print 2025 May.

Dissociable components of rule-guided behavior depend on distinct medial and prefrontal regions.

Science. 2009 Jul 3;325(5936):52-8. doi: 10.1126/science.1172377.

Role of the monkey orbitofrontal cortex in processing the choice history during reward-based decision-making.

Cereb Cortex. 2025 Jun 4;35(6). doi: 10.1093/cercor/bhaf147.

Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.

Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Impact of symmetry in local learning rules on predictive neural representations and generalization in spatial navigation.

PLoS Comput Biol. 2025 Jun 23;21(6):e1013056. doi: 10.1371/journal.pcbi.1013056. eCollection 2025 Jun.

Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.

Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.

Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.

Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

Adefovir dipivoxil and pegylated interferon alfa-2a for the treatment of chronic hepatitis B: a systematic review and economic evaluation.

Health Technol Assess. 2006 Aug;10(28):iii-iv, xi-xiv, 1-183. doi: 10.3310/hta10280.

Atypical antipsychotics for disruptive behaviour disorders in children and youths.

Cochrane Database Syst Rev. 2017 Aug 9;8(8):CD008559. doi: 10.1002/14651858.CD008559.pub3.

本文引用的文献

Memories or decisions? Bridging accounts of frontopolar function.

Neuropsychologia. 2025 May 3;211:109119. doi: 10.1016/j.neuropsychologia.2025.109119. Epub 2025 Mar 8.

A Comparison of Rapid Rule-Learning Strategies in Humans and Monkeys.

J Neurosci. 2024 Jul 10;44(28):e0231232024. doi: 10.1523/JNEUROSCI.0231-23.2024.

Mapping causal links between prefrontal cortical regions and intra-individual behavioral variability.

Nat Commun. 2024 Jan 2;15(1):140. doi: 10.1038/s41467-023-44341-5.

A frontopolar-temporal circuit determines the impact of social information in macaque decision making.

Neuron. 2024 Jan 3;112(1):84-92.e6. doi: 10.1016/j.neuron.2023.09.035. Epub 2023 Oct 19.

Frontal and temporal coding dynamics in successive steps of complex behavior.

Neuron. 2023 Feb 1;111(3):430-443.e3. doi: 10.1016/j.neuron.2022.11.004. Epub 2022 Dec 5.

Frontopolar cortex shapes brain network structure across prefrontal and posterior cingulate cortex.

Prog Neurobiol. 2022 Oct;217:102314. doi: 10.1016/j.pneurobio.2022.102314. Epub 2022 Jul 4.

Dimension of visual information interacts with working memory in monkeys and humans.

Sci Rep. 2022 Mar 29;12(1):5335. doi: 10.1038/s41598-022-09367-7.

Asymmetric reinforcement learning facilitates human inference of transitive relations.

Nat Hum Behav. 2022 Apr;6(4):555-564. doi: 10.1038/s41562-021-01263-w. Epub 2022 Jan 31.

The neural substrate and underlying mechanisms of executive control fluctuations in primates.

Prog Neurobiol. 2022 Feb;209:102216. doi: 10.1016/j.pneurobio.2022.102216. Epub 2022 Jan 4.

Parallel model-based and model-free reinforcement learning for card sorting performance.

Sci Rep. 2020 Sep 22;10(1):15464. doi: 10.1038/s41598-020-72407-7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用强化学习模型和威斯康星卡片分类测验模拟法分离额叶病变引起的规则价值学习缺陷

Dissociating Frontal Lobe Lesion Induced Deficits in Rule Value Learning Using Reinforcement Learning Models and a WCST Analog.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献