• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

认知控制可预测基于模型的强化学习的使用情况。

Cognitive control predicts use of model-based reinforcement learning.

作者信息

Otto A Ross, Skatova Anya, Madlon-Kay Seth, Daw Nathaniel D

机构信息

New York University.

出版信息

J Cogn Neurosci. 2015 Feb;27(2):319-33. doi: 10.1162/jocn_a_00709.

DOI:10.1162/jocn_a_00709
PMID:25170791
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4387848/
Abstract

Accounts of decision-making and its neural substrates have long posited the operation of separate, competing valuation systems in the control of choice behavior. Recent theoretical and experimental work suggest that this classic distinction between behaviorally and neurally dissociable systems for habitual and goal-directed (or more generally, automatic and controlled) choice may arise from two computational strategies for reinforcement learning (RL), called model-free and model-based RL, but the cognitive or computational processes by which one system may dominate over the other in the control of behavior is a matter of ongoing investigation. To elucidate this question, we leverage the theoretical framework of cognitive control, demonstrating that individual differences in utilization of goal-related contextual information--in the service of overcoming habitual, stimulus-driven responses--in established cognitive control paradigms predict model-based behavior in a separate, sequential choice task. The behavioral correspondence between cognitive control and model-based RL compellingly suggests that a common set of processes may underpin the two behaviors. In particular, computational mechanisms originally proposed to underlie controlled behavior may be applicable to understanding the interactions between model-based and model-free choice behavior.

摘要

关于决策及其神经基础的描述长期以来一直假定,在选择行为的控制中存在着独立的、相互竞争的估值系统。最近的理论和实验工作表明,这种在行为和神经上可分离的习惯性和目标导向性(或更一般地说,自动和受控)选择系统之间的经典区分,可能源于强化学习(RL)的两种计算策略,即无模型RL和基于模型的RL。但是,在行为控制中一个系统可能比另一个系统占主导地位的认知或计算过程,仍是一个正在研究的问题。为了阐明这个问题,我们利用认知控制的理论框架,证明在既定的认知控制范式中,为克服习惯性的、刺激驱动的反应而利用与目标相关的情境信息的个体差异,能够预测在一个单独的顺序选择任务中的基于模型的行为。认知控制与基于模型的RL之间的行为对应有力地表明,可能有一组共同的过程支撑这两种行为。特别是,最初提出的作为受控行为基础的计算机制,可能适用于理解基于模型和无模型选择行为之间的相互作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a921/4387848/5c9026e7eb0c/nihms675805f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a921/4387848/f203332a63be/nihms675805f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a921/4387848/2e193d308e4b/nihms675805f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a921/4387848/9e4688d52871/nihms675805f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a921/4387848/5c9026e7eb0c/nihms675805f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a921/4387848/f203332a63be/nihms675805f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a921/4387848/2e193d308e4b/nihms675805f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a921/4387848/9e4688d52871/nihms675805f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a921/4387848/5c9026e7eb0c/nihms675805f4.jpg

相似文献

1
Cognitive control predicts use of model-based reinforcement learning.认知控制可预测基于模型的强化学习的使用情况。
J Cogn Neurosci. 2015 Feb;27(2):319-33. doi: 10.1162/jocn_a_00709.
2
Multiple memory systems as substrates for multiple decision systems.多种记忆系统作为多种决策系统的基础。
Neurobiol Learn Mem. 2015 Jan;117:4-13. doi: 10.1016/j.nlm.2014.04.014. Epub 2014 May 15.
3
Working-memory capacity protects model-based learning from stress.工作记忆容量能保护基于模型的学习免受压力影响。
Proc Natl Acad Sci U S A. 2013 Dec 24;110(52):20941-6. doi: 10.1073/pnas.1312011110. Epub 2013 Dec 9.
4
A new computational account of cognitive control over reinforcement-based decision-making: Modeling of a probabilistic learning task.一种关于对基于强化的决策进行认知控制的新计算解释:概率学习任务的建模
Neural Netw. 2015 Nov;71:112-23. doi: 10.1016/j.neunet.2015.08.006. Epub 2015 Aug 20.
5
Navigating complex decision spaces: Problems and paradigms in sequential choice.导航复杂决策空间:序列选择中的问题和范式。
Psychol Bull. 2014 Mar;140(2):466-86. doi: 10.1037/a0033455. Epub 2013 Jul 8.
6
The curse of planning: dissecting multiple reinforcement-learning systems by taxing the central executive.计划的诅咒:通过征税中央执行系统来剖析多个强化学习系统。
Psychol Sci. 2013 May;24(5):751-61. doi: 10.1177/0956797612463080. Epub 2013 Apr 4.
7
Model-Free and Model-Based Influences in Addiction-Related Behaviors.无模型和基于模型的影响因素在成瘾相关行为中的作用。
Biol Psychiatry. 2019 Jun 1;85(11):936-945. doi: 10.1016/j.biopsych.2018.12.017. Epub 2019 Jan 4.
8
Neural Index of Reinforcement Learning Predicts Improved Stimulus-Response Retention under High Working Memory Load.神经强化学习指数预测在高工作记忆负荷下改善刺激-反应保持。
J Neurosci. 2023 Apr 26;43(17):3131-3143. doi: 10.1523/JNEUROSCI.1274-22.2023. Epub 2023 Mar 17.
9
Model-Based Reasoning in Humans Becomes Automatic with Training.人类基于模型的推理经过训练后会变得自动化。
PLoS Comput Biol. 2015 Sep 17;11(9):e1004463. doi: 10.1371/journal.pcbi.1004463. eCollection 2015 Sep.
10
Model-based learning protects against forming habits.基于模型的学习可防止形成习惯。
Cogn Affect Behav Neurosci. 2015 Sep;15(3):523-36. doi: 10.3758/s13415-015-0347-6.

引用本文的文献

1
The interplay between executive functions and updating predictive representations.执行功能与更新预测表征之间的相互作用。
Sci Rep. 2025 Aug 20;15(1):30555. doi: 10.1038/s41598-025-14876-2.
2
Forward Planning in a Population-Based Alcohol Use Disorder Sample.基于人群的酒精使用障碍样本中的前瞻性规划
Addict Biol. 2025 Aug;30(8):e70072. doi: 10.1111/adb.70072.
3
Model-based algorithms shape automatic evaluative processing.基于模型的算法塑造自动评价性加工。

本文引用的文献

1
Disorders of compulsivity: a common bias towards learning habits.强迫性障碍:对学习习惯的一种常见偏好。
Mol Psychiatry. 2015 Mar;20(3):345-52. doi: 10.1038/mp.2014.44. Epub 2014 May 20.
2
Anterior cingulate cortex instigates adaptive switches in choice by integrating immediate and delayed components of value in ventromedial prefrontal cortex.前扣带皮层通过整合腹内侧前额叶皮层中即时和延迟的价值成分来引发选择的适应性转变。
J Neurosci. 2014 Feb 26;34(9):3340-9. doi: 10.1523/JNEUROSCI.4313-13.2014.
3
Neural computations underlying arbitration between model-based and model-free learning.
Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2417068122. doi: 10.1073/pnas.2417068122. Epub 2025 Jun 20.
4
Using computational models of learning to advance cognitive behavioral therapy.利用学习的计算模型推进认知行为疗法。
Commun Psychol. 2025 Apr 27;3(1):72. doi: 10.1038/s44271-025-00251-4.
5
Rewards transiently and automatically enhance sustained attention.奖励会短暂且自动地增强持续注意力。
J Exp Psychol Gen. 2025 Apr;154(4):1063-1079. doi: 10.1037/xge0001727. Epub 2025 Jan 20.
6
Maturation of striatal dopamine supports the development of habitual behavior through adolescence.纹状体多巴胺的成熟通过青春期支持习惯性行为的发展。
bioRxiv. 2025 Jan 6:2025.01.06.631527. doi: 10.1101/2025.01.06.631527.
7
Cognitive tasks, anatomical MRI, and functional MRI data evaluating the construct of self-regulation.评估自我调节结构的认知任务、解剖磁共振成像和功能磁共振成像数据。
Sci Data. 2024 Jul 20;11(1):809. doi: 10.1038/s41597-024-03636-y.
8
Personalized Cognitive Health in Psychiatry: Current State and the Promise of Computational Methods.精神病学中的个性化认知健康:现状和计算方法的前景。
Schizophr Bull. 2024 Aug 27;50(5):1028-1038. doi: 10.1093/schbul/sbae108.
9
Gambling Environment Exposure Increases Temporal Discounting but Improves Model-Based Control in Regular Slot-Machine Gamblers.接触赌博环境会增加时间折扣,但会改善经常玩老虎机的赌徒基于模型的控制能力。
Comput Psychiatr. 2022 Jul 5;6(1):142-165. doi: 10.5334/cpsy.84. eCollection 2022.
10
Cognitive Control in Schizophrenia: Advances in Computational Approaches.精神分裂症中的认知控制:计算方法的进展
Curr Dir Psychol Sci. 2024 Feb;33(1):35-42. doi: 10.1177/09637214231205220. Epub 2023 Nov 10.
基于模型和无模型学习之间仲裁的神经计算。
Neuron. 2014 Feb 5;81(3):687-99. doi: 10.1016/j.neuron.2013.11.028.
4
Of goals and habits: age-related and individual differences in goal-directed decision-making.关于目标和习惯:目标导向决策中的年龄相关和个体差异。
Front Neurosci. 2013 Dec 24;7:253. doi: 10.3389/fnins.2013.00253. eCollection 2013.
5
Working-memory capacity protects model-based learning from stress.工作记忆容量能保护基于模型的学习免受压力影响。
Proc Natl Acad Sci U S A. 2013 Dec 24;110(52):20941-6. doi: 10.1073/pnas.1312011110. Epub 2013 Dec 9.
6
Disruption of dorsolateral prefrontal cortex decreases model-based in favor of model-free control in humans.背外侧前额叶皮层的破坏导致人类从基于模型的控制转向基于模型的控制。
Neuron. 2013 Nov 20;80(4):914-9. doi: 10.1016/j.neuron.2013.08.009. Epub 2013 Oct 24.
7
Goals and habits in the brain.大脑中的目标和习惯。
Neuron. 2013 Oct 16;80(2):312-25. doi: 10.1016/j.neuron.2013.09.007.
8
Extraversion differentiates between model-based and model-free strategies in a reinforcement learning task.外向性在强化学习任务中区分基于模型和无模型策略。
Front Hum Neurosci. 2013 Sep 3;7:525. doi: 10.3389/fnhum.2013.00525. eCollection 2013.
9
What is the subjective cost of cognitive effort? Load, trait, and aging effects revealed by economic preference.认知努力的主观成本是多少?通过经济偏好揭示的负载、特质和老化效应。
PLoS One. 2013 Jul 22;8(7):e68210. doi: 10.1371/journal.pone.0068210. Print 2013.
10
The expected value of control: an integrative theory of anterior cingulate cortex function.控制的预期价值:前扣带皮层功能的综合理论。
Neuron. 2013 Jul 24;79(2):217-40. doi: 10.1016/j.neuron.2013.07.007.