• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

序贯决策中的回溯再评估:两个系统的故事。

Retrospective revaluation in sequential decision making: a tale of two systems.

机构信息

Department of Psychology and Princeton Neuroscience Institute, Princeton University.

Department of Psychology, University of Texas at Austin.

出版信息

J Exp Psychol Gen. 2014 Feb;143(1):182-94. doi: 10.1037/a0030844. Epub 2012 Dec 10.

DOI:10.1037/a0030844
PMID:23230992
Abstract

Recent computational theories of decision making in humans and animals have portrayed 2 systems locked in a battle for control of behavior. One system--variously termed model-free or habitual--favors actions that have previously led to reward, whereas a second--called the model-based or goal-directed system--favors actions that causally lead to reward according to the agent's internal model of the environment. Some evidence suggests that control can be shifted between these systems using neural or behavioral manipulations, but other evidence suggests that the systems are more intertwined than a competitive account would imply. In 4 behavioral experiments, using a retrospective revaluation design and a cognitive load manipulation, we show that human decisions are more consistent with a cooperative architecture in which the model-free system controls behavior, whereas the model-based system trains the model-free system by replaying and simulating experience.

摘要

近期关于人类和动物决策的计算理论描绘了两个系统为了控制行为而进行的斗争。一个系统——被称为无模型或习惯的系统——倾向于之前导致奖励的行动,而第二个系统——称为基于模型或目标导向的系统——则倾向于根据主体对环境的内部模型,导致奖励的行动。一些证据表明,可以通过神经或行为操作在这些系统之间进行控制,但其他证据表明,这些系统比竞争理论所暗示的更为交织。在四项行为实验中,我们使用回溯再评价设计和认知负荷操作,表明人类决策更符合合作架构,其中无模型系统控制行为,而基于模型的系统通过重播和模拟经验来训练无模型系统。

相似文献

1
Retrospective revaluation in sequential decision making: a tale of two systems.序贯决策中的回溯再评估:两个系统的故事。
J Exp Psychol Gen. 2014 Feb;143(1):182-94. doi: 10.1037/a0030844. Epub 2012 Dec 10.
2
How to set the switches on this thing.这东西的开关怎么设置。
Curr Opin Neurobiol. 2012 Dec;22(6):1068-74. doi: 10.1016/j.conb.2012.05.011. Epub 2012 Jun 15.
3
A new computational account of cognitive control over reinforcement-based decision-making: Modeling of a probabilistic learning task.一种关于对基于强化的决策进行认知控制的新计算解释:概率学习任务的建模
Neural Netw. 2015 Nov;71:112-23. doi: 10.1016/j.neunet.2015.08.006. Epub 2015 Aug 20.
4
Reward-dependent learning in neuronal networks for planning and decision making.用于规划和决策的神经网络中基于奖励的学习。
Prog Brain Res. 2000;126:217-29. doi: 10.1016/S0079-6123(00)26016-0.
5
[Neural mechanisms of decision making].[决策的神经机制]
Brain Nerve. 2008 Sep;60(9):1017-27.
6
Dorsal anterior cingulate cortex integrates reinforcement history to guide voluntary behavior.背侧前扣带回皮层整合强化历史以指导自愿行为。
Cortex. 2008 May;44(5):548-59. doi: 10.1016/j.cortex.2007.08.013. Epub 2007 Dec 23.
7
Model-based reinforcement learning under concurrent schedules of reinforcement in rodents.啮齿动物在并发强化程序下基于模型的强化学习
Learn Mem. 2009 Apr 29;16(5):315-23. doi: 10.1101/lm.1295509. Print 2009 May.
8
Integration of reinforcement learning and optimal decision-making theories of the basal ganglia.整合强化学习与基底神经节的最优决策理论。
Neural Comput. 2011 Apr;23(4):817-51. doi: 10.1162/NECO_a_00103. Epub 2011 Jan 11.
9
Goal-proximity decision-making.目标接近决策。
Cogn Sci. 2013 May-Jun;37(4):757-74. doi: 10.1111/cogs.12034. Epub 2013 Mar 29.
10
Reinforcement learning and decision making in monkeys during a competitive game.猴子在竞争性游戏中的强化学习与决策
Brain Res Cogn Brain Res. 2004 Dec;22(1):45-58. doi: 10.1016/j.cogbrainres.2004.07.007.

引用本文的文献

1
Model-based algorithms shape automatic evaluative processing.基于模型的算法塑造自动评价性加工。
Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2417068122. doi: 10.1073/pnas.2417068122. Epub 2025 Jun 20.
2
Proactive and reactive construction of memory-based preferences.基于记忆的偏好的主动和被动构建。
Nat Commun. 2025 Feb 13;16(1):1618. doi: 10.1038/s41467-025-56183-4.
3
Maturation of striatal dopamine supports the development of habitual behavior through adolescence.纹状体多巴胺的成熟通过青春期支持习惯性行为的发展。
bioRxiv. 2025 Jan 6:2025.01.06.631527. doi: 10.1101/2025.01.06.631527.
4
A recurrent network model of planning explains hippocampal replay and human behavior.一种规划的循环网络模型解释了海马体重放和人类行为。
Nat Neurosci. 2024 Jul;27(7):1340-1348. doi: 10.1038/s41593-024-01675-7. Epub 2024 Jun 7.
5
"Leap before you look": Conditions that suppress explicit, knowledge-based learning during visuomotor adaptation.“盲目行动”:在视动适应过程中抑制明确的基于知识的学习的条件。
J Exp Psychol Hum Percept Perform. 2024 Aug;50(8):785-807. doi: 10.1037/xhp0001210. Epub 2024 May 16.
6
Multiple and subject-specific roles of uncertainty in reward-guided decision-making.不确定性在奖励引导决策中的多种特定主体作用。
bioRxiv. 2024 Sep 12:2024.03.27.587016. doi: 10.1101/2024.03.27.587016.
7
Proactive and reactive construction of memory-based preferences.基于记忆的偏好的主动和被动构建。
bioRxiv. 2024 Dec 17:2023.12.10.570977. doi: 10.1101/2023.12.10.570977.
8
Predictions about reward outcomes in rhesus monkeys.预测猕猴的奖励结果。
Behav Neurosci. 2024 Feb;138(1):43-58. doi: 10.1037/bne0000573. Epub 2023 Dec 7.
9
Making green growth a reality: Reconciling sobriety with stakeholders' satisfaction.实现绿色增长:在清醒与利益相关者满意度之间寻求平衡。
PLoS One. 2023 Aug 23;18(8):e0284487. doi: 10.1371/journal.pone.0284487. eCollection 2023.
10
A reinforcement-based mechanism for discontinuous learning.基于强化的非连续学习机制。
Proc Natl Acad Sci U S A. 2022 Dec 6;119(49):e2215352119. doi: 10.1073/pnas.2215352119. Epub 2022 Nov 28.