Department of Psychology and Princeton Neuroscience Institute, Princeton University.
Department of Psychology, University of Texas at Austin.
J Exp Psychol Gen. 2014 Feb;143(1):182-94. doi: 10.1037/a0030844. Epub 2012 Dec 10.
Recent computational theories of decision making in humans and animals have portrayed 2 systems locked in a battle for control of behavior. One system--variously termed model-free or habitual--favors actions that have previously led to reward, whereas a second--called the model-based or goal-directed system--favors actions that causally lead to reward according to the agent's internal model of the environment. Some evidence suggests that control can be shifted between these systems using neural or behavioral manipulations, but other evidence suggests that the systems are more intertwined than a competitive account would imply. In 4 behavioral experiments, using a retrospective revaluation design and a cognitive load manipulation, we show that human decisions are more consistent with a cooperative architecture in which the model-free system controls behavior, whereas the model-based system trains the model-free system by replaying and simulating experience.
近期关于人类和动物决策的计算理论描绘了两个系统为了控制行为而进行的斗争。一个系统——被称为无模型或习惯的系统——倾向于之前导致奖励的行动,而第二个系统——称为基于模型或目标导向的系统——则倾向于根据主体对环境的内部模型,导致奖励的行动。一些证据表明,可以通过神经或行为操作在这些系统之间进行控制,但其他证据表明,这些系统比竞争理论所暗示的更为交织。在四项行为实验中,我们使用回溯再评价设计和认知负荷操作,表明人类决策更符合合作架构,其中无模型系统控制行为,而基于模型的系统通过重播和模拟经验来训练无模型系统。