Huys Quentin J M, Lally Níall, Faulkner Paul, Eshel Neir, Seifritz Erich, Gershman Samuel J, Dayan Peter, Roiser Jonathan P
Translational Neuromodeling Unit, Institute of Biomedical Engineering, University of Zürich and Swiss Federal Institute of Technology (ETH) Zürich, 8032 Zurich, Switzerland; Department of Psychiatry, Psychotherapy and Psychosomatics, Hospital of Psychiatry, University of Zürich, 8032 Zurich, Switzerland;
Institute of Cognitive Neuroscience, University College London, London WC1N 3AR, United Kingdom; Experimental Therapeutics & Pathophysiology Branch, Intramural Research Program, National Institute of Mental Health, National Institutes of Health, Bethesda, MD 20892;
Proc Natl Acad Sci U S A. 2015 Mar 10;112(10):3098-103. doi: 10.1073/pnas.1414219112. Epub 2015 Feb 9.
Humans routinely formulate plans in domains so complex that even the most powerful computers are taxed. To do so, they seem to avail themselves of many strategies and heuristics that efficiently simplify, approximate, and hierarchically decompose hard tasks into simpler subtasks. Theoretical and cognitive research has revealed several such strategies; however, little is known about their establishment, interaction, and efficiency. Here, we use model-based behavioral analysis to provide a detailed examination of the performance of human subjects in a moderately deep planning task. We find that subjects exploit the structure of the domain to establish subgoals in a way that achieves a nearly maximal reduction in the cost of computing values of choices, but then combine partial searches with greedy local steps to solve subtasks, and maladaptively prune the decision trees of subtasks in a reflexive manner upon encountering salient losses. Subjects come idiosyncratically to favor particular sequences of actions to achieve subgoals, creating novel complex actions or "options."
人类经常在极其复杂的领域制定计划,即使是最强大的计算机在处理这些领域时也会不堪重负。为了做到这一点,他们似乎会运用许多策略和启发法,这些策略和启发法能有效地简化、近似并将艰巨任务分层分解为更简单的子任务。理论和认知研究已经揭示了几种这样的策略;然而,对于它们的建立、相互作用和效率却知之甚少。在这里,我们使用基于模型的行为分析来详细考察人类受试者在一个适度深度的规划任务中的表现。我们发现,受试者利用领域结构以一种几乎能最大程度降低计算选择值成本的方式来确立子目标,但随后会将部分搜索与贪婪的局部步骤相结合来解决子任务,并且在遇到显著损失时会以一种反射性的方式不适应地修剪子任务的决策树。受试者会独特地倾向于采用特定的行动序列来实现子目标,从而创造出新颖的复杂行动或“选项”。