Gureckis Todd M, Love Bradley C
Department of Psychology, New York University, 6 Washington Place, New York, NY 10003, USA.
Department of Psychology, The University of Texas at Austin, 1 University Station A8000, Austin, Texas 78712, USA.
Cognition. 2009 Dec;113(3):293-313. doi: 10.1016/j.cognition.2009.03.013. Epub 2009 May 8.
Successful investors seeking returns, animals foraging for food, and pilots controlling aircraft all must take into account how their current decisions will impact their future standing. One challenge facing decision makers is that options that appear attractive in the short-term may not turn out best in the long run. In this paper, we explore human learning in a dynamic decision making task which places short- and long-term rewards in conflict. Our goal in these studies was to evaluate how people's mental representation of a task affects their ability to discover an optimal decision strategy. We find that perceptual cues that readily align with the underlying state of the task environment help people overcome the impulsive appeal of short-term rewards. Our experimental manipulations, predictions, and analyses are motivated by current work in reinforcement learning which details how learners value delayed outcomes in sequential tasks and the importance that "state" identification plays in effective learning.
寻求回报的成功投资者、觅食的动物以及操控飞机的飞行员都必须考虑他们当前的决策将如何影响其未来状况。决策者面临的一个挑战是,短期内看似有吸引力的选择从长远来看可能并非最佳。在本文中,我们在一个动态决策任务中探索人类学习,该任务使短期和长期奖励产生冲突。我们在这些研究中的目标是评估人们对任务的心理表征如何影响他们发现最优决策策略的能力。我们发现,与任务环境的潜在状态易于匹配的感知线索有助于人们克服短期奖励的冲动吸引力。我们的实验操纵、预测和分析是受强化学习当前研究工作的推动,该研究详细阐述了学习者如何在序列任务中评估延迟结果,以及“状态”识别在有效学习中所起的重要作用。