Program in Neuroscience, Harvard Medical School, Boston, Massachusetts, United States of America.
Department of Psychology and Center for Brain Science, Harvard University, Cambridge, Massachusetts, United States of America.
PLoS Comput Biol. 2020 Apr 6;16(4):e1007594. doi: 10.1371/journal.pcbi.1007594. eCollection 2020 Apr.
We propose that humans spontaneously organize environments into clusters of states that support hierarchical planning, enabling them to tackle challenging problems by breaking them down into sub-problems at various levels of abstraction. People constantly rely on such hierarchical presentations to accomplish tasks big and small-from planning one's day, to organizing a wedding, to getting a PhD-often succeeding on the very first attempt. We formalize a Bayesian model of hierarchy discovery that explains how humans discover such useful abstractions. Building on principles developed in structure learning and robotics, the model predicts that hierarchy discovery should be sensitive to the topological structure, reward distribution, and distribution of tasks in the environment. In five simulations, we show that the model accounts for previously reported effects of environment structure on planning behavior, such as detection of bottleneck states and transitions. We then test the novel predictions of the model in eight behavioral experiments, demonstrating how the distribution of tasks and rewards can influence planning behavior via the discovered hierarchy, sometimes facilitating and sometimes hindering performance. We find evidence that the hierarchy discovery process unfolds incrementally across trials. Finally, we propose how hierarchy discovery and hierarchical planning might be implemented in the brain. Together, these findings present an important advance in our understanding of how the brain might use Bayesian inference to discover and exploit the hidden hierarchical structure of the environment.
我们提出,人类会自发地将环境组织成状态簇,这些状态簇支持层次规划,使他们能够通过将问题分解为不同抽象层次的子问题来解决具有挑战性的问题。人们经常依靠这种层次化的表示来完成大大小小的任务——从规划一天的活动,到组织婚礼,再到获得博士学位——通常一次就能成功。我们提出了一种贝叶斯层次结构发现模型,该模型解释了人类如何发现这种有用的抽象。该模型基于结构学习和机器人技术中开发的原理,预测层次结构发现应该对环境中的拓扑结构、奖励分布和任务分布敏感。在五个模拟中,我们表明该模型解释了先前报道的环境结构对规划行为的影响,例如对瓶颈状态和转换的检测。然后,我们在八个行为实验中测试了该模型的新预测,展示了任务和奖励的分布如何通过发现的层次结构影响规划行为,有时会促进,有时会阻碍表现。我们有证据表明,层次结构发现过程在试验中是逐步展开的。最后,我们提出了层次结构发现和层次规划可能在大脑中实现的方式。总的来说,这些发现代表了我们对大脑如何利用贝叶斯推理来发现和利用环境中隐藏的层次结构的理解的重要进展。