Paul G. Allen School of Computer Science and Engineering and Center for Neurotechnology, University of Washington, Seattle, WA 98195, U.S.A.
Neural Comput. 2023 Dec 12;36(1):1-32. doi: 10.1162/neco_a_01627.
There is growing interest in predictive coding as a model of how the brain learns through predictions and prediction errors. Predictive coding models have traditionally focused on sensory coding and perception. Here we introduce active predictive coding (APC) as a unifying model for perception, action, and cognition. The APC model addresses important open problems in cognitive science and AI, including (1) how we learn compositional representations (e.g., part-whole hierarchies for equivariant vision) and (2) how we solve large-scale planning problems, which are hard for traditional reinforcement learning, by composing complex state dynamics and abstract actions from simpler dynamics and primitive actions. By using hypernetworks, self-supervised learning, and reinforcement learning, APC learns hierarchical world models by combining task-invariant state transition networks and task-dependent policy networks at multiple abstraction levels. We illustrate the applicability of the APC model to active visual perception and hierarchical planning. Our results represent, to our knowledge, the first proof-of-concept demonstration of a unified approach to addressing the part-whole learning problem in vision, the nested reference frames learning problem in cognition, and the integrated state-action hierarchy learning problem in reinforcement learning.
人们对预测编码作为大脑通过预测和预测误差进行学习的模型越来越感兴趣。预测编码模型传统上侧重于感官编码和感知。在这里,我们引入主动预测编码 (APC) 作为感知、行动和认知的统一模型。APC 模型解决了认知科学和人工智能中的重要开放性问题,包括 (1) 我们如何学习组合表示(例如,用于等变视觉的整体-部分层次结构),以及 (2) 我们如何通过组合复杂的状态动态和抽象动作,解决传统强化学习难以解决的大规模规划问题,这些动作源自更简单的动态和基本动作。通过使用超网络、自我监督学习和强化学习,APC 通过在多个抽象级别上组合任务不变的状态转移网络和任务相关的策略网络来学习分层的世界模型。我们说明了 APC 模型在主动视觉感知和分层规划中的适用性。据我们所知,我们的结果代表了第一个统一方法的概念验证演示,该方法解决了视觉中的整体-部分学习问题、认知中的嵌套参考框架学习问题以及强化学习中的集成状态-动作层次学习问题。