Friston Karl J, Lin Marco, Frith Christopher D, Pezzulo Giovanni, Hobson J Allan, Ondobaka Sasha
Wellcome Trust Centre for Neuroimaging, Institute of Neurology, University College London WC1N 3BG, U.K.
Wellcome Trust Centre for Neuroimaging, Institute of Neurology, University College London WC1N 3BG, and Institute of Philosophy, School of Advanced Studies, University of London EC1E 7HU, U.K.
Neural Comput. 2017 Oct;29(10):2633-2683. doi: 10.1162/neco_a_00999. Epub 2017 Aug 4.
This article offers a formal account of curiosity and insight in terms of active (Bayesian) inference. It deals with the dual problem of inferring states of the world and learning its statistical structure. In contrast to current trends in machine learning (e.g., deep learning), we focus on how people attain insight and understanding using just a handful of observations, which are solicited through curious behavior. We use simulations of abstract rule learning and approximate Bayesian inference to show that minimizing (expected) variational free energy leads to active sampling of novel contingencies. This epistemic behavior closes explanatory gaps in generative models of the world, thereby reducing uncertainty and satisfying curiosity. We then move from epistemic learning to model selection or structure learning to show how abductive processes emerge when agents test plausible hypotheses about symmetries (i.e., invariances or rules) in their generative models. The ensuing Bayesian model reduction evinces mechanisms associated with sleep and has all the hallmarks of "aha" moments. This formulation moves toward a computational account of consciousness in the pre-Cartesian sense of sharable knowledge (i.e., con: "together"; scire: "to know").
本文从主动(贝叶斯)推理的角度对好奇心和洞察力进行了形式化阐述。它探讨了推断世界状态和学习其统计结构的双重问题。与当前机器学习的趋势(如深度学习)不同,我们关注的是人们如何仅通过少量观察来获得洞察力和理解,这些观察是通过好奇行为引发的。我们使用抽象规则学习和近似贝叶斯推理的模拟来表明,最小化(预期)变分自由能会导致对新的意外情况进行主动采样。这种认知行为填补了世界生成模型中的解释空白,从而减少了不确定性并满足了好奇心。然后,我们从认知学习转向模型选择或结构学习,以展示当智能体在其生成模型中测试关于对称性(即不变性或规则)的合理假设时,归纳过程是如何出现的。随之而来的贝叶斯模型简化揭示了与睡眠相关的机制,并具有“顿悟”时刻的所有特征。这种表述朝着笛卡尔之前可共享知识意义上的意识计算解释迈进(即con:“一起”;scire:“知道”)。