Da Costa Lancelot, Parr Thomas, Sengupta Biswa, Friston Karl
Department of Mathematics, Imperial College London, London SW7 2AZ, UK.
Wellcome Centre for Human Neuroimaging, Institute of Neurology, University College London, London WC1N 3BG, UK.
Entropy (Basel). 2021 Apr 12;23(4):454. doi: 10.3390/e23040454.
Active inference is a normative framework for explaining behaviour under the free energy principle-a theory of self-organisation originating in neuroscience. It specifies neuronal dynamics for state-estimation in terms of a descent on (variational) free energy-a measure of the fit between an internal (generative) model and sensory observations. The free energy gradient is a prediction error-plausibly encoded in the average membrane potentials of neuronal populations. Conversely, the expected probability of a state can be expressed in terms of neuronal firing rates. We show that this is consistent with current models of neuronal dynamics and establish face validity by synthesising plausible electrophysiological responses. We then show that these neuronal dynamics approximate natural gradient descent, a well-known optimisation algorithm from information geometry that follows the steepest descent of the objective in information space. We compare the information length of belief updating in both schemes, a measure of the distance travelled in information space that has a direct interpretation in terms of metabolic cost. We show that neural dynamics under active inference are metabolically efficient and suggest that neural representations in biological agents may evolve by approximating steepest descent in information space towards the point of optimal inference.
主动推理是一种规范框架,用于解释基于自由能原理的行为,自由能原理是一种起源于神经科学的自组织理论。它根据(变分)自由能(一种衡量内部(生成)模型与感官观察之间拟合度的指标)的下降来指定用于状态估计的神经元动力学。自由能梯度是一种预测误差,可能编码在神经元群体的平均膜电位中。相反,状态的预期概率可以用神经元放电率来表示。我们表明,这与当前的神经元动力学模型一致,并通过合成合理的电生理反应来建立表面效度。然后我们表明,这些神经元动力学近似于自然梯度下降,自然梯度下降是信息几何中的一种著名优化算法,它沿着信息空间中目标函数的最速下降方向进行。我们比较了两种方案中信念更新的信息长度,信息长度是信息空间中移动距离的一种度量,在代谢成本方面有直接的解释。我们表明,主动推理下的神经动力学在代谢上是高效的,并提出生物智能体中的神经表征可能通过在信息空间中朝着最优推理点近似最速下降而进化。