Suppr超能文献

主动推理与学习

Active inference and learning.

作者信息

Friston Karl, FitzGerald Thomas, Rigoli Francesco, Schwartenbeck Philipp, O Doherty John, Pezzulo Giovanni

机构信息

The Wellcome Trust Centre for Neuroimaging, UCL, 12 Queen Square, London, United Kingdom.

The Wellcome Trust Centre for Neuroimaging, UCL, 12 Queen Square, London, United Kingdom; Max-Planck⿿UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom.

出版信息

Neurosci Biobehav Rev. 2016 Sep;68:862-879. doi: 10.1016/j.neubiorev.2016.06.022. Epub 2016 Jun 29.

Abstract

This paper offers an active inference account of choice behaviour and learning. It focuses on the distinction between goal-directed and habitual behaviour and how they contextualise each other. We show that habits emerge naturally (and autodidactically) from sequential policy optimisation when agents are equipped with state-action policies. In active inference, behaviour has explorative (epistemic) and exploitative (pragmatic) aspects that are sensitive to ambiguity and risk respectively, where epistemic (ambiguity-resolving) behaviour enables pragmatic (reward-seeking) behaviour and the subsequent emergence of habits. Although goal-directed and habitual policies are usually associated with model-based and model-free schemes, we find the more important distinction is between belief-free and belief-based schemes. The underlying (variational) belief updating provides a comprehensive (if metaphorical) process theory for several phenomena, including the transfer of dopamine responses, reversal learning, habit formation and devaluation. Finally, we show that active inference reduces to a classical (Bellman) scheme, in the absence of ambiguity.

摘要

本文提供了一种关于选择行为和学习的主动推理解释。它着重于目标导向行为和习惯性行为之间的区别,以及它们如何相互关联。我们表明,当智能体配备状态-动作策略时,习惯会自然地(且自动地)从顺序策略优化中产生。在主动推理中,行为具有探索性(认知性)和利用性(实用性)两个方面,分别对模糊性和风险敏感,其中认知性(解决模糊性)行为促成实用性(寻求奖励)行为以及随后习惯的出现。尽管目标导向策略和习惯性策略通常与基于模型和无模型的方案相关联,但我们发现更重要的区别在于无信念和基于信念的方案之间。潜在的(变分)信念更新为包括多巴胺反应的传递、逆向学习、习惯形成和贬值在内的多种现象提供了一个全面的(如果是隐喻性的)过程理论。最后,我们表明在没有模糊性的情况下,主动推理简化为经典的(贝尔曼)方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/473b/5167251/eda558db8298/gr1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验