Suppr超能文献

回溯模型基推理指导无模型信用分配。

Retrospective model-based inference guides model-free credit assignment.

机构信息

Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, 10-12 Russell Square, London, WC1B 5EH, UK.

Wellcome Centre for Human Neuroimaging, University College London, London, WC1N 3BG, United Kingdom.

出版信息

Nat Commun. 2019 Feb 14;10(1):750. doi: 10.1038/s41467-019-08662-8.

Abstract

An extensive reinforcement learning literature shows that organisms assign credit efficiently, even under conditions of state uncertainty. However, little is known about credit-assignment when state uncertainty is subsequently resolved. Here, we address this problem within the framework of an interaction between model-free (MF) and model-based (MB) control systems. We present and support experimentally a theory of MB retrospective-inference. Within this framework, a MB system resolves uncertainty that prevailed when actions were taken thus guiding an MF credit-assignment. Using a task in which there was initial uncertainty about the lotteries that were chosen, we found that when participants' momentary uncertainty about which lottery had generated an outcome was resolved by provision of subsequent information, participants preferentially assigned credit within a MF system to the lottery they retrospectively inferred was responsible for this outcome. These findings extend our knowledge about the range of MB functions and the scope of system interactions.

摘要

大量强化学习文献表明,即使在状态不确定的情况下,生物也能有效地分配信用。然而,当状态不确定性随后得到解决时,关于信用分配的了解甚少。在这里,我们在无模型(MF)和基于模型(MB)控制系统之间的交互框架内解决了这个问题。我们提出并通过实验支持了 MB 回溯推理的理论。在这个框架内,MB 系统解决了在采取行动时存在的不确定性,从而指导 MF 信用分配。使用一个初始时对所选择的彩票存在不确定性的任务,我们发现,当参与者对产生结果的彩票的瞬间不确定性通过提供后续信息得到解决时,参与者在 MF 系统中更倾向于将信用分配给他们回溯推断对这一结果负责的彩票。这些发现扩展了我们对 MB 功能范围和系统交互范围的认识。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验