Suppr超能文献

回溯模型基推理指导无模型信用分配。

Retrospective model-based inference guides model-free credit assignment.

机构信息

Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, 10-12 Russell Square, London, WC1B 5EH, UK.

Wellcome Centre for Human Neuroimaging, University College London, London, WC1N 3BG, United Kingdom.

出版信息

Nat Commun. 2019 Feb 14;10(1):750. doi: 10.1038/s41467-019-08662-8.

Abstract

An extensive reinforcement learning literature shows that organisms assign credit efficiently, even under conditions of state uncertainty. However, little is known about credit-assignment when state uncertainty is subsequently resolved. Here, we address this problem within the framework of an interaction between model-free (MF) and model-based (MB) control systems. We present and support experimentally a theory of MB retrospective-inference. Within this framework, a MB system resolves uncertainty that prevailed when actions were taken thus guiding an MF credit-assignment. Using a task in which there was initial uncertainty about the lotteries that were chosen, we found that when participants' momentary uncertainty about which lottery had generated an outcome was resolved by provision of subsequent information, participants preferentially assigned credit within a MF system to the lottery they retrospectively inferred was responsible for this outcome. These findings extend our knowledge about the range of MB functions and the scope of system interactions.

摘要

大量强化学习文献表明,即使在状态不确定的情况下,生物也能有效地分配信用。然而,当状态不确定性随后得到解决时,关于信用分配的了解甚少。在这里,我们在无模型(MF)和基于模型(MB)控制系统之间的交互框架内解决了这个问题。我们提出并通过实验支持了 MB 回溯推理的理论。在这个框架内,MB 系统解决了在采取行动时存在的不确定性,从而指导 MF 信用分配。使用一个初始时对所选择的彩票存在不确定性的任务,我们发现,当参与者对产生结果的彩票的瞬间不确定性通过提供后续信息得到解决时,参与者在 MF 系统中更倾向于将信用分配给他们回溯推断对这一结果负责的彩票。这些发现扩展了我们对 MB 功能范围和系统交互范围的认识。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验