Structure learning in human sequential decision-making.

Affiliation

Department of Computer Science and Engineering, University of Minnesota, Minneapolis, Minnesota, United States of America.

Publication information

PLoS Comput Biol. 2010 Dec 2;6(12):e1001003. doi: 10.1371/journal.pcbi.1001003.

Abstract

Studies of sequential decision-making in humans frequently find suboptimal performance relative to an ideal actor that has perfect knowledge of the model of how rewards and events are generated in the environment. Rather than being suboptimal, we argue that the learning problem humans face is more complex, in that it also involves learning the structure of reward generation in the environment. We formulate the problem of structure learning in sequential decision tasks using Bayesian reinforcement learning, and show that learning the generative model for rewards qualitatively changes the behavior of an optimal learning agent. To test whether people exhibit structure learning, we performed experiments involving a mixture of one-armed and two-armed bandit reward models, where structure learning produces many of the qualitative behaviors deemed suboptimal in previous studies. Our results demonstrate humans can perform structure learning in a near-optimal manner.
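As a rough illustration of what structure learning means here, the sketch below computes a posterior over two candidate reward structures from observed wins and losses on two options. It is not the authors' model. It assumes, purely for illustration, Bernoulli rewards with Beta(1, 1) priors, an "independent" structure in which each arm has its own reward probability, and a "coupled" structure in which the second arm pays off with probability one minus that of the first. The function names and the coupling rule are hypothetical, and the abstract does not specify them.

```python
from math import exp, lgamma, log


def log_beta(a, b):
    """Log of the Beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def log_evidence_independent(s1, f1, s2, f2, a=1.0, b=1.0):
    """Log marginal likelihood when each arm has its own reward probability.

    s1, f1: successes and failures observed on arm 1 (likewise s2, f2 for arm 2).
    a, b: Beta prior parameters (an assumption of this sketch).
    """
    return (log_beta(a + s1, b + f1) - log_beta(a, b)
            + log_beta(a + s2, b + f2) - log_beta(a, b))


def log_evidence_coupled(s1, f1, s2, f2, a=1.0, b=1.0):
    """Log marginal likelihood under the assumed coupling theta2 = 1 - theta1.

    A failure on arm 2 then counts as evidence for a high theta1,
    and a success on arm 2 counts as evidence against it.
    """
    return log_beta(a + s1 + f2, b + f1 + s2) - log_beta(a, b)


def posterior_coupled(s1, f1, s2, f2, prior_coupled=0.5):
    """Posterior probability of the coupled structure given the counts."""
    log_c = log_evidence_coupled(s1, f1, s2, f2) + log(prior_coupled)
    log_i = log_evidence_independent(s1, f1, s2, f2) + log(1.0 - prior_coupled)
    m = max(log_c, log_i)  # subtract the max for numerical stability
    w_c, w_i = exp(log_c - m), exp(log_i - m)
    return w_c / (w_c + w_i)


if __name__ == "__main__":
    # Arm 1 mostly wins and arm 2 mostly loses: consistent with coupling.
    print(posterior_coupled(8, 2, 2, 8))
    # Both arms mostly win: hard to explain under the assumed coupling.
    print(posterior_coupled(8, 2, 8, 2))
```

Under these assumptions, an agent that is uncertain about the structure would weight its beliefs about each arm by this posterior, so its choices can differ from those of an agent that is told the structure in advance.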

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1070/2996460/e2812975d78e/pcbi.1001003.g001.jpg
