• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于模型规划中的模型是什么?

What Is the Model in Model-Based Planning?

机构信息

Department of Psychology and Center for Brain Science, Harvard University.

Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology.

出版信息

Cogn Sci. 2021 Jan;45(1):e12928. doi: 10.1111/cogs.12928.

DOI:10.1111/cogs.12928
PMID:33398907
Abstract

Flexibility is one of the hallmarks of human problem-solving. In everyday life, people adapt to changes in common tasks with little to no additional training. Much of the existing work on flexibility in human problem-solving has focused on how people adapt to tasks in new domains by drawing on solutions from previously learned domains. In real-world tasks, however, humans must generalize across a wide range of within-domain variation. In this work we argue that representational abstraction plays an important role in such within-domain generalization. We then explore the nature of this representational abstraction in realistically complex tasks like video games by demonstrating how the same model-based planning framework produces distinct generalization behaviors under different classes of task representation. Finally, we compare the behavior of agents with these task representations to humans in a series of novel grid-based video game tasks. Our results provide evidence for the claim that within-domain flexibility in humans derives from task representations composed of propositional rules written in terms of objects and relational categories.

摘要

灵活性是人类解决问题的标志之一。在日常生活中,人们几乎无需额外培训就能适应常见任务中的变化。在人类解决问题的灵活性方面,现有大量工作都集中在人们如何通过借鉴先前学习领域的解决方案来适应新领域的任务。然而,在现实任务中,人们必须跨广泛的领域内变化进行泛化。在这项工作中,我们认为表示抽象在这种领域内泛化中起着重要作用。然后,我们通过展示同一基于模型的规划框架如何在不同类别的任务表示下产生不同的泛化行为,探索了在像视频游戏这样现实复杂任务中这种表示抽象的性质。最后,我们在一系列新的基于网格的视频游戏任务中,将具有这些任务表示的代理与人类的行为进行了比较。我们的结果为以下主张提供了证据,即人类的领域内灵活性源自由对象和关系类别术语表述的命题规则组成的任务表示。

相似文献

1
What Is the Model in Model-Based Planning?基于模型规划中的模型是什么?
Cogn Sci. 2021 Jan;45(1):e12928. doi: 10.1111/cogs.12928.
2
A theory of relation learning and cross-domain generalization.一种关系学习与跨领域泛化理论。
Psychol Rev. 2022 Oct;129(5):999-1041. doi: 10.1037/rev0000346. Epub 2022 Feb 3.
3
Transforming task representations to perform novel tasks.将任务表示转化为执行新任务。
Proc Natl Acad Sci U S A. 2020 Dec 29;117(52):32970-32981. doi: 10.1073/pnas.2008852117. Epub 2020 Dec 10.
4
Learning and generalization of within-category representations in a rule-based category structure.基于规则的类别结构中类别内表征的学习与泛化
Atten Percept Psychophys. 2020 Jul;82(5):2448-2462. doi: 10.3758/s13414-020-02024-z.
5
Enhanced "learning to learn" through a hierarchical dual-learning system: the case of action video game players.通过层次化双学习系统增强“学习能力”:以动作视频游戏玩家为例。
BMC Psychol. 2024 Aug 30;12(1):460. doi: 10.1186/s40359-024-01952-x.
6
Simplicity and generalization: Short-cutting abstraction in children's object categorizations.简单性与泛化:儿童物体分类中的捷径式抽象
Cognition. 2008 Sep;108(3):626-38. doi: 10.1016/j.cognition.2008.05.002. Epub 2008 Jun 18.
7
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.
8
Generalizing to generalize: Humans flexibly switch between compositional and conjunctive structures during reinforcement learning.泛化以实现泛化:在强化学习过程中,人类可以灵活地在组合结构和连接结构之间切换。
PLoS Comput Biol. 2020 Apr 13;16(4):e1007720. doi: 10.1371/journal.pcbi.1007720. eCollection 2020 Apr.
9
On the Generalization of Simple Alternating Category Structures.简单交替范畴结构的推广。
Cogn Sci. 2021 Apr;45(4):e12972. doi: 10.1111/cogs.12972.
10
From features to categories: The development of inductive generalization.从特征到类别:归纳推广的发展。
J Exp Psychol Learn Mem Cogn. 2023 Oct;49(10):1615-1634. doi: 10.1037/xlm0001263. Epub 2023 Jun 12.

引用本文的文献

1
The consequences of AI training on human decision-making.人工智能训练对人类决策的影响。
Proc Natl Acad Sci U S A. 2024 Aug 13;121(33):e2408731121. doi: 10.1073/pnas.2408731121. Epub 2024 Aug 6.
2
The neural architecture of theory-based reinforcement learning.基于理论的强化学习的神经结构。
Neuron. 2023 Apr 19;111(8):1331-1344.e8. doi: 10.1016/j.neuron.2023.01.023. Epub 2023 Mar 9.