• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

鼠标追踪揭示了在没有基于模型的选择的情况下的结构知识。

Mouse tracking reveals structure knowledge in the absence of model-based choice.

机构信息

Zurich Center for Neuroeconomics, Department of Economics, University of Zurich, Blümlisalpstrasse, 10 8006, Zurich, Switzerland.

Department of Economics, The Ohio State University, 1945 North High Street, Columbus, OH, 43210, USA.

出版信息

Nat Commun. 2020 Apr 20;11(1):1893. doi: 10.1038/s41467-020-15696-w.

DOI:10.1038/s41467-020-15696-w
PMID:32312966
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7170897/
Abstract

Converging evidence has demonstrated that humans exhibit two distinct strategies when learning in complex environments. One is model-free learning, i.e., simple reinforcement of rewarded actions, and the other is model-based learning, which considers the structure of the environment. Recent work has argued that people exhibit little model-based behavior unless it leads to higher rewards. Here we use mouse tracking to study model-based learning in stochastic and deterministic (pattern-based) environments of varying difficulty. In both tasks participants' mouse movements reveal that they learned the structures of their environments, despite the fact that standard behavior-based estimates suggested no such learning in the stochastic task. Thus, we argue that mouse tracking can reveal whether subjects have structure knowledge, which is necessary but not sufficient for model-based choice.

摘要

已有大量证据表明,人类在复杂环境中学习时有两种截然不同的策略。一种是无模型学习,即对奖励动作进行简单强化,另一种是基于模型的学习,即考虑环境的结构。最近的研究认为,除非基于模型的行为能带来更高的回报,否则人们很少表现出基于模型的行为。在这里,我们使用鼠标跟踪技术来研究随机和确定性(基于模式)环境中的基于模型的学习,这两种任务的参与者的鼠标运动都表明,他们学习了环境的结构,尽管基于标准行为的估计表明,在随机任务中没有这样的学习。因此,我们认为鼠标跟踪可以揭示被试是否具有结构知识,这是基于模型选择所必需的,但不是充分的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc16/7170897/aac092be246e/41467_2020_15696_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc16/7170897/3b26a1a8fee2/41467_2020_15696_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc16/7170897/e326f94c1b68/41467_2020_15696_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc16/7170897/58274408d9fb/41467_2020_15696_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc16/7170897/aac092be246e/41467_2020_15696_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc16/7170897/3b26a1a8fee2/41467_2020_15696_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc16/7170897/e326f94c1b68/41467_2020_15696_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc16/7170897/58274408d9fb/41467_2020_15696_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc16/7170897/aac092be246e/41467_2020_15696_Fig4_HTML.jpg

相似文献

1
Mouse tracking reveals structure knowledge in the absence of model-based choice.鼠标追踪揭示了在没有基于模型的选择的情况下的结构知识。
Nat Commun. 2020 Apr 20;11(1):1893. doi: 10.1038/s41467-020-15696-w.
2
[Mathematical models of decision making and learning].[决策与学习的数学模型]
Brain Nerve. 2008 Jul;60(7):791-8.
3
Model-based reinforcement learning under concurrent schedules of reinforcement in rodents.啮齿动物在并发强化程序下基于模型的强化学习
Learn Mem. 2009 Apr 29;16(5):315-23. doi: 10.1101/lm.1295509. Print 2009 May.
4
Entropy-based metrics for predicting choice behavior based on local response to reward.基于熵的指标,用于预测基于奖励局部响应的选择行为。
Nat Commun. 2021 Nov 12;12(1):6567. doi: 10.1038/s41467-021-26784-w.
5
A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task.一种具有类似多巴胺强化信号的神经网络模型,用于学习空间延迟反应任务。
Neuroscience. 1999;91(3):871-90. doi: 10.1016/s0306-4522(98)00697-6.
6
Switching costs in stochastic environments drive the emergence of matching behaviour in animal decision-making through the promotion of reward learning strategies.在随机环境中,转换成本通过促进奖励学习策略,推动动物决策中匹配行为的出现。
Sci Rep. 2021 Dec 8;11(1):23593. doi: 10.1038/s41598-021-02979-5.
7
Structure learning in human sequential decision-making.人类序贯决策中的结构学习。
PLoS Comput Biol. 2010 Dec 2;6(12):e1001003. doi: 10.1371/journal.pcbi.1001003.
8
The ubiquity of model-based reinforcement learning.基于模型的强化学习无处不在。
Curr Opin Neurobiol. 2012 Dec;22(6):1075-81. doi: 10.1016/j.conb.2012.08.003. Epub 2012 Sep 6.
9
The impact of learning on perceptual decisions and its implication for speed-accuracy tradeoffs.学习对知觉决策的影响及其对速度-准确性权衡的意义。
Nat Commun. 2020 Jun 2;11(1):2757. doi: 10.1038/s41467-020-16196-7.
10
Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making.新颖性不是惊喜:人类在序列决策中的探索和适应行为。
PLoS Comput Biol. 2021 Jun 3;17(6):e1009070. doi: 10.1371/journal.pcbi.1009070. eCollection 2021 Jun.

引用本文的文献

1
The Trembling Hand Unraveled: Motor Dynamics and Choice Inconsistency.颤抖之手的剖析:运动动力学与选择不一致性
J Neurosci. 2025 Aug 6;45(32):e1312242025. doi: 10.1523/JNEUROSCI.1312-24.2025.
2
Striatal arbitration between choice strategies guides few-shot adaptation.选择策略之间的纹状体仲裁引导少样本适应。
Nat Commun. 2025 Feb 20;16(1):1811. doi: 10.1038/s41467-025-57049-5.
3
Using smartphones to optimise and scale-up the assessment of model-based planning.利用智能手机优化并扩大基于模型规划的评估。
Commun Psychol. 2023 Nov 1;1(1):31. doi: 10.1038/s44271-023-00031-y.
4
Perceived warmth and competence predict callback rates in meta-analyzed North American labor market experiments.感知的温暖和能力可以预测北美劳动力市场实验的回调率。
PLoS One. 2024 Jul 10;19(7):e0304723. doi: 10.1371/journal.pone.0304723. eCollection 2024.
5
Attribute latencies causally shape intertemporal decisions.属性潜伏期会对跨期决策产生因果影响。
Nat Commun. 2024 Apr 5;15(1):2948. doi: 10.1038/s41467-024-46657-2.
6
Disrupted state transition learning as a computational marker of compulsivity.紊乱的状态转换学习作为强迫行为的计算标记物。
Psychol Med. 2023 Apr;53(5):2095-2105. doi: 10.1017/S0033291721003846. Epub 2021 Sep 24.
7
Explicit knowledge of task structure is a primary determinant of human model-based action.任务结构的明确知识是人类基于模型行为的主要决定因素。
Nat Hum Behav. 2022 Aug;6(8):1126-1141. doi: 10.1038/s41562-022-01346-2. Epub 2022 May 19.
8
Humans perseverate on punishment avoidance goals in multigoal reinforcement learning.在多目标强化学习中,人类执着于避免惩罚的目标。
Elife. 2022 Feb 24;11:e74402. doi: 10.7554/eLife.74402.
9
Using dynamic monitoring of choices to predict and understand risk preferences.利用选择的动态监测来预测和理解风险偏好。
Proc Natl Acad Sci U S A. 2020 Dec 15;117(50):31738-31747. doi: 10.1073/pnas.2010056117. Epub 2020 Nov 24.
10
The Anterior Cingulate Cortex Predicts Future States to Mediate Model-Based Action Selection.前扣带皮层预测未来状态,以介导基于模型的动作选择。
Neuron. 2021 Jan 6;109(1):149-163.e7. doi: 10.1016/j.neuron.2020.10.013. Epub 2020 Nov 4.