A comparison of reinforcement learning models of human spatial navigation.

Affiliations

School of Psychology, Georgia Institute of Technology, Atlanta, USA.

School of Economics, Georgia Institute of Technology, Atlanta, USA.

Publication information

Sci Rep. 2022 Aug 17;12(1):13923. doi: 10.1038/s41598-022-18245-1.

DOI:10.1038/s41598-022-18245-1

PMID:35978035

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9385652/

Abstract

Reinforcement learning (RL) models have been influential in characterizing human learning and decision making, but few studies apply them to characterizing human spatial navigation and even fewer systematically compare RL models under different navigation requirements. Because RL can characterize one's learning strategies quantitatively and in a continuous manner, and one's consistency of using such strategies, it can provide a novel and important perspective for understanding the marked individual differences in human navigation and disentangle navigation strategies from navigation performance. One-hundred and fourteen participants completed wayfinding tasks in a virtual environment where different phases manipulated navigation requirements. We compared performance of five RL models (3 model-free, 1 model-based and 1 "hybrid") at fitting navigation behaviors in different phases. Supporting implications from prior literature, the hybrid model provided the best fit regardless of navigation requirements, suggesting the majority of participants rely on a blend of model-free (route-following) and model-based (cognitive mapping) learning in such navigation scenarios. Furthermore, consistent with a key prediction, there was a correlation in the hybrid model between the weight on model-based learning (i.e., navigation strategy) and the navigator's exploration vs. exploitation tendency (i.e., consistency of using such navigation strategy), which was modulated by navigation task requirements. Together, we not only show how computational findings from RL align with the spatial navigation literature, but also reveal how the relationship between navigation strategy and a person's consistency using such strategies changes as navigation requirements change.
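As a rough illustration of the two quantities the abstract relates, the sketch below blends model-free and model-based action values with a mixing weight and turns them into choice probabilities with a softmax. It is a generic hybrid formulation under stated assumptions, not the authors' implementation: the names `omega` (weight on model-based values), `theta` (inverse temperature, standing in for the exploration vs. exploitation tendency), and both function names are hypothetical, and the abstract does not give the paper's exact parameterization.

```python
import math


def hybrid_values(q_mf, q_mb, omega):
    """Blend model-free and model-based action values.

    omega is an assumed mixing weight in [0, 1]:
    0 means purely model-free, 1 means purely model-based.
    """
    if not 0.0 <= omega <= 1.0:
        raise ValueError("omega must lie in [0, 1]")
    return [omega * mb + (1.0 - omega) * mf for mf, mb in zip(q_mf, q_mb)]


def softmax_policy(values, theta):
    """Convert action values into choice probabilities.

    theta is an assumed inverse temperature: 0 gives uniform random
    choice (pure exploration); larger values concentrate probability
    on the highest-valued action (more consistent exploitation).
    """
    top = max(values)  # subtract the max for numerical stability
    exps = [math.exp(theta * (v - top)) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


# Illustrative values for three candidate moves at one decision point.
q_mf = [0.2, 0.6, 0.1]  # hypothetical cached route-following values
q_mb = [0.7, 0.3, 0.1]  # hypothetical map-based planning values

blended = hybrid_values(q_mf, q_mb, omega=0.8)
probs = softmax_policy(blended, theta=5.0)
```

In a model comparison of this kind, `omega` and `theta` would typically be free parameters fitted per participant, so a correlation between them across participants is the sort of relationship the abstract reports.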
