• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类和深度强化学习智能体中的目标导向导航依赖于基于向量和基于转换的策略的自适应混合。

Goal-directed navigation in humans and deep reinforcement learning agents relies on an adaptive mix of vector-based and transition-based strategies.

作者信息

Lan Denis C L, Hunt Laurence T, Summerfield Christopher

机构信息

Department of Experimental Psychology, Medical Sciences Division, University of Oxford, Oxford, United Kingdom.

出版信息

PLoS Biol. 2025 Jul 29;23(7):e3003296. doi: 10.1371/journal.pbio.3003296. eCollection 2025 Jul.

DOI:10.1371/journal.pbio.3003296
PMID:40729396
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12324678/
Abstract

Much has been learned about the cognitive and neural mechanisms by which humans and other animals navigate to reach their goals. However, most studies have involved a single, well-learned environment. By contrast, real-world wayfinding often occurs in unfamiliar settings, requiring people to combine memories of landmark locations with on-the-fly information about transitions between adjacent states. Here, we studied the strategies that support human navigation in wholly novel environments. We found that during goal-directed navigation, people use a mix of strategies, adaptively deploying both associations between proximal states (state transitions) and directions between distal landmarks (vectors) at stereotyped points on a journey. Deep neural networks meta-trained with reinforcement learning to find the shortest path to goal exhibited near-identical strategies, and in doing so, developed units specialized for the implementation of vector- and state transition-based strategies. These units exhibited response patterns and representational geometries that resemble those previously found in mammalian navigational systems. Overall, our results suggest that effective navigation in novel environments relies on an adaptive mix of state transition- and vector-based strategies, supported by different modes of representing the environment in the brain.

摘要

关于人类和其他动物为实现目标而导航的认知和神经机制,我们已经了解了很多。然而,大多数研究都涉及单一的、熟悉的环境。相比之下,现实世界中的寻路通常发生在不熟悉的环境中,这就要求人们将地标位置的记忆与相邻状态之间转换的即时信息结合起来。在这里,我们研究了支持人类在全新环境中导航的策略。我们发现,在目标导向的导航过程中,人们会使用多种策略,在旅程中的固定点上自适应地运用近端状态之间的关联(状态转换)和远端地标之间的方向(向量)。通过强化学习进行元训练以找到到达目标最短路径的深度神经网络表现出几乎相同的策略,并且在这样做的过程中,开发出了专门用于实施基于向量和状态转换策略的单元。这些单元表现出的反应模式和表征几何结构与先前在哺乳动物导航系统中发现的相似。总体而言,我们的结果表明,在新环境中的有效导航依赖于基于状态转换和向量的策略的自适应组合,并由大脑中表征环境的不同模式提供支持。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8355/12324678/7777e8e789b3/pbio.3003296.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8355/12324678/6e4bccd66dc9/pbio.3003296.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8355/12324678/3e4ab5f84386/pbio.3003296.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8355/12324678/98535d59a400/pbio.3003296.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8355/12324678/e40770e2e663/pbio.3003296.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8355/12324678/7777e8e789b3/pbio.3003296.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8355/12324678/6e4bccd66dc9/pbio.3003296.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8355/12324678/3e4ab5f84386/pbio.3003296.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8355/12324678/98535d59a400/pbio.3003296.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8355/12324678/e40770e2e663/pbio.3003296.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8355/12324678/7777e8e789b3/pbio.3003296.g005.jpg

相似文献

1
Goal-directed navigation in humans and deep reinforcement learning agents relies on an adaptive mix of vector-based and transition-based strategies.人类和深度强化学习智能体中的目标导向导航依赖于基于向量和基于转换的策略的自适应混合。
PLoS Biol. 2025 Jul 29;23(7):e3003296. doi: 10.1371/journal.pbio.3003296. eCollection 2025 Jul.
2
Short-Term Memory Impairment短期记忆障碍
3
Q-learning with temporal memory to navigate turbulence.基于时间记忆的Q学习以应对动荡。
Elife. 2025 Jul 21;13:RP102906. doi: 10.7554/eLife.102906.
4
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
5
Sexual Harassment and Prevention Training性骚扰与预防培训
6
Educational, supportive and behavioural interventions to improve usage of continuous positive airway pressure machines in adults with obstructive sleep apnoea.旨在提高阻塞性睡眠呼吸暂停成年患者持续气道正压通气机使用情况的教育、支持和行为干预措施。
Cochrane Database Syst Rev. 2014 Jan 8(1):CD007736. doi: 10.1002/14651858.CD007736.pub2.
7
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
8
The quantity, quality and findings of network meta-analyses evaluating the effectiveness of GLP-1 RAs for weight loss: a scoping review.评估胰高血糖素样肽-1受体激动剂(GLP-1 RAs)减肥效果的网状Meta分析的数量、质量及结果:一项范围综述
Health Technol Assess. 2025 Jun 25:1-73. doi: 10.3310/SKHT8119.
9
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.

本文引用的文献

1
Vector coding and place coding in hippocampus share a common directional signal.海马体中的向量编码和位置编码共享一个共同的方向信号。
Nat Commun. 2024 Dec 5;15(1):10630. doi: 10.1038/s41467-024-54935-2.
2
One-shot entorhinal maps enable flexible navigation in novel environments.一次性内嗅皮层地图可实现新环境中的灵活导航。
Nature. 2024 Nov;635(8040):943-950. doi: 10.1038/s41586-024-08034-3. Epub 2024 Oct 9.
3
Human hippocampal and entorhinal neurons encode the temporal structure of experience.人类海马体和内嗅皮层神经元对经验的时间结构进行编码。
Nature. 2024 Nov;635(8037):160-167. doi: 10.1038/s41586-024-07973-1. Epub 2024 Sep 25.
4
Meta-learned models of cognition.元认知学习模型。
Behav Brain Sci. 2023 Nov 23;47:e147. doi: 10.1017/S0140525X23003266.
5
Goal-seeking compresses neural codes for space in the human hippocampus and orbitofrontal cortex.目标导向压缩了人类海马体和眶额皮质中空间的神经编码。
Neuron. 2023 Dec 6;111(23):3885-3899.e6. doi: 10.1016/j.neuron.2023.08.021. Epub 2023 Sep 18.
6
Active and passive exploration for spatial knowledge acquisition: A meta-analysis.主动探索和被动探索在空间知识获取中的作用:一项元分析。
Q J Exp Psychol (Hove). 2024 May;77(5):964-982. doi: 10.1177/17470218231185121. Epub 2023 Jul 5.
7
Complementary task representations in hippocampus and prefrontal cortex for generalizing the structure of problems.海马体和前额叶皮层中的互补任务表示,用于推广问题的结构。
Nat Neurosci. 2022 Oct;25(10):1314-1326. doi: 10.1038/s41593-022-01149-8. Epub 2022 Sep 28.
8
Predictive maps in rats and humans for spatial navigation.大鼠和人类空间导航的预测图。
Curr Biol. 2022 Sep 12;32(17):3676-3689.e5. doi: 10.1016/j.cub.2022.06.090. Epub 2022 Jul 20.
9
How the environment shapes our ability to navigate.环境如何塑造我们的导航能力。
Clin Transl Med. 2022 Jun;12(6):e928. doi: 10.1002/ctm2.928.
10
Entropy of city street networks linked to future spatial navigation ability.城市街道网络的熵与未来的空间导航能力有关。
Nature. 2022 Apr;604(7904):104-110. doi: 10.1038/s41586-022-04486-7. Epub 2022 Mar 30.