Vector-based navigation using grid-like representations in artificial agents.

Affiliations

DeepMind, London, UK.

Department of Cell and Developmental Biology, University College London, London, UK.

Publication information

Nature. 2018 May;557(7705):429-433. doi: 10.1038/s41586-018-0102-6. Epub 2018 May 9.

DOI:10.1038/s41586-018-0102-6
PMID:29743670
Abstract

Deep neural networks have achieved impressive successes in fields ranging from object recognition to complex games such as Go. Navigation, however, remains a substantial challenge for artificial agents, with deep neural networks trained by reinforcement learning failing to rival the proficiency of mammalian spatial behaviour, which is underpinned by grid cells in the entorhinal cortex. Grid cells are thought to provide a multi-scale periodic representation that functions as a metric for coding space and is critical for integrating self-motion (path integration) and planning direct trajectories to goals (vector-based navigation). Here we set out to leverage the computational functions of grid cells to develop a deep reinforcement learning agent with mammal-like navigational abilities. We first trained a recurrent network to perform path integration, leading to the emergence of representations resembling grid cells, as well as other entorhinal cell types. We then showed that this representation provided an effective basis for an agent to locate goals in challenging, unfamiliar, and changeable environments, optimizing the primary objective of navigation through deep reinforcement learning. The performance of agents endowed with grid-like representations surpassed that of an expert human and comparison agents, with the metric quantities necessary for vector-based navigation derived from grid-like units within the network. Furthermore, grid-like representations enabled agents to conduct shortcut behaviours reminiscent of those performed by mammals. Our findings show that emergent grid-like representations furnish agents with a Euclidean spatial metric and associated vector operations, providing a foundation for proficient navigation. As such, our results support neuroscientific theories that see grid cells as critical for vector-based navigation, demonstrating that the latter can be combined with path-based strategies to support navigation in challenging environments.
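
The two quantities the abstract names, path integration and the goal vector used in vector-based navigation, can be illustrated with plain geometry. The sketch below is not the paper's method (the paper trains a recurrent network and a deep reinforcement learning agent); it only shows what is being computed, under the assumption that self-motion arrives as (speed, heading, duration) samples. The function names and the sample trajectory are hypothetical.

```python
import math


def path_integrate(start, steps):
    """Accumulate self-motion samples into a position estimate.

    Each step is (speed, heading_in_radians, duration); this input format
    is an assumption made for illustration only.
    """
    x, y = start
    for speed, heading, dt in steps:
        x += speed * math.cos(heading) * dt
        y += speed * math.sin(heading) * dt
    return x, y


def goal_vector(position, goal):
    """Return the Euclidean distance and bearing from position to goal."""
    dx = goal[0] - position[0]
    dy = goal[1] - position[1]
    return math.hypot(dx, dy), math.atan2(dy, dx)


# Hypothetical trajectory: one unit east, then one unit north.
steps = [(1.0, 0.0, 1.0), (1.0, math.pi / 2, 1.0)]
position = path_integrate((0.0, 0.0), steps)

# Direct "shortcut" vector back to the starting point.
distance, bearing = goal_vector(position, (0.0, 0.0))
print(position, distance, bearing)
```

Here the agent never travelled the diagonal, yet the integrated position yields a direct distance and bearing back to the start, which is the kind of shortcut computation a Euclidean spatial metric makes possible.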

Similar articles

1
Vector-based navigation using grid-like representations in artificial agents.
Nature. 2018 May;557(7705):429-433. doi: 10.1038/s41586-018-0102-6. Epub 2018 May 9.
2
Biomimetic FPGA-based spatial navigation model with grid cells and place cells.
Neural Netw. 2021 Jul;139:45-63. doi: 10.1016/j.neunet.2021.01.028. Epub 2021 Feb 13.
3
Compromised Grid-Cell-like Representations in Old Age as a Key Mechanism to Explain Age-Related Navigational Deficits.
Curr Biol. 2018 Apr 2;28(7):1108-1115.e6. doi: 10.1016/j.cub.2018.02.038. Epub 2018 Mar 15.
4
Navigating with grid and place cells in cluttered environments.
Hippocampus. 2020 Mar;30(3):220-232. doi: 10.1002/hipo.23147. Epub 2019 Aug 13.
5
Hexadirectional Modulation of Theta Power in Human Entorhinal Cortex during Spatial Navigation.
Curr Biol. 2018 Oct 22;28(20):3310-3315.e4. doi: 10.1016/j.cub.2018.08.029. Epub 2018 Oct 11.
6
Grid-like hexadirectional modulation of human entorhinal theta oscillations.
Proc Natl Acad Sci U S A. 2018 Oct 16;115(42):10798-10803. doi: 10.1073/pnas.1805007115. Epub 2018 Oct 3.
7
Environmental Barriers Disrupt Grid-like Representations in Humans during Navigation.
Curr Biol. 2019 Aug 19;29(16):2718-2722.e3. doi: 10.1016/j.cub.2019.06.072. Epub 2019 Aug 1.
8
Grid coding, spatial representation, and navigation: Should we assume an isomorphism?
Hippocampus. 2020 Apr;30(4):422-432. doi: 10.1002/hipo.23175. Epub 2019 Nov 18.
9
The Neurobiology of Mammalian Navigation.
Curr Biol. 2018 Sep 10;28(17):R1023-R1042. doi: 10.1016/j.cub.2018.05.050.
10
Modeling place cells and grid cells in multi-compartment environments: Entorhinal-hippocampal loop as a multisensory integration circuit.
Neural Netw. 2020 Jan;121:37-51. doi: 10.1016/j.neunet.2019.09.002. Epub 2019 Sep 6.

Cited by

1
Grid cells accurately track movement during path integration-based navigation despite switching reference frames.
Nat Neurosci. 2025 Sep 10. doi: 10.1038/s41593-025-02054-6.
2
Speed modulations in grid cell information geometry.
Nat Commun. 2025 Aug 19;16(1):7723. doi: 10.1038/s41467-025-62856-x.
3
Goal-directed navigation in humans and deep reinforcement learning agents relies on an adaptive mix of vector-based and transition-based strategies.
PLoS Biol. 2025 Jul 29;23(7):e3003296. doi: 10.1371/journal.pbio.3003296. eCollection 2025 Jul.
4
Learning place cells and remapping by decoding the cognitive map.
Elife. 2025 Jul 28;13:RP99302. doi: 10.7554/eLife.99302.
5
Cortical dissociation of spatial reference frames during place navigation.
bioRxiv. 2025 Jun 29:2025.06.25.661569. doi: 10.1101/2025.06.25.661569.
6
REMI: Reconstructing Episodic Memory During Intrinsic Path Planning.
bioRxiv. 2025 Jul 3:2025.07.02.662824. doi: 10.1101/2025.07.02.662824.
7
Cooperative coding of continuous variables in networks with sparsity constraint.
PLoS Comput Biol. 2025 Jul 3;21(7):e1012156. doi: 10.1371/journal.pcbi.1012156. eCollection 2025 Jul.
8
Discovering cognitive strategies with tiny recurrent neural networks.
Nature. 2025 Jul 2. doi: 10.1038/s41586-025-09142-4.
9
A hippocampal navigation model through hierarchical memory organization.
Cogn Neurodyn. 2025 Dec;19(1):103. doi: 10.1007/s11571-025-10254-w. Epub 2025 Jun 26.
10
Binding in hippocampal-entorhinal circuits enables compositionality in cognitive maps.
Adv Neural Inf Process Syst. 2024;37:39128-39157.