DeepMind, London, UK.
Department of Cell and Developmental Biology, University College London, London, UK.
Nature. 2018 May;557(7705):429-433. doi: 10.1038/s41586-018-0102-6. Epub 2018 May 9.
Deep neural networks have achieved impressive successes in fields ranging from object recognition to complex games such as Go. Navigation, however, remains a substantial challenge for artificial agents, with deep neural networks trained by reinforcement learning failing to rival the proficiency of mammalian spatial behaviour, which is underpinned by grid cells in the entorhinal cortex. Grid cells are thought to provide a multi-scale periodic representation that functions as a metric for coding space and is critical for integrating self-motion (path integration) and planning direct trajectories to goals (vector-based navigation). Here we set out to leverage the computational functions of grid cells to develop a deep reinforcement learning agent with mammal-like navigational abilities. We first trained a recurrent network to perform path integration, leading to the emergence of representations resembling grid cells, as well as other entorhinal cell types. We then showed that this representation provided an effective basis for an agent to locate goals in challenging, unfamiliar, and changeable environments, optimizing the primary objective of navigation through deep reinforcement learning. The performance of agents endowed with grid-like representations surpassed that of an expert human and comparison agents, with the metric quantities necessary for vector-based navigation derived from grid-like units within the network. Furthermore, grid-like representations enabled agents to conduct shortcut behaviours reminiscent of those performed by mammals. Our findings show that emergent grid-like representations furnish agents with a Euclidean spatial metric and associated vector operations, providing a foundation for proficient navigation. As such, our results support neuroscientific theories that see grid cells as critical for vector-based navigation, demonstrating that the latter can be combined with path-based strategies to support navigation in challenging environments.
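The abstract's first step, training a recurrent network to path-integrate, can be illustrated with a minimal sketch: a recurrent network receives a stream of self-motion (velocity) inputs and is trained to predict the agent's current location, encoded as activations of simulated place cells. This is not the authors' code; the network sizes, the three-component velocity input, the linear bottleneck (where grid-like units are reported to emerge), and the Gaussian place-cell targets are illustrative assumptions about a setup of this kind.

```python
# Hedged sketch of a path-integration network (not the published implementation).
import torch
import torch.nn as nn

class PathIntegrator(nn.Module):
    def __init__(self, n_place_cells=256, hidden=128, bottleneck=64):
        super().__init__()
        # Input: (speed, sin of heading change, cos of heading change) per step.
        self.rnn = nn.LSTM(input_size=3, hidden_size=hidden, batch_first=True)
        self.bottleneck = nn.Linear(hidden, bottleneck)  # latent code to inspect for grid-like units
        self.dropout = nn.Dropout(0.5)
        self.place_head = nn.Linear(bottleneck, n_place_cells)

    def forward(self, velocities):
        # velocities: (batch, time, 3) -> logits over place cells and the latent code.
        h, _ = self.rnn(velocities)
        g = self.dropout(self.bottleneck(h))
        return self.place_head(g), g

def place_cell_targets(positions, centres, sigma=0.1):
    # Soft assignment of each 2-D position to Gaussian place-cell centres.
    d2 = ((positions[:, :, None, :] - centres[None, None]) ** 2).sum(-1)
    return torch.softmax(-d2 / (2 * sigma ** 2), dim=-1)

# Toy training step on random stand-in data; real inputs would be trajectories
# of a simulated forager moving through an enclosure.
torch.manual_seed(0)
model = PathIntegrator()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
centres = torch.rand(256, 2)            # place-cell centres in a unit square
vel = torch.randn(8, 100, 3)            # stand-in velocity stream
pos = torch.rand(8, 100, 2)             # stand-in ground-truth positions
logits, _ = model(vel)
log_p = torch.log_softmax(logits, dim=-1)
loss = -(place_cell_targets(pos, centres) * log_p).sum(-1).mean()
loss.backward()
opt.step()
```

In the paper's framing, the latent code of such a network (here the bottleneck output `g`) is what would be inspected for grid-like spatial tuning and then supplied to a deep reinforcement learning agent as the basis for vector-based navigation.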