海马体的计算特性通过分层强化学习提高目标导向觅食的效率。

Computational Properties of the Hippocampus Increase the Efficiency of Goal-Directed Foraging through Hierarchical Reinforcement Learning.

作者信息

Chalmers Eric, Luczak Artur, Gruber Aaron J

机构信息

Department of Neuroscience, University of Lethbridge Lethbridge, AB, Canada.

出版信息

Front Comput Neurosci. 2016 Dec 12;10:128. doi: 10.3389/fncom.2016.00128. eCollection 2016.

DOI:10.3389/fncom.2016.00128

PMID:28018203

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5149552/

Abstract

The mammalian brain is thought to use a version of Model-based Reinforcement Learning (MBRL) to guide "goal-directed" behavior, wherein animals consider goals and make plans to acquire desired outcomes. However, conventional MBRL algorithms do not fully explain animals' ability to rapidly adapt to environmental changes, or learn multiple complex tasks. They also require extensive computation, suggesting that goal-directed behavior is cognitively expensive. We propose here that key features of processing in the hippocampus support a flexible MBRL mechanism for spatial navigation that is computationally efficient and can adapt quickly to change. We investigate this idea by implementing a computational MBRL framework that incorporates features inspired by computational properties of the hippocampus: a hierarchical representation of space, "forward sweeps" through future spatial trajectories, and context-driven remapping of place cells. We find that a hierarchical abstraction of space greatly reduces the computational load (mental effort) required for adaptation to changing environmental conditions, and allows efficient scaling to large problems. It also allows abstract knowledge gained at high levels to guide adaptation to new obstacles. Moreover, a context-driven remapping mechanism allows learning and memory of multiple tasks. Simulating dorsal or ventral hippocampal lesions in our computational framework qualitatively reproduces behavioral deficits observed in rodents with analogous lesions. The framework may thus embody key features of how the brain organizes model-based RL to efficiently solve navigation and other difficult tasks.

摘要

哺乳动物的大脑被认为使用一种基于模型的强化学习（MBRL）版本来指导“目标导向”行为，即动物会考虑目标并制定计划以获取期望的结果。然而，传统的MBRL算法并不能完全解释动物快速适应环境变化或学习多个复杂任务的能力。它们还需要大量的计算，这表明目标导向行为在认知上成本很高。我们在此提出，海马体处理过程的关键特征支持一种灵活的用于空间导航的MBRL机制，该机制计算效率高且能快速适应变化。我们通过实施一个计算MBRL框架来研究这一想法，该框架纳入了受海马体计算特性启发的特征：空间的分层表示、对未来空间轨迹的“向前扫描”以及位置细胞的上下文驱动重映射。我们发现，空间的分层抽象极大地降低了适应不断变化的环境条件所需的计算负荷（脑力），并允许有效地扩展到大型问题。它还允许在高层次获得的抽象知识指导对新障碍的适应。此外，上下文驱动的重映射机制允许学习和记忆多个任务。在我们的计算框架中模拟背侧或腹侧海马体损伤定性地再现了在具有类似损伤的啮齿动物中观察到的行为缺陷。因此，该框架可能体现了大脑如何组织基于模型的强化学习以有效解决导航和其他困难任务的关键特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc9/5149552/de7ff42b625a/fncom-10-00128-g0001.jpg

相似文献

Computational Properties of the Hippocampus Increase the Efficiency of Goal-Directed Foraging through Hierarchical Reinforcement Learning.

Front Comput Neurosci. 2016 Dec 12;10:128. doi: 10.3389/fncom.2016.00128. eCollection 2016.

An Improved Dyna-Q Algorithm Inspired by the Forward Prediction Mechanism in the Rat Brain for Mobile Robot Path Planning.

Biomimetics (Basel). 2024 May 23;9(6):315. doi: 10.3390/biomimetics9060315.

Reinforcement learning approaches to hippocampus-dependent flexible spatial navigation.

Brain Neurosci Adv. 2021 Apr 9;5:2398212820975634. doi: 10.1177/2398212820975634. eCollection 2021 Jan-Dec.

Neuro-Inspired Reinforcement Learning to Improve Trajectory Prediction in Reward-Guided Behavior.

Int J Neural Syst. 2022 Sep;32(9):2250038. doi: 10.1142/S0129065722500381. Epub 2022 Aug 19.

Vision-Based Robot Navigation through Combining Unsupervised Learning and Hierarchical Reinforcement Learning.

Sensors (Basel). 2019 Apr 1;19(7):1576. doi: 10.3390/s19071576.

Model-based spatial navigation in the hippocampus-ventral striatum circuit: A computational analysis.

PLoS Comput Biol. 2018 Sep 17;14(9):e1006316. doi: 10.1371/journal.pcbi.1006316. eCollection 2018 Sep.

Rapid learning of spatial representations for goal-directed navigation based on a novel model of hippocampal place fields.

Neural Netw. 2023 Apr;161:116-128. doi: 10.1016/j.neunet.2023.01.010. Epub 2023 Jan 19.

A computational model for spatial cognition combining dorsal and ventral hippocampal place field maps: multiscale navigation.

Biol Cybern. 2020 Apr;114(2):187-207. doi: 10.1007/s00422-019-00812-x. Epub 2020 Jan 9.

Contribution of hippocampal place cell activity to learning and formation of goal-directed navigation in rats.

Neuroscience. 2003;117(4):1025-35. doi: 10.1016/s0306-4522(02)00700-5.

Habitual control of goal selection in humans.

Proc Natl Acad Sci U S A. 2015 Nov 10;112(45):13817-22. doi: 10.1073/pnas.1506367112. Epub 2015 Oct 12.

引用本文的文献

A hippocampal navigation model through hierarchical memory organization.

Cogn Neurodyn. 2025 Dec;19(1):103. doi: 10.1007/s11571-025-10254-w. Epub 2025 Jun 26.

Enhancing reinforcement learning models by including direct and indirect pathways improves performance on striatal dependent tasks.

PLoS Comput Biol. 2023 Aug 18;19(8):e1011385. doi: 10.1371/journal.pcbi.1011385. eCollection 2023 Aug.

Adapting hippocampus multi-scale place field distributions in cluttered environments optimizes spatial navigation and learning.

Front Comput Neurosci. 2022 Dec 12;16:1039822. doi: 10.3389/fncom.2022.1039822. eCollection 2022.

Discovering Implied Serial Order Through Model-Free and Model-Based Learning.

Front Neurosci. 2019 Aug 20;13:878. doi: 10.3389/fnins.2019.00878. eCollection 2019.

Suppression of Ventral Hippocampal Output Impairs Integrated Orbitofrontal Encoding of Task Structure.

Neuron. 2017 Aug 30;95(5):1197-1207.e3. doi: 10.1016/j.neuron.2017.08.003. Epub 2017 Aug 17.

本文引用的文献

Memory hierarchies map onto the hippocampal long axis in humans.

Nat Neurosci. 2015 Nov;18(11):1562-4. doi: 10.1038/nn.4138. Epub 2015 Oct 19.

Habitual control of goal selection in humans.

Proc Natl Acad Sci U S A. 2015 Nov 10;112(45):13817-22. doi: 10.1073/pnas.1506367112. Epub 2015 Oct 12.

Topography of Place Maps along the CA3-to-CA2 Axis of the Hippocampus.

Neuron. 2015 Sep 2;87(5):1078-92. doi: 10.1016/j.neuron.2015.07.007. Epub 2015 Aug 19.

Neural Population Evidence of Functional Heterogeneity along the CA3 Transverse Axis: Pattern Completion versus Pattern Separation.

Neuron. 2015 Sep 2;87(5):1093-105. doi: 10.1016/j.neuron.2015.07.012. Epub 2015 Aug 19.

Distinct neural representation in the dorsolateral, dorsomedial, and ventral parts of the striatum during fixed- and free-choice tasks.

J Neurosci. 2015 Feb 25;35(8):3499-514. doi: 10.1523/JNEUROSCI.1962-14.2015.

Divide et impera: subgoaling reduces the complexity of probabilistic inference and problem solving.

J R Soc Interface. 2015 Mar 6;12(104):20141335. doi: 10.1098/rsif.2014.1335.

Place cells, grid cells, and memory.

Cold Spring Harb Perspect Biol. 2015 Feb 2;7(2):a021808. doi: 10.1101/cshperspect.a021808.

Homeostatic reinforcement learning for integrating reward collection and physiological stability.

Elife. 2014 Dec 2;3:e04811. doi: 10.7554/eLife.04811.

Model-based hierarchical reinforcement learning and human action control.

Philos Trans R Soc Lond B Biol Sci. 2014 Nov 5;369(1655). doi: 10.1098/rstb.2013.0480.

Neural correlates of strategic reasoning during competitive games.

Science. 2014 Oct 17;346(6207):340-3. doi: 10.1126/science.1256254. Epub 2014 Sep 18.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

海马体的计算特性通过分层强化学习提高目标导向觅食的效率。

Computational Properties of the Hippocampus Increase the Efficiency of Goal-Directed Foraging through Hierarchical Reinforcement Learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献