
Global remapping emerges as the mechanism for renewal of context-dependent behavior in a reinforcement learning model.

Authors

Kappel David, Cheng Sen

Affiliation

Institute for Neural Computation, Faculty of Computer Science, Ruhr University Bochum, Bochum, Germany.

Publication details

Front Comput Neurosci. 2025 Jan 15;18:1462110. doi: 10.3389/fncom.2024.1462110. eCollection 2024.

DOI:10.3389/fncom.2024.1462110
PMID:39881840
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11774835/
Abstract

INTRODUCTION

The hippocampal formation exhibits complex and context-dependent activity patterns and dynamics, e.g., place cell activity during spatial navigation in rodents or remapping of place fields when the animal switches between contexts. Furthermore, rodents show context-dependent renewal of extinguished behavior. However, the link between context-dependent neural codes and context-dependent renewal is not fully understood.

METHODS

We use a deep neural network-based reinforcement learning agent to study the learning dynamics that occur during spatial learning and context switching in a simulated ABA extinction and renewal paradigm in a 3D virtual environment.
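The ABA design and the replay memory mentioned in this abstract can be illustrated with a minimal sketch. It assumes the usual reading of ABA renewal (acquisition with reward in context A, extinction without reward in context B, unrewarded test back in context A). The trial counts, the `aba_schedule` and `ReplayBuffer` names, and the transition format are hypothetical and are not taken from the paper.

```python
import random
from collections import deque


def aba_schedule(n_acquisition=3, n_extinction=3, n_test=2):
    """Yield (phase, context, rewarded) per trial. Trial counts are illustrative."""
    for _ in range(n_acquisition):
        yield ("acquisition", "A", True)   # response is rewarded in context A
    for _ in range(n_extinction):
        yield ("extinction", "B", False)   # reward is withheld in context B
    for _ in range(n_test):
        yield ("test", "A", False)         # back in A, still unrewarded (assumption)


class ReplayBuffer:
    """Fixed-capacity store of past transitions, sampled uniformly at random."""

    def __init__(self, capacity, seed=0):
        self._items = deque(maxlen=capacity)  # oldest entries drop out first
        self._rng = random.Random(seed)

    def add(self, transition):
        self._items.append(transition)

    def sample(self, batch_size):
        k = min(batch_size, len(self._items))
        return self._rng.sample(list(self._items), k)

    def __len__(self):
        return len(self._items)


trials = list(aba_schedule())
buffer = ReplayBuffer(capacity=100)
for phase, context, rewarded in trials:
    # A real agent would store (state, action, reward, next_state) here.
    buffer.add((phase, context, 1.0 if rewarded else 0.0))
batch = buffer.sample(4)  # mixed-phase minibatch, as used in replay-based training
```

Because the buffer mixes experiences from several phases, a minibatch drawn during extinction can still contain acquisition trials from context A, which is one way replay could interact with context-specific representations.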

RESULTS

Despite its simplicity, the network exhibits a number of features typically found in the CA1 and CA3 regions of the hippocampus. A significant proportion of neurons in deeper layers of the network are tuned to a specific spatial position of the agent in the environment, similar to place cells in the hippocampus. These complex spatial representations and dynamics emerge spontaneously in the hidden layer of a deep network during learning. The spatial representations exhibit global remapping when the agent is exposed to a new context. The spatial maps are restored when the agent returns to the previous context, accompanied by renewal of the conditioned behavior. Remapping is facilitated by memory replay of experiences during training.
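One common way to quantify remapping of a single unit is to compare its spatial rate maps across contexts with a Pearson correlation. The sketch below illustrates this on a toy 1D track with synthetic Gaussian tuning. The abstract does not describe the authors' analysis, so the functions `rate_map` and `pearson`, the bin count, and the field positions are all assumptions made for illustration.

```python
import math


def rate_map(positions, activations, n_bins):
    """Mean activation per spatial bin for positions in [0, 1)."""
    sums = [0.0] * n_bins
    counts = [0] * n_bins
    for x, a in zip(positions, activations):
        b = min(int(x * n_bins), n_bins - 1)
        sums[b] += a
        counts[b] += 1
    return [s / c if c else 0.0 for s, c in zip(sums, counts)]


def pearson(u, v):
    """Pearson correlation of two equal-length maps (0.0 if either is flat)."""
    mu, mv = sum(u) / len(u), sum(v) / len(v)
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = math.sqrt(sum((a - mu) ** 2 for a in u))
    sv = math.sqrt(sum((b - mv) ** 2 for b in v))
    return cov / (su * sv) if su and sv else 0.0


# Synthetic unit: field near 0.25 in context A, near 0.75 in context B.
positions = [(i + 0.5) / 20 for i in range(20)]


def bump(center, width=0.05):
    return [math.exp(-((x - center) ** 2) / (2 * width ** 2)) for x in positions]


map_a_first = rate_map(positions, bump(0.25), n_bins=10)
map_b = rate_map(positions, bump(0.75), n_bins=10)
map_a_return = rate_map(positions, bump(0.25), n_bins=10)

across_contexts = pearson(map_a_first, map_b)        # low: field moved
within_context = pearson(map_a_first, map_a_return)  # high: map restored
```

In this toy case a low across-context correlation together with a high correlation on return to A mirrors the pattern described above: remapping in the new context and restoration of the original map afterwards.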

DISCUSSION

Our results show that integrated codes that jointly represent spatial and task-relevant contextual variables are the mechanism underlying renewal in a simulated DQN agent.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72df/11774835/5c1b05b3f628/fncom-18-1462110-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72df/11774835/a51cf84808b7/fncom-18-1462110-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72df/11774835/2cf7b816f25a/fncom-18-1462110-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72df/11774835/b247e362b41d/fncom-18-1462110-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72df/11774835/dc6d19456808/fncom-18-1462110-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72df/11774835/cc64209aaac3/fncom-18-1462110-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72df/11774835/cd7cbf13f52b/fncom-18-1462110-g0007.jpg
