• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从强化学习角度看记忆巩固。

Memory consolidation from a reinforcement learning perspective.

作者信息

Lee Jong Won, Jung Min Whan

机构信息

Center for Synaptic Brain Dysfunctions, Institute for Basic Science, Daejeon, Republic of Korea.

Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea.

出版信息

Front Comput Neurosci. 2025 Jan 8;18:1538741. doi: 10.3389/fncom.2024.1538741. eCollection 2024.

DOI:10.3389/fncom.2024.1538741
PMID:39845091
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11751224/
Abstract

Memory consolidation refers to the process of converting temporary memories into long-lasting ones. It is widely accepted that new experiences are initially stored in the hippocampus as rapid associative memories, which then undergo a consolidation process to establish more permanent traces in other regions of the brain. Over the past two decades, studies in humans and animals have demonstrated that the hippocampus is crucial not only for memory but also for imagination and future planning, with the CA3 region playing a pivotal role in generating novel activity patterns. Additionally, a growing body of evidence indicates the involvement of the hippocampus, especially the CA1 region, in valuation processes. Based on these findings, we propose that the CA3 region of the hippocampus generates diverse activity patterns, while the CA1 region evaluates and reinforces those patterns most likely to maximize rewards. This framework closely parallels Dyna, a reinforcement learning algorithm introduced by Sutton in 1991. In Dyna, an agent performs offline simulations to supplement trial-and-error value learning, greatly accelerating the learning process. We suggest that memory consolidation might be viewed as a process of deriving optimal strategies based on simulations derived from limited experiences, rather than merely strengthening incidental memories. From this perspective, memory consolidation functions as a form of offline reinforcement learning, aimed at enhancing adaptive decision-making.

摘要

记忆巩固是指将临时记忆转化为长期记忆的过程。人们普遍认为,新的经历最初作为快速联想记忆存储在海马体中,然后经历一个巩固过程,以便在大脑的其他区域建立更持久的痕迹。在过去的二十年里,对人类和动物的研究表明,海马体不仅对记忆至关重要,而且对想象和未来规划也至关重要,其中CA3区域在产生新的活动模式中起着关键作用。此外,越来越多的证据表明海马体,尤其是CA1区域,参与了评估过程。基于这些发现,我们提出海马体的CA3区域产生多样的活动模式,而CA1区域评估并强化那些最有可能使奖励最大化的模式。这个框架与1991年萨顿引入的强化学习算法Dyna非常相似。在Dyna中,一个智能体进行离线模拟以补充试错价值学习,极大地加速了学习过程。我们认为,记忆巩固可能被视为一个基于从有限经验中得出的模拟来推导最优策略的过程,而不仅仅是强化偶然记忆。从这个角度来看,记忆巩固作为一种离线强化学习形式,旨在增强适应性决策。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e423/11751224/50633a7fc9a4/fncom-18-1538741-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e423/11751224/50633a7fc9a4/fncom-18-1538741-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e423/11751224/50633a7fc9a4/fncom-18-1538741-g001.jpg

相似文献

1
Memory consolidation from a reinforcement learning perspective.从强化学习角度看记忆巩固。
Front Comput Neurosci. 2025 Jan 8;18:1538741. doi: 10.3389/fncom.2024.1538741. eCollection 2024.
2
A model of bi-directional interactions between complementary learning systems for memory consolidation of sequential experiences.一种用于顺序性经历记忆巩固的互补学习系统间双向交互模型。
Front Syst Neurosci. 2022 Oct 13;16:972235. doi: 10.3389/fnsys.2022.972235. eCollection 2022.
3
Enhancement of synchronized activity between hippocampal CA1 neurons during initial storage of associative fear memory.在关联性恐惧记忆的初始存储过程中海马体CA1神经元之间同步活动的增强。
J Physiol. 2017 Aug 1;595(15):5327-5340. doi: 10.1113/JP274212. Epub 2017 Jun 30.
4
Synaptic reentry reinforcement based network model for long-term memory consolidation.基于突触再入强化的长期记忆巩固网络模型
Hippocampus. 2002;12(5):637-47. doi: 10.1002/hipo.10102.
5
Offline replay supports planning in human reinforcement learning.离线重放支持人类强化学习中的规划。
Elife. 2018 Dec 14;7:e32548. doi: 10.7554/eLife.32548.
6
Dentate Gyrus Sharp Waves, a Local Field Potential Correlate of Learning in the Dentate Gyrus of Mice.齿状回尖波,作为学习在小鼠齿状回的局部场电位相关物。
J Neurosci. 2020 Sep 9;40(37):7105-7118. doi: 10.1523/JNEUROSCI.2275-19.2020. Epub 2020 Aug 19.
7
Adaptive stimulus selection for consolidation in the hippocampus.海马体巩固过程中的适应性刺激选择。
Nature. 2022 Jan;601(7892):240-244. doi: 10.1038/s41586-021-04118-6. Epub 2021 Dec 8.
8
Retrieval as a Fast Route to Memory Consolidation.作为记忆巩固快速途径的提取
Trends Cogn Sci. 2017 Aug;21(8):573-576. doi: 10.1016/j.tics.2017.05.001. Epub 2017 Jun 2.
9
NMDA receptor-dependent synaptic reinforcement as a crucial process for memory consolidation.作为记忆巩固关键过程的N-甲基-D-天冬氨酸受体依赖性突触强化。
Science. 2000 Nov 10;290(5494):1170-4. doi: 10.1126/science.290.5494.1170.
10
A dentate gyrus-CA3 inhibitory circuit promotes evolution of hippocampal-cortical ensembles during memory consolidation.齿状回-CA3 抑制性回路在记忆巩固过程中促进海马-皮层集合体的演化。
Elife. 2022 Feb 22;11:e70586. doi: 10.7554/eLife.70586.

本文引用的文献

1
Selection of experience for memory by hippocampal sharp wave ripples.海马体锐波涟漪对记忆的经验选择。
Science. 2024 Mar 29;383(6690):1478-1483. doi: 10.1126/science.adk8261. Epub 2024 Mar 28.
2
Rethinking the hippocampal cognitive map as a meta-learning computational module.重新思考海马体认知图作为元学习计算模块。
Trends Cogn Sci. 2023 Aug;27(8):702-712. doi: 10.1016/j.tics.2023.05.011. Epub 2023 Jun 23.
3
Neural dynamics underlying associative learning in the dorsal and ventral hippocampus.背侧和腹侧海马体中联想学习的神经动力学。
Nat Neurosci. 2023 May;26(5):798-809. doi: 10.1038/s41593-023-01296-6. Epub 2023 Apr 3.
4
The amygdala mediates the facilitating influence of emotions on memory through multiple interacting mechanisms.杏仁核通过多种相互作用的机制介导情绪对记忆的促进作用。
Neurobiol Stress. 2023 Feb 23;24:100529. doi: 10.1016/j.ynstr.2023.100529. eCollection 2023 May.
5
Septotemporal variations in hippocampal value and outcome processing.海马尾状回在时间和空间上的价值和结果处理变化。
Cell Rep. 2023 Feb 28;42(2):112094. doi: 10.1016/j.celrep.2023.112094. Epub 2023 Feb 9.
6
Hippocampal neurons construct a map of an abstract value space.海马体神经元构建了一个抽象价值空间的图谱。
Cell. 2021 Sep 2;184(18):4640-4650.e10. doi: 10.1016/j.cell.2021.07.010. Epub 2021 Aug 3.
7
Memory consolidation as an adaptive process.记忆巩固作为一种适应过程。
Psychon Bull Rev. 2021 Dec;28(6):1796-1810. doi: 10.3758/s13423-021-01978-x. Epub 2021 Jul 29.
8
Reward biases spontaneous neural reactivation during sleep.奖励会促使睡眠期间自发的神经再激活。
Nat Commun. 2021 Jul 6;12(1):4162. doi: 10.1038/s41467-021-24357-5.
9
Reinforcement learning approaches to hippocampus-dependent flexible spatial navigation.用于依赖海马体的灵活空间导航的强化学习方法。
Brain Neurosci Adv. 2021 Apr 9;5:2398212820975634. doi: 10.1177/2398212820975634. eCollection 2021 Jan-Dec.
10
Robust and distributed neural representation of action values.动作值的鲁棒和分布式神经表示。
Elife. 2021 Apr 20;10:e53045. doi: 10.7554/eLife.53045.