一种受海马体记忆机制启发的可扩展强化学习框架，用于高效的情境和序列决策。

A scalable reinforcement learning framework inspired by hippocampal memory mechanisms for efficient contextual and sequential decision making.

作者信息

Poursiami Hamed, Moshruba Ayana, Cooper Keiland W, Gobin Derek, Kaiser Md Abdullah-Al, Singh Ankur, Noor Rouhan, Shahbaba Babak, Jaiswal Akhilesh, Fortin Norbert J, Parsa Maryam

机构信息

Department of Electrical and Computer Engineering, George Mason University, Virginia, USA.

Center for the Neurobiology of Learning and Memory, UC Irvine, Irvine, USA.

出版信息

Sci Rep. 2025 Jul 12;15(1):25221. doi: 10.1038/s41598-025-10586-x.

DOI:10.1038/s41598-025-10586-x

PMID:40652085

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12255686/

Abstract

Efficient decision-making in context-dependent, sequential tasks remains a fundamental challenge in reinforcement learning (RL). Inspired by the function of the brain's hippocampal system, we introduce Hippocampal-Augmented Memory Integration (HAMI), a biologically inspired memory-based RL framework that leverages symbolic indexing, hierarchical memory refinement, and structured episodic retrieval to enhance both learning efficiency and adaptability. We also propose Hierarchical Contextual Sequences (HiCoS), a structured RL environment grounded in neuroscience studies on episodic and sequence memory and context-driven decision-making, which serves as a controlled testbed for evaluating biologically inspired memory-based decision-making systems. Our experimental results demonstrate that HAMI achieves high decision accuracy and improved sample efficiency while maintaining low memory utilization. HAMI's architecture exhibits significantly lower inference latency than baseline memory-based methods, and its structured retrieval is well-suited for further hardware acceleration with non-volatile memory (NVM)-based content-addressable memory (CAM). By integrating biologically inspired memory mechanisms with structured symbolic representations, HAMI provides a scalable and efficient memory-based RL framework for tackling context-dependent sequential tasks.

摘要

在上下文相关的序列任务中进行高效决策仍然是强化学习（RL）中的一个基本挑战。受大脑海马体系统功能的启发，我们引入了海马体增强记忆整合（HAMI），这是一个受生物学启发的基于记忆的强化学习框架，它利用符号索引、分层记忆细化和结构化情节检索来提高学习效率和适应性。我们还提出了分层上下文序列（HiCoS），这是一个基于情景和序列记忆以及上下文驱动决策的神经科学研究构建的结构化强化学习环境，它作为一个受控测试平台，用于评估受生物学启发的基于记忆的决策系统。我们的实验结果表明，HAMI在保持低内存利用率的同时，实现了高决策准确性和提高的样本效率。HAMI的架构比基于记忆的基线方法具有显著更低的推理延迟，并且其结构化检索非常适合使用基于非易失性存储器（NVM）的内容可寻址存储器（CAM）进行进一步的硬件加速。通过将受生物学启发的记忆机制与结构化符号表示相结合，HAMI为处理上下文相关的序列任务提供了一个可扩展且高效的基于记忆的强化学习框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15c2/12255686/c2e78a921c77/41598_2025_10586_Fig1_HTML.jpg

相似文献

A scalable reinforcement learning framework inspired by hippocampal memory mechanisms for efficient contextual and sequential decision making.一种受海马体记忆机制启发的可扩展强化学习框架，用于高效的情境和序列决策。

Sci Rep. 2025 Jul 12;15(1):25221. doi: 10.1038/s41598-025-10586-x.

Short-Term Memory Impairment短期记忆障碍

Privacy-Preserving Glycemic Management in Type 1 Diabetes: Development and Validation of a Multiobjective Federated Reinforcement Learning Framework.1型糖尿病中保护隐私的血糖管理：多目标联邦强化学习框架的开发与验证

JMIR Diabetes. 2025 Jul 4;10:e72874. doi: 10.2196/72874.

Dynamic Regulation of the Serotonin-Dopamine Interaction Within a Meta-reinforcement Learning Framework Encompassing the Prefrontal Cortex and Basal Ganglia.在包含前额叶皮层和基底神经节的元强化学习框架内血清素-多巴胺相互作用的动态调节

Int J Neural Syst. 2025 Aug;35(8):2550040. doi: 10.1142/S0129065725500406.

Shapley value-driven multi-modal deep reinforcement learning for complex decision-making.用于复杂决策的沙普利值驱动多模态深度强化学习

Neural Netw. 2025 Nov;191:107650. doi: 10.1016/j.neunet.2025.107650. Epub 2025 Jun 21.

Leveraging machine learning to uncover the hidden links between trusting behavior and biological markers.利用机器学习揭示信任行为与生物标志物之间的潜在联系。

Dialogues Clin Neurosci. 2025 Dec;27(1):201-215. doi: 10.1080/19585969.2025.2513697. Epub 2025 Jun 20.

Exploration versus exploitation decisions in the human brain: A systematic review of functional neuroimaging and neuropsychological studies.人类大脑中的探索与开发决策：功能神经影像学和神经心理学研究的系统综述。

Neuropsychologia. 2024 Jan 10;192:108740. doi: 10.1016/j.neuropsychologia.2023.108740. Epub 2023 Nov 29.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Dissociating Frontal Lobe Lesion Induced Deficits in Rule Value Learning Using Reinforcement Learning Models and a WCST Analog.使用强化学习模型和威斯康星卡片分类测验模拟法分离额叶病变引起的规则价值学习缺陷

eNeuro. 2025 May 20;12(5). doi: 10.1523/ENEURO.0117-25.2025. Print 2025 May.

本文引用的文献

Human hippocampal CA3 uses specific functional connectivity rules for efficient associative memory.人类海马体CA3区利用特定的功能连接规则来实现高效的联想记忆。

Cell. 2025 Jan 23;188(2):501-514.e18. doi: 10.1016/j.cell.2024.11.022. Epub 2024 Dec 11.

Temporally extended successor feature neural episodic control.时间扩展后继特征神经情景控制

Sci Rep. 2024 Jul 2;14(1):15103. doi: 10.1038/s41598-024-65687-w.

Towards biologically plausible model-based reinforcement learning in recurrent spiking networks by dreaming new experiences.通过“梦想”新的体验，在递归尖峰网络中实现基于生物学合理性的基于模型的强化学习。

Sci Rep. 2024 Jun 25;14(1):14656. doi: 10.1038/s41598-024-65631-y.

Deep random forest with ferroelectric analog content addressable memory.具有铁电模拟内容可寻址存储器的深度随机森林。

Sci Adv. 2024 Jun 7;10(23):eadk8471. doi: 10.1126/sciadv.adk8471. Epub 2024 Jun 5.

A theory of hippocampal function: New developments.海马功能理论：新进展。

Prog Neurobiol. 2024 Jul;238:102636. doi: 10.1016/j.pneurobio.2024.102636. Epub 2024 Jun 2.

Assessments of dentate gyrus function: discoveries and debates.齿状回功能评估：发现与争议。

Nat Rev Neurosci. 2023 Aug;24(8):502-517. doi: 10.1038/s41583-023-00710-z. Epub 2023 Jun 14.

Hippocampal ensembles represent sequential relationships among an extended sequence of nonspatial events.海马体集合体表示非空间事件的扩展序列中顺序关系。

Nat Commun. 2022 Feb 8;13(1):787. doi: 10.1038/s41467-022-28057-6.

A temporal record of the past with a spectrum of time constants in the monkey entorhinal cortex.猴子内嗅皮层中具有时间常数谱的过去的时间记录。

Proc Natl Acad Sci U S A. 2020 Aug 18;117(33):20274-20283. doi: 10.1073/pnas.1917197117. Epub 2020 Aug 3.

Dynamics of Awake Hippocampal-Prefrontal Replay for Spatial Learning and Memory-Guided Decision Making.清醒状态下海马-前额叶的空间学习和记忆引导决策的回放动力学。

Neuron. 2019 Dec 18;104(6):1110-1125.e7. doi: 10.1016/j.neuron.2019.09.012. Epub 2019 Oct 30.

Grandmaster level in StarCraft II using multi-agent reinforcement learning.星际争霸 II 中的大师级水平使用多智能体强化学习。

Nature. 2019 Nov;575(7782):350-354. doi: 10.1038/s41586-019-1724-z. Epub 2019 Oct 30.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种受海马体记忆机制启发的可扩展强化学习框架，用于高效的情境和序列决策。

A scalable reinforcement learning framework inspired by hippocampal memory mechanisms for efficient contextual and sequential decision making.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献