• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于工作记忆和学习学习的具有生物学合理性的门控循环神经网络。

Biologically plausible gated recurrent neural networks for working memory and learning-to-learn.

作者信息

van den Berg Alexandra R, Roelfsema Pieter R, Bohte Sander M

机构信息

Machine Learning Group, Centrum Wiskunde & Informatica, Amsterdam, The Netherlands.

Department of Vision & Cognition, Netherlands Institute for Neuroscience, Amsterdam, The Netherlands.

出版信息

PLoS One. 2024 Dec 31;19(12):e0316453. doi: 10.1371/journal.pone.0316453. eCollection 2024.

DOI:10.1371/journal.pone.0316453
PMID:39739908
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11687930/
Abstract

The acquisition of knowledge and skills does not occur in isolation but learning experiences amalgamate within and across domains. The process through which learning can accelerate over time is referred to as learning-to-learn or meta-learning. While meta-learning can be implemented in recurrent neural networks, these networks tend to be trained with architectures that are not easily interpretable or mappable to the brain and with learning rules that are biologically implausible. Specifically, these rules have often employed backpropagation-through-time, which relies on information that is unavailable at synapses that are undergoing plasticity in the brain. Previous studies that exclusively used local information for their weight updates had a limited capacity to integrate information over long timespans and could not easily learn-to-learn. Here, we propose a novel gated memory network named RECOLLECT, which can flexibly retain or forget information by means of a single memory gate and is trained with a biologically plausible trial-and-error-learning that requires only local information. We demonstrate that RECOLLECT successfully learns to represent task-relevant information over increasingly long memory delays in a pro-/anti-saccade task, and that it learns to flush its memory at the end of a trial. Moreover, we show that RECOLLECT can learn-to-learn an effective policy on a reversal bandit task. Finally, we show that the solutions acquired by RECOLLECT resemble how animals learn similar tasks.

摘要

知识和技能的获取并非孤立发生,而是学习经验在不同领域内部及之间相互融合。随着时间推移,学习得以加速的过程被称为学会学习或元学习。虽然元学习可在循环神经网络中实现,但这些网络往往采用不易解释或与大脑不匹配的架构进行训练,且使用的学习规则在生物学上也不合理。具体而言,这些规则常采用时间反向传播,而这依赖于大脑中正在发生可塑性变化的突触处无法获取的信息。以往那些仅使用局部信息进行权重更新的研究,在长时间跨度上整合信息的能力有限,且难以学会学习。在此,我们提出一种名为RECOLLECT的新型门控记忆网络,它能够通过单个记忆门灵活地保留或遗忘信息,并采用仅需局部信息的生物学上合理的试错学习进行训练。我们证明,在一个前瞻/反扫视任务中,RECOLLECT能够成功地在越来越长的记忆延迟中学习表征与任务相关的信息,并且它能在试验结束时学会清空其记忆。此外,我们表明RECOLLECT能够在一个反转强盗任务中学会学习一种有效的策略。最后,我们表明RECOLLECT获得的解决方案类似于动物学习类似任务的方式。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/e8ce4e5ddb5e/pone.0316453.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/6ca2610c8256/pone.0316453.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/03a01c13b982/pone.0316453.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/047520ab74ec/pone.0316453.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/6ba7da01c983/pone.0316453.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/a78ab778d6a7/pone.0316453.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/58db057a7821/pone.0316453.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/57cb06e2628a/pone.0316453.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/b9198fc520bd/pone.0316453.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/e8ce4e5ddb5e/pone.0316453.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/6ca2610c8256/pone.0316453.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/03a01c13b982/pone.0316453.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/047520ab74ec/pone.0316453.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/6ba7da01c983/pone.0316453.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/a78ab778d6a7/pone.0316453.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/58db057a7821/pone.0316453.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/57cb06e2628a/pone.0316453.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/b9198fc520bd/pone.0316453.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/e8ce4e5ddb5e/pone.0316453.g009.jpg

相似文献

1
Biologically plausible gated recurrent neural networks for working memory and learning-to-learn.用于工作记忆和学习学习的具有生物学合理性的门控循环神经网络。
PLoS One. 2024 Dec 31;19(12):e0316453. doi: 10.1371/journal.pone.0316453. eCollection 2024.
2
Flexible Working Memory Through Selective Gating and Attentional Tagging.通过选择性门控和注意力标记实现灵活的工作记忆。
Neural Comput. 2021 Jan;33(1):1-40. doi: 10.1162/neco_a_01339. Epub 2020 Oct 20.
3
Biologically plausible learning in recurrent neural networks reproduces neural dynamics observed during cognitive tasks.循环神经网络中符合生物学原理的学习再现了认知任务期间观察到的神经动力学。
Elife. 2017 Feb 23;6:e20899. doi: 10.7554/eLife.20899.
4
A self-organizing short-term dynamical memory network.自组织短期动力记忆网络。
Neural Netw. 2018 Oct;106:30-41. doi: 10.1016/j.neunet.2018.06.008. Epub 2018 Jun 20.
5
Recurrent neural networks that learn multi-step visual routines with reinforcement learning.基于强化学习的循环神经网络,可用于学习多步骤视觉常规。
PLoS Comput Biol. 2024 Apr 29;20(4):e1012030. doi: 10.1371/journal.pcbi.1012030. eCollection 2024 Apr.
6
Towards biologically plausible model-based reinforcement learning in recurrent spiking networks by dreaming new experiences.通过“梦想”新的体验,在递归尖峰网络中实现基于生物学合理性的基于模型的强化学习。
Sci Rep. 2024 Jun 25;14(1):14656. doi: 10.1038/s41598-024-65631-y.
7
How attention can create synaptic tags for the learning of working memories in sequential tasks.在序列任务中,注意力如何为工作记忆的学习创建突触标签。
PLoS Comput Biol. 2015 Mar 5;11(3):e1004060. doi: 10.1371/journal.pcbi.1004060. eCollection 2015 Mar.
8
Pruning recurrent neural networks replicates adolescent changes in working memory and reinforcement learning.修剪递归神经网络复制了工作记忆和强化学习中的青少年变化。
Proc Natl Acad Sci U S A. 2022 May 31;119(22):e2121331119. doi: 10.1073/pnas.2121331119. Epub 2022 May 27.
9
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.
10
Target spike patterns enable efficient and biologically plausible learning for complex temporal tasks.目标尖峰模式可实现高效且在生物学上合理的复杂时间任务学习。
PLoS One. 2021 Feb 16;16(2):e0247014. doi: 10.1371/journal.pone.0247014. eCollection 2021.

本文引用的文献

1
Fast learning without synaptic plasticity in spiking neural networks.尖峰神经网络中的无突触可塑性快速学习。
Sci Rep. 2024 Apr 12;14(1):8557. doi: 10.1038/s41598-024-55769-0.
2
Meta-SpikePropamine: learning to learn with synaptic plasticity in spiking neural networks.元尖峰传播胺:在脉冲神经网络中通过突触可塑性进行学习学习
Front Neurosci. 2023 May 12;17:1183321. doi: 10.3389/fnins.2023.1183321. eCollection 2023.
3
Choice-selective sequences dominate in cortical relative to thalamic inputs to NAc to support reinforcement learning.
皮层对 NAc 的输入比丘脑的输入更具有选择选择性,从而支持强化学习。
Cell Rep. 2022 May 17;39(7):110756. doi: 10.1016/j.celrep.2022.110756.
4
A behavioural correlate of the synaptic eligibility trace in the nucleus accumbens.伏隔核中突触合格痕迹的行为相关性。
Sci Rep. 2022 Feb 4;12(1):1921. doi: 10.1038/s41598-022-05637-6.
5
A cortico-basal ganglia-thalamo-cortical channel underlying short-term memory.一条构成短期记忆基础的皮质-基底神经节-丘脑-皮质通路。
Neuron. 2021 Nov 3;109(21):3486-3499.e7. doi: 10.1016/j.neuron.2021.08.002. Epub 2021 Aug 31.
6
Latent learning, cognitive maps, and curiosity.潜伏学习、认知地图与好奇心。
Curr Opin Behav Sci. 2021 Apr;38:1-7. doi: 10.1016/j.cobeha.2020.06.003. Epub 2020 Jul 17.
7
Diversity and dynamism in the cerebellum.小脑的多样性与活力。
Nat Neurosci. 2021 Feb;24(2):160-167. doi: 10.1038/s41593-020-00754-9. Epub 2020 Dec 7.
8
Stimulus-Specific Visual Working Memory Representations in Human Cerebellar Lobule VIIb/VIIIa.刺激特异性视觉工作记忆在人类小脑VIIb/VIIIa 叶的表现。
J Neurosci. 2021 Feb 3;41(5):1033-1045. doi: 10.1523/JNEUROSCI.1253-20.2020. Epub 2020 Nov 19.
9
Flexible Working Memory Through Selective Gating and Attentional Tagging.通过选择性门控和注意力标记实现灵活的工作记忆。
Neural Comput. 2021 Jan;33(1):1-40. doi: 10.1162/neco_a_01339. Epub 2020 Oct 20.
10
A solution to the learning dilemma for recurrent networks of spiking neurons.用于尖峰神经元递归网络的学习困境的解决方案。
Nat Commun. 2020 Jul 17;11(1):3625. doi: 10.1038/s41467-020-17236-y.