Suppr超能文献

用于工作记忆和学习学习的具有生物学合理性的门控循环神经网络。

Biologically plausible gated recurrent neural networks for working memory and learning-to-learn.

作者信息

van den Berg Alexandra R, Roelfsema Pieter R, Bohte Sander M

机构信息

Machine Learning Group, Centrum Wiskunde & Informatica, Amsterdam, The Netherlands.

Department of Vision & Cognition, Netherlands Institute for Neuroscience, Amsterdam, The Netherlands.

出版信息

PLoS One. 2024 Dec 31;19(12):e0316453. doi: 10.1371/journal.pone.0316453. eCollection 2024.

Abstract

The acquisition of knowledge and skills does not occur in isolation but learning experiences amalgamate within and across domains. The process through which learning can accelerate over time is referred to as learning-to-learn or meta-learning. While meta-learning can be implemented in recurrent neural networks, these networks tend to be trained with architectures that are not easily interpretable or mappable to the brain and with learning rules that are biologically implausible. Specifically, these rules have often employed backpropagation-through-time, which relies on information that is unavailable at synapses that are undergoing plasticity in the brain. Previous studies that exclusively used local information for their weight updates had a limited capacity to integrate information over long timespans and could not easily learn-to-learn. Here, we propose a novel gated memory network named RECOLLECT, which can flexibly retain or forget information by means of a single memory gate and is trained with a biologically plausible trial-and-error-learning that requires only local information. We demonstrate that RECOLLECT successfully learns to represent task-relevant information over increasingly long memory delays in a pro-/anti-saccade task, and that it learns to flush its memory at the end of a trial. Moreover, we show that RECOLLECT can learn-to-learn an effective policy on a reversal bandit task. Finally, we show that the solutions acquired by RECOLLECT resemble how animals learn similar tasks.

摘要

知识和技能的获取并非孤立发生,而是学习经验在不同领域内部及之间相互融合。随着时间推移,学习得以加速的过程被称为学会学习或元学习。虽然元学习可在循环神经网络中实现,但这些网络往往采用不易解释或与大脑不匹配的架构进行训练,且使用的学习规则在生物学上也不合理。具体而言,这些规则常采用时间反向传播,而这依赖于大脑中正在发生可塑性变化的突触处无法获取的信息。以往那些仅使用局部信息进行权重更新的研究,在长时间跨度上整合信息的能力有限,且难以学会学习。在此,我们提出一种名为RECOLLECT的新型门控记忆网络,它能够通过单个记忆门灵活地保留或遗忘信息,并采用仅需局部信息的生物学上合理的试错学习进行训练。我们证明,在一个前瞻/反扫视任务中,RECOLLECT能够成功地在越来越长的记忆延迟中学习表征与任务相关的信息,并且它能在试验结束时学会清空其记忆。此外,我们表明RECOLLECT能够在一个反转强盗任务中学会学习一种有效的策略。最后,我们表明RECOLLECT获得的解决方案类似于动物学习类似任务的方式。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f123/11687930/6ca2610c8256/pone.0316453.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验