• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过集群总体胜者全得读出机制实现参数工作记忆的储层网络模型的高效强化学习。

Efficient reinforcement learning of a reservoir network model of parametric working memory achieved with a cluster population winner-take-all readout mechanism.

作者信息

Cheng Zhenbo, Deng Zhidong, Hu Xiaolin, Zhang Bo, Yang Tianming

机构信息

State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing, China; Department of Computer Science and Technology, Zhejiang University of Technology, Hangzhou, China; and.

State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing, China;

出版信息

J Neurophysiol. 2015 Dec;114(6):3296-305. doi: 10.1152/jn.00378.2015. Epub 2015 Oct 7.

DOI:10.1152/jn.00378.2015
PMID:26445865
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6712583/
Abstract

The brain often has to make decisions based on information stored in working memory, but the neural circuitry underlying working memory is not fully understood. Many theoretical efforts have been focused on modeling the persistent delay period activity in the prefrontal areas that is believed to represent working memory. Recent experiments reveal that the delay period activity in the prefrontal cortex is neither static nor homogeneous as previously assumed. Models based on reservoir networks have been proposed to model such a dynamical activity pattern. The connections between neurons within a reservoir are random and do not require explicit tuning. Information storage does not depend on the stable states of the network. However, it is not clear how the encoded information can be retrieved for decision making with a biologically realistic algorithm. We therefore built a reservoir-based neural network to model the neuronal responses of the prefrontal cortex in a somatosensory delayed discrimination task. We first illustrate that the neurons in the reservoir exhibit a heterogeneous and dynamical delay period activity observed in previous experiments. Then we show that a cluster population circuit decodes the information from the reservoir with a winner-take-all mechanism and contributes to the decision making. Finally, we show that the model achieves a good performance rapidly by shaping only the readout with reinforcement learning. Our model reproduces important features of previous behavior and neurophysiology data. We illustrate for the first time how task-specific information stored in a reservoir network can be retrieved with a biologically plausible reinforcement learning training scheme.

摘要

大脑常常需要依据工作记忆中存储的信息做出决策,但工作记忆背后的神经回路尚未被完全理解。许多理论研究致力于对前额叶区域中持续的延迟期活动进行建模,这种活动被认为代表着工作记忆。最近的实验表明,前额叶皮层中的延迟期活动既不像之前所假设的那样静止不变,也并非均匀一致。基于储层网络的模型已被提出用于对这种动态活动模式进行建模。储层内神经元之间的连接是随机的,不需要进行明确的调整。信息存储并不依赖于网络的稳定状态。然而,目前尚不清楚如何通过一种符合生物学现实的算法来检索编码信息以用于决策。因此,我们构建了一个基于储层的神经网络,以对体感延迟辨别任务中前额叶皮层的神经元反应进行建模。我们首先表明,储层中的神经元呈现出先前实验中观察到的异质性和动态延迟期活动。然后我们展示了一个聚类群体回路通过胜者全得机制对来自储层的信息进行解码,并有助于决策。最后,我们表明该模型仅通过强化学习对读出进行塑造就能快速实现良好的性能。我们的模型重现了先前行为和神经生理学数据的重要特征。我们首次说明了如何通过一种具有生物学合理性的强化学习训练方案来检索储层网络中存储的特定任务信息。

相似文献

1
Efficient reinforcement learning of a reservoir network model of parametric working memory achieved with a cluster population winner-take-all readout mechanism.通过集群总体胜者全得读出机制实现参数工作记忆的储层网络模型的高效强化学习。
J Neurophysiol. 2015 Dec;114(6):3296-305. doi: 10.1152/jn.00378.2015. Epub 2015 Oct 7.
2
A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning.一个用于强化学习期间眶额皮质和任务空间获取的神经网络模型。
PLoS Comput Biol. 2018 Jan 4;14(1):e1005925. doi: 10.1371/journal.pcbi.1005925. eCollection 2018 Jan.
3
Reward-dependent learning in neuronal networks for planning and decision making.用于规划和决策的神经网络中基于奖励的学习。
Prog Brain Res. 2000;126:217-29. doi: 10.1016/S0079-6123(00)26016-0.
4
A working memory model based on fast Hebbian learning.基于快速赫布学习的工作记忆模型。
Network. 2003 Nov;14(4):789-802.
5
Stable population coding for working memory coexists with heterogeneous neural dynamics in prefrontal cortex.工作记忆的稳定群体编码与前额叶皮层中异质性神经动力学共存。
Proc Natl Acad Sci U S A. 2017 Jan 10;114(2):394-399. doi: 10.1073/pnas.1619449114. Epub 2016 Dec 27.
6
A model of visuospatial working memory in prefrontal cortex: recurrent network and cellular bistability.前额叶皮质中视觉空间工作记忆的模型:循环网络与细胞双稳性。
J Comput Neurosci. 1998 Dec;5(4):383-405. doi: 10.1023/a:1008837311948.
7
Monkey prefrontal neurons during Sternberg task performance: full contents of working memory or most recent item?执行斯特恩伯格任务期间猴子前额叶神经元:工作记忆的全部内容还是最新项目?
J Neurophysiol. 2017 Jun 1;117(6):2269-2281. doi: 10.1152/jn.00541.2016. Epub 2017 Mar 22.
8
Mnemonic Encoding and Cortical Organization in Parietal and Prefrontal Cortices.顶叶和前额叶皮质中的记忆编码与皮质组织
J Neurosci. 2017 Jun 21;37(25):6098-6112. doi: 10.1523/JNEUROSCI.3903-16.2017. Epub 2017 May 24.
9
Attention and working memory: a dynamical model of neuronal activity in the prefrontal cortex.注意力与工作记忆:前额叶皮层神经元活动的动力学模型
Eur J Neurosci. 2003 Oct;18(8):2374-90. doi: 10.1046/j.1460-9568.2003.02956.x.
10
Dopamine-mediated stabilization of delay-period activity in a network model of prefrontal cortex.多巴胺在前额叶皮质网络模型中对延迟期活动的介导性稳定作用
J Neurophysiol. 2000 Mar;83(3):1733-50. doi: 10.1152/jn.2000.83.3.1733.

引用本文的文献

1
A view-based decision mechanism for rewards in the primate amygdala.基于视图的灵长类动物杏仁核奖励决策机制。
Neuron. 2023 Dec 6;111(23):3871-3884.e14. doi: 10.1016/j.neuron.2023.08.024. Epub 2023 Sep 18.
2
A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning.一个用于强化学习期间眶额皮质和任务空间获取的神经网络模型。
PLoS Comput Biol. 2018 Jan 4;14(1):e1005925. doi: 10.1371/journal.pcbi.1005925. eCollection 2018 Jan.

本文引用的文献

1
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.
2
Two-stage model in perceptual learning: toward a unified theory.两阶段模型在感知学习中的应用:走向统一理论。
Ann N Y Acad Sci. 2014 May;1316:18-28. doi: 10.1111/nyas.12419. Epub 2014 Apr 23.
3
Working models of working memory.工作记忆的工作模型。
Curr Opin Neurobiol. 2014 Apr;25:20-4. doi: 10.1016/j.conb.2013.10.008. Epub 2013 Dec 4.
4
Robust timing and motor patterns by taming chaos in recurrent neural networks.通过驯服递归神经网络中的混沌来实现强健的时间和运动模式。
Nat Neurosci. 2013 Jul;16(7):925-33. doi: 10.1038/nn.3405. Epub 2013 May 26.
5
From fixed points to chaos: three models of delayed discrimination.从不动点到混沌:三种延迟辨别模型。
Prog Neurobiol. 2013 Apr;103:214-22. doi: 10.1016/j.pneurobio.2013.02.002. Epub 2013 Feb 21.
6
Hippocampal representation of touch-guided behavior in rats: persistent and independent traces of stimulus and reward location.大鼠触压引导行为的海马表征:刺激和奖励位置的持久且独立的痕迹。
PLoS One. 2011 Jan 28;6(1):e16462. doi: 10.1371/journal.pone.0016462.
7
Internal representation of task rules by recurrent dynamics: the importance of the diversity of neural responses.递归动力学对任务规则的内部表示:神经反应多样性的重要性。
Front Comput Neurosci. 2010 Oct 4;4:24. doi: 10.3389/fncom.2010.00024. eCollection 2010.
8
Intrinsic biophysical diversity decorrelates neuronal firing while increasing information content.内在生物物理多样性在增加信息含量的同时使神经元放电去相关。
Nat Neurosci. 2010 Oct;13(10):1276-82. doi: 10.1038/nn.2630. Epub 2010 Aug 29.
9
Neuronal population coding of parametric working memory.参数工作记忆的神经元群体编码。
J Neurosci. 2010 Jul 14;30(28):9424-30. doi: 10.1523/JNEUROSCI.1875-10.2010.
10
Synaptic dynamics and decision making.突触动力学与决策。
Proc Natl Acad Sci U S A. 2010 Apr 20;107(16):7545-9. doi: 10.1073/pnas.1002333107. Epub 2010 Apr 1.