• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于首次放电的视觉分类,使用奖励调制的 STDP。

First-Spike-Based Visual Categorization Using Reward-Modulated STDP.

出版信息

IEEE Trans Neural Netw Learn Syst. 2018 Dec;29(12):6178-6190. doi: 10.1109/TNNLS.2018.2826721. Epub 2018 May 8.

DOI:10.1109/TNNLS.2018.2826721
PMID:29993898
Abstract

Reinforcement learning (RL) has recently regained popularity with major achievements such as beating the European game of Go champion. Here, for the first time, we show that RL can be used efficiently to train a spiking neural network (SNN) to perform object recognition in natural images without using an external classifier. We used a feedforward convolutional SNN and a temporal coding scheme where the most strongly activated neurons fire first, while less activated ones fire later, or not at all. In the highest layers, each neuron was assigned to an object category, and it was assumed that the stimulus category was the category of the first neuron to fire. If this assumption was correct, the neuron was rewarded, i.e., spike-timing-dependent plasticity (STDP) was applied, which reinforced the neuron's selectivity. Otherwise, anti-STDP was applied, which encouraged the neuron to learn something else. As demonstrated on various image data sets (Caltech, ETH-80, and NORB), this reward-modulated STDP (R-STDP) approach has extracted particularly discriminative visual features, whereas classic unsupervised STDP extracts any feature that consistently repeats. As a result, R-STDP has outperformed STDP on these data sets. Furthermore, R-STDP is suitable for online learning and can adapt to drastic changes such as label permutations. Finally, it is worth mentioning that both feature extraction and classification were done with spikes, using at most one spike per neuron. Thus, the network is hardware friendly and energy efficient.

摘要

强化学习 (RL) 最近在重大成就方面重新受到关注,例如击败欧洲围棋冠军。在这里,我们首次展示了 RL 可以有效地用于训练尖峰神经网络 (SNN),以便在不使用外部分类器的情况下在自然图像中执行对象识别。我们使用了前馈卷积 SNN 和时间编码方案,其中最强激活的神经元首先发射,而较弱激活的神经元则稍后发射,或者根本不发射。在最高层,每个神经元都被分配到一个对象类别,并且假设刺激类别是第一个发射神经元的类别。如果这个假设是正确的,那么神经元就会得到奖励,即应用了依赖于尖峰时间的可塑性 (STDP),这增强了神经元的选择性。否则,应用抗 STDP,这鼓励神经元学习其他东西。正如在各种图像数据集(Caltech、ETH-80 和 NORB)上所证明的那样,这种奖励调制的 STDP(R-STDP)方法提取了特别有区别的视觉特征,而经典的无监督 STDP 则提取任何一致重复的特征。因此,R-STDP 在这些数据集上的性能优于 STDP。此外,R-STDP 适用于在线学习,并且可以适应标签排列等剧烈变化。最后,值得一提的是,特征提取和分类都是使用尖峰完成的,每个神经元最多使用一个尖峰。因此,该网络对硬件友好且节能。

相似文献

1
First-Spike-Based Visual Categorization Using Reward-Modulated STDP.基于首次放电的视觉分类,使用奖励调制的 STDP。
IEEE Trans Neural Netw Learn Syst. 2018 Dec;29(12):6178-6190. doi: 10.1109/TNNLS.2018.2826721. Epub 2018 May 8.
2
STDP-based spiking deep convolutional neural networks for object recognition.基于 STDP 的尖峰深度卷积神经网络的目标识别。
Neural Netw. 2018 Mar;99:56-67. doi: 10.1016/j.neunet.2017.12.005. Epub 2017 Dec 23.
3
A learning theory for reward-modulated spike-timing-dependent plasticity with application to biofeedback.一种用于奖励调制的依赖于尖峰时间的可塑性的学习理论及其在生物反馈中的应用。
PLoS Comput Biol. 2008 Oct;4(10):e1000180. doi: 10.1371/journal.pcbi.1000180. Epub 2008 Oct 10.
4
A neuromorphic architecture for object recognition and motion anticipation using burst-STDP.基于突发式 STMDP 的目标识别和运动预测的神经形态架构
PLoS One. 2012;7(5):e36958. doi: 10.1371/journal.pone.0036958. Epub 2012 May 15.
5
Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity.通过调节尖峰时间依赖性突触可塑性进行强化学习。
Neural Comput. 2007 Jun;19(6):1468-502. doi: 10.1162/neco.2007.19.6.1468.
6
Competitive Learning in a Spiking Neural Network: Towards an Intelligent Pattern Classifier.尖峰神经网络中的竞争学习:迈向智能模式分类器。
Sensors (Basel). 2020 Jan 16;20(2):500. doi: 10.3390/s20020500.
7
Dynamic evolving spiking neural networks for on-line spatio- and spectro-temporal pattern recognition.用于在线时空谱模式识别的动态进化尖峰神经网络。
Neural Netw. 2013 May;41:188-201. doi: 10.1016/j.neunet.2012.11.014. Epub 2012 Dec 20.
8
Unsupervised learning of visual features through spike timing dependent plasticity.通过依赖于尖峰时间的可塑性进行视觉特征的无监督学习。
PLoS Comput Biol. 2007 Feb 16;3(2):e31. doi: 10.1371/journal.pcbi.0030031. Epub 2007 Jan 2.
9
A forecast-based STDP rule suitable for neuromorphic implementation.一种适用于神经形态实现的基于预测的 STDP 规则。
Neural Netw. 2012 Aug;32:3-14. doi: 10.1016/j.neunet.2012.02.018. Epub 2012 Feb 14.
10
Categorization and decision-making in a neurobiologically plausible spiking network using a STDP-like learning rule.使用类似于 STDP 的学习规则在具有神经生物学合理性的尖峰网络中进行分类和决策。
Neural Netw. 2013 Dec;48:109-24. doi: 10.1016/j.neunet.2013.07.012. Epub 2013 Aug 14.

引用本文的文献

1
Replicating associative learning of rodents with a neuromorphic robot in an open-field arena.在开放场环境中使用神经形态机器人复制啮齿动物的联想学习。
Front Neurosci. 2025 Jun 25;19:1565780. doi: 10.3389/fnins.2025.1565780. eCollection 2025.
2
Motion feature extraction using magnocellular-inspired spiking neural networks for drone detection.使用受大细胞启发的脉冲神经网络进行无人机检测的运动特征提取
Front Comput Neurosci. 2025 Jan 22;19:1452203. doi: 10.3389/fncom.2025.1452203. eCollection 2025.
3
Adaptive spatiotemporal neural networks through complementary hybridization.
通过互补杂交实现的自适应时空神经网络。
Nat Commun. 2024 Aug 27;15(1):7355. doi: 10.1038/s41467-024-51641-x.
4
On computational models of theory of mind and the imitative reinforcement learning in spiking neural networks.关于心理理论的计算模型和尖峰神经网络中的模仿强化学习。
Sci Rep. 2024 Jan 23;14(1):1945. doi: 10.1038/s41598-024-52299-7.
5
Enhanced representation learning with temporal coding in sparsely spiking neural networks.稀疏脉冲神经网络中基于时间编码的增强表示学习
Front Comput Neurosci. 2023 Nov 21;17:1250908. doi: 10.3389/fncom.2023.1250908. eCollection 2023.
6
[A bio-inspired hierarchical spiking neural network with biological synaptic plasticity for event camera object recognition].一种具有生物突触可塑性的用于事件相机目标识别的生物启发式分层脉冲神经网络
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2023 Aug 25;40(4):692-699. doi: 10.7507/1001-5515.202207040.
7
Voltage slope guided learning in spiking neural networks.脉冲神经网络中的电压斜率引导学习
Front Neurosci. 2022 Nov 10;16:1012964. doi: 10.3389/fnins.2022.1012964. eCollection 2022.
8
Training spiking neuronal networks to perform motor control using reinforcement and evolutionary learning.利用强化学习和进化学习训练脉冲神经网络以执行运动控制。
Front Comput Neurosci. 2022 Sep 30;16:1017284. doi: 10.3389/fncom.2022.1017284. eCollection 2022.
9
Linear leaky-integrate-and-fire neuron model based spiking neural networks and its mapping relationship to deep neural networks.基于线性泄漏积分发放神经元模型的脉冲神经网络及其与深度神经网络的映射关系。
Front Neurosci. 2022 Aug 24;16:857513. doi: 10.3389/fnins.2022.857513. eCollection 2022.
10
Backpropagation With Sparsity Regularization for Spiking Neural Network Learning.用于脉冲神经网络学习的带稀疏正则化的反向传播
Front Neurosci. 2022 Apr 14;16:760298. doi: 10.3389/fnins.2022.760298. eCollection 2022.