• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于势归一化解决脉冲深度Q网络中的脉冲特征信息消失问题。

Solving the spike feature information vanishing problem in spiking deep Q network with potential based normalization.

作者信息

Sun Yinqian, Zeng Yi, Li Yang

机构信息

Research Center for Brain-Inspired Intelligence, Institute of Automation, Chinese Academy of Sciences, Beijing, China.

School of Future Technology, University of Chinese Academy of Sciences, Beijing, China.

出版信息

Front Neurosci. 2022 Aug 25;16:953368. doi: 10.3389/fnins.2022.953368. eCollection 2022.

DOI:10.3389/fnins.2022.953368
PMID:36090282
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9453154/
Abstract

Brain-inspired spiking neural networks (SNNs) are successfully applied to many pattern recognition domains. The SNNs-based deep structure has achieved considerable results in perceptual tasks, such as image classification and target detection. However, applying deep SNNs in reinforcement learning (RL) tasks is still a problem to be explored. Although there have been previous studies on the combination of SNNs and RL, most focus on robotic control problems with shallow networks or using the ANN-SNN conversion method to implement spiking deep Q networks (SDQN). In this study, we mathematically analyzed the problem of the disappearance of spiking signal features in SDQN and proposed a potential-based layer normalization (pbLN) method to train spiking deep Q networks directly. Experiment shows that compared with state-of-art ANN-SNN conversion method and other SDQN works, the proposed pbLN spiking deep Q networks (PL-SDQN) achieved better performance on Atari game tasks.

摘要

受脑启发的脉冲神经网络(SNN)已成功应用于许多模式识别领域。基于SNN的深度结构在诸如图像分类和目标检测等感知任务中取得了显著成果。然而,将深度SNN应用于强化学习(RL)任务仍是一个有待探索的问题。尽管之前已有关于SNN与RL相结合的研究,但大多数研究集中在使用浅层网络的机器人控制问题上,或使用人工神经网络-脉冲神经网络(ANN-SNN)转换方法来实现脉冲深度Q网络(SDQN)。在本研究中,我们从数学角度分析了SDQN中脉冲信号特征消失的问题,并提出了一种基于势的层归一化(pbLN)方法来直接训练脉冲深度Q网络。实验表明,与当前最先进的ANN-SNN转换方法及其他SDQN研究相比,所提出的基于pbLN的脉冲深度Q网络(PL-SDQN)在雅达利游戏任务中表现更佳。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57c2/9453154/e6c9dbd52608/fnins-16-953368-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57c2/9453154/4f846a9ba4bd/fnins-16-953368-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57c2/9453154/a69c28a84d46/fnins-16-953368-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57c2/9453154/51e3db058023/fnins-16-953368-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57c2/9453154/e440876efe2e/fnins-16-953368-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57c2/9453154/45bb15213b7d/fnins-16-953368-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57c2/9453154/e6c9dbd52608/fnins-16-953368-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57c2/9453154/4f846a9ba4bd/fnins-16-953368-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57c2/9453154/a69c28a84d46/fnins-16-953368-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57c2/9453154/51e3db058023/fnins-16-953368-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57c2/9453154/e440876efe2e/fnins-16-953368-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57c2/9453154/45bb15213b7d/fnins-16-953368-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57c2/9453154/e6c9dbd52608/fnins-16-953368-g0006.jpg

相似文献

1
Solving the spike feature information vanishing problem in spiking deep Q network with potential based normalization.基于势归一化解决脉冲深度Q网络中的脉冲特征信息消失问题。
Front Neurosci. 2022 Aug 25;16:953368. doi: 10.3389/fnins.2022.953368. eCollection 2022.
2
Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to Atari Breakout game.强化学习策略在转换到应用于雅达利打破块游戏的尖峰神经元网络平台后的鲁棒性提高。
Neural Netw. 2019 Dec;120:108-115. doi: 10.1016/j.neunet.2019.08.009. Epub 2019 Aug 25.
3
Human-Level Control Through Directly Trained Deep Spiking Q-Networks.通过直接训练的深度脉冲Q网络实现人类水平的控制。
IEEE Trans Cybern. 2023 Nov;53(11):7187-7198. doi: 10.1109/TCYB.2022.3198259. Epub 2023 Oct 17.
4
Enabling Spike-Based Backpropagation for Training Deep Neural Network Architectures.实现基于尖峰的反向传播以训练深度神经网络架构。
Front Neurosci. 2020 Feb 28;14:119. doi: 10.3389/fnins.2020.00119. eCollection 2020.
5
Toward robust and scalable deep spiking reinforcement learning.迈向稳健且可扩展的深度脉冲强化学习。
Front Neurorobot. 2023 Jan 20;16:1075647. doi: 10.3389/fnbot.2022.1075647. eCollection 2022.
6
Optimizing Deeper Spiking Neural Networks for Dynamic Vision Sensing.深度尖峰神经网络在动态视觉传感中的优化。
Neural Netw. 2021 Dec;144:686-698. doi: 10.1016/j.neunet.2021.09.022. Epub 2021 Oct 5.
7
SSTDP: Supervised Spike Timing Dependent Plasticity for Efficient Spiking Neural Network Training.SSTDP:用于高效脉冲神经网络训练的监督式脉冲时间依赖可塑性
Front Neurosci. 2021 Nov 4;15:756876. doi: 10.3389/fnins.2021.756876. eCollection 2021.
8
A Tandem Learning Rule for Effective Training and Rapid Inference of Deep Spiking Neural Networks.一种用于深度脉冲神经网络有效训练和快速推理的串联学习规则。
IEEE Trans Neural Netw Learn Syst. 2023 Jan;34(1):446-460. doi: 10.1109/TNNLS.2021.3095724. Epub 2023 Jan 5.
9
Rethinking the performance comparison between SNNS and ANNS.重新思考 SNNS 和 ANNS 的性能比较。
Neural Netw. 2020 Jan;121:294-307. doi: 10.1016/j.neunet.2019.09.005. Epub 2019 Sep 19.
10
Training Spiking Neural Networks for Reinforcement Learning Tasks With Temporal Coding Method.使用时间编码方法为强化学习任务训练脉冲神经网络。
Front Neurosci. 2022 Aug 17;16:877701. doi: 10.3389/fnins.2022.877701. eCollection 2022.

引用本文的文献

1
Exploring spiking neural networks for deep reinforcement learning in robotic tasks.探索用于机器人任务中深度强化学习的脉冲神经网络。
Sci Rep. 2024 Dec 28;14(1):30648. doi: 10.1038/s41598-024-77779-8.
2
BayesianSpikeFusion: accelerating spiking neural network inference via Bayesian fusion of early prediction.贝叶斯脉冲融合:通过早期预测的贝叶斯融合加速脉冲神经网络推理
Front Neurosci. 2024 Aug 5;18:1420119. doi: 10.3389/fnins.2024.1420119. eCollection 2024.
3
Brain-inspired neural circuit evolution for spiking neural networks.基于脑启发的尖峰神经网络神经电路进化。

本文引用的文献

1
Human-Level Control Through Directly Trained Deep Spiking Q-Networks.通过直接训练的深度脉冲Q网络实现人类水平的控制。
IEEE Trans Cybern. 2023 Nov;53(11):7187-7198. doi: 10.1109/TCYB.2022.3198259. Epub 2023 Oct 17.
2
Decoding Pixel-Level Image Features From Two-Photon Calcium Signals of Macaque Visual Cortex.从猕猴视觉皮层的双光子钙信号中解码像素级图像特征
Neural Comput. 2022 May 19;34(6):1369-1397. doi: 10.1162/neco_a_01498.
3
Robust Transcoding Sensory Information With Neural Spikes.通过神经尖峰稳健转码感官信息。
Proc Natl Acad Sci U S A. 2023 Sep 26;120(39):e2218173120. doi: 10.1073/pnas.2218173120. Epub 2023 Sep 20.
4
BrainCog: A spiking neural network based, brain-inspired cognitive intelligence engine for brain-inspired AI and brain simulation.BrainCog:一种基于脉冲神经网络的、受大脑启发的认知智能引擎,用于受大脑启发的人工智能和大脑模拟。
Patterns (N Y). 2023 Jul 6;4(8):100789. doi: 10.1016/j.patter.2023.100789. eCollection 2023 Aug 11.
5
A brain-inspired theory of mind spiking neural network improves multi-agent cooperation and competition.一种受大脑启发的心智脉冲神经网络理论可改善多智能体合作与竞争。
Patterns (N Y). 2023 Jun 23;4(8):100775. doi: 10.1016/j.patter.2023.100775. eCollection 2023 Aug 11.
IEEE Trans Neural Netw Learn Syst. 2022 May;33(5):1935-1946. doi: 10.1109/TNNLS.2021.3107449. Epub 2022 May 2.
4
Quantum superposition inspired spiking neural network.受量子叠加启发的脉冲神经网络。
iScience. 2021 Jul 20;24(8):102880. doi: 10.1016/j.isci.2021.102880. eCollection 2021 Aug 20.
5
Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to Atari Breakout game.强化学习策略在转换到应用于雅达利打破块游戏的尖峰神经元网络平台后的鲁棒性提高。
Neural Netw. 2019 Dec;120:108-115. doi: 10.1016/j.neunet.2019.08.009. Epub 2019 Aug 25.
6
Striatal circuits for reward learning and decision-making.纹状体回路在奖励学习和决策中的作用。
Nat Rev Neurosci. 2019 Aug;20(8):482-494. doi: 10.1038/s41583-019-0189-2.
7
A Brain-Inspired Decision-Making Spiking Neural Network and Its Application in Unmanned Aerial Vehicle.一种受脑启发的决策脉冲神经网络及其在无人机中的应用。
Front Neurorobot. 2018 Sep 11;12:56. doi: 10.3389/fnbot.2018.00056. eCollection 2018.
8
Spiking neural networks for computer vision.用于计算机视觉的脉冲神经网络。
Interface Focus. 2018 Aug 6;8(4):20180007. doi: 10.1098/rsfs.2018.0007. Epub 2018 Jun 15.
9
Spatio-Temporal Backpropagation for Training High-Performance Spiking Neural Networks.用于训练高性能脉冲神经网络的时空反向传播
Front Neurosci. 2018 May 23;12:331. doi: 10.3389/fnins.2018.00331. eCollection 2018.
10
Conversion of Continuous-Valued Deep Networks to Efficient Event-Driven Networks for Image Classification.将连续值深度网络转换为用于图像分类的高效事件驱动网络
Front Neurosci. 2017 Dec 7;11:682. doi: 10.3389/fnins.2017.00682. eCollection 2017.