用于强化学习的基于内在可塑性编码的改进脉冲神经元网络。

Intrinsic plasticity coding improved spiking actor network for reinforcement learning.

作者信息

Liang Xingyue, Wu Qiaoyun, Liu Wenzhang, Zhou Yun, Tan Chunyu, Yin Hongfu, Sun Changyin

机构信息

School of Artificial Intelligence, Anhui University, Hefei, 230601, Anhui, China; Engineering Research Center of Autonomous Unmanned System Technology, Ministry of Education, Hefei, 230601, Anhui, China; Anhui Provincial Engineering Research Center for Unmanned Systems and Intelligent Technology, Hefei, 230601, Anhui, China.

School of Artificial Intelligence, Anhui University, Hefei, 230601, Anhui, China; Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, 230601, Anhui, China.

出版信息

Neural Netw. 2025 Apr;184:107054. doi: 10.1016/j.neunet.2024.107054. Epub 2024 Dec 19.

DOI:10.1016/j.neunet.2024.107054

PMID:39732066

Abstract

Deep reinforcement learning (DRL) exploits the powerful representational capabilities of deep neural networks (DNNs) and has achieved significant success. However, compared to DNNs, spiking neural networks (SNNs), which operate on binary signals, more closely resemble the biological characteristics of efficient learning observed in the brain. In SNNs, spiking neurons exhibit complex dynamic characteristics and learn based on principles of biological plasticity. Inspired by the brain's efficient computational mechanisms, information encoding plays a critical role in these networks. We propose an intrinsic plasticity coding improved spiking actor network (IP-SAN) for RL to achieve effective decision-making. The IP-SAN integrates adaptive population coding at the network level with dynamic spiking neuron coding at the neuron level, improving spatiotemporal state representation and promoting more accurate biological simulation. Experimental results show that our IP-SAN outperforms several state-of-the-art methods in five continuous control tasks.

摘要

深度强化学习（DRL）利用深度神经网络（DNN）强大的表征能力并取得了显著成功。然而，与DNN相比，基于二进制信号运行的脉冲神经网络（SNN）更类似于在大脑中观察到的高效学习的生物学特征。在SNN中，脉冲神经元表现出复杂的动态特性，并基于生物可塑性原理进行学习。受大脑高效计算机制的启发，信息编码在这些网络中起着关键作用。我们提出了一种用于强化学习的内在可塑性编码改进脉冲 actor 网络（IP-SAN），以实现有效的决策。IP-SAN 在网络层面集成了自适应群体编码，在神经元层面集成了动态脉冲神经元编码，改善了时空状态表征并促进了更精确的生物模拟。实验结果表明，我们的IP-SAN在五项连续控制任务中优于几种先进方法。

相似文献

Intrinsic plasticity coding improved spiking actor network for reinforcement learning.用于强化学习的基于内在可塑性编码的改进脉冲神经元网络。

Neural Netw. 2025 Apr;184:107054. doi: 10.1016/j.neunet.2024.107054. Epub 2024 Dec 19.

Multi-compartment neuron and population encoding powered spiking neural network for deep distributional reinforcement learning.用于深度分布式强化学习的多隔室神经元与群体编码驱动的脉冲神经网络

Neural Netw. 2025 Feb;182:106898. doi: 10.1016/j.neunet.2024.106898. Epub 2024 Nov 17.

Developmental Plasticity-Inspired Adaptive Pruning for Deep Spiking and Artificial Neural Networks.受发育可塑性启发的深度脉冲神经网络和人工神经网络自适应剪枝

IEEE Trans Pattern Anal Mach Intell. 2025 Jan;47(1):240-251. doi: 10.1109/TPAMI.2024.3467268. Epub 2024 Dec 4.

Reinforcement Learning in Spiking Neural Networks with Stochastic and Deterministic Synapses.具有随机和确定性突触的尖峰神经网络中的强化学习。

Neural Comput. 2019 Dec;31(12):2368-2389. doi: 10.1162/neco_a_01238. Epub 2019 Oct 15.

An unsupervised STDP-based spiking neural network inspired by biologically plausible learning rules and connections.一种基于无监督 STDP 的尖峰神经网络，灵感来自于具有生物学合理性的学习规则和连接。

Neural Netw. 2023 Aug;165:799-808. doi: 10.1016/j.neunet.2023.06.019. Epub 2023 Jun 22.

A review of learning in biologically plausible spiking neural networks.生物启发式尖峰神经网络学习的综述。

Neural Netw. 2020 Feb;122:253-272. doi: 10.1016/j.neunet.2019.09.036. Epub 2019 Oct 11.

Memory-Dependent Computation and Learning in Spiking Neural Networks Through Hebbian Plasticity.通过赫布可塑性实现脉冲神经网络中依赖记忆的计算与学习

IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):2551-2562. doi: 10.1109/TNNLS.2023.3341446. Epub 2025 Feb 6.

Toward robust and scalable deep spiking reinforcement learning.迈向稳健且可扩展的深度脉冲强化学习。

Front Neurorobot. 2023 Jan 20;16:1075647. doi: 10.3389/fnbot.2022.1075647. eCollection 2022.

Locally connected spiking neural networks for unsupervised feature learning.用于无监督特征学习的局部连接脉冲神经网络。

Neural Netw. 2019 Nov;119:332-340. doi: 10.1016/j.neunet.2019.08.016. Epub 2019 Aug 26.

Delay learning based on temporal coding in Spiking Neural Networks.基于尖峰神经网络的时间编码的延迟学习。

Neural Netw. 2024 Dec;180:106678. doi: 10.1016/j.neunet.2024.106678. Epub 2024 Aug 31.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于强化学习的基于内在可塑性编码的改进脉冲神经元网络。

Intrinsic plasticity coding improved spiking actor network for reinforcement learning.

作者信息

机构信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献