School of Data and Computer Science, Sun Yat-Sen University, China; Department of Computer Science, Technical University of Munich, Germany.
Neural Netw. 2020 Jan;121:21-36. doi: 10.1016/j.neunet.2019.05.019. Epub 2019 Jul 9.
Building spiking neural networks (SNNs) based on biological synaptic plasticity holds promising potential for fast and energy-efficient computing, which benefits mobile robotic applications. However, implementations of SNNs in robotics remain limited by the lack of practical training methods. In this paper, we therefore introduce both indirect and direct end-to-end training methods of SNNs for a lane-keeping vehicle. First, we adopt a policy learned using the deep Q-learning (DQN) algorithm and then transfer it to an SNN via supervised learning. Second, we adopt reward-modulated spike-timing-dependent plasticity (R-STDP) for training SNNs directly, since it combines the advantages of reinforcement learning and the well-known spike-timing-dependent plasticity (STDP). We examine the proposed approaches in three scenarios in which a robot is controlled to stay within lane markings using an event-based neuromorphic vision sensor. We further demonstrate the advantages of the R-STDP approach in terms of lateral localization accuracy and training time steps by comparing it with the other three algorithms presented in this paper.
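To make the R-STDP mechanism mentioned above concrete, the following is a minimal sketch of a reward-modulated STDP update in the common eligibility-trace formulation: the STDP window feeds a decaying eligibility trace per synapse, and a scalar reward gates whether that trace is actually written into the weights. All names, constants, and the specific trace formulation here are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

n_pre, n_post = 4, 2
w = rng.uniform(0.0, 0.5, size=(n_pre, n_post))   # synaptic weights (illustrative init)
elig = np.zeros_like(w)                            # eligibility traces, one per synapse
tau_e = 50.0   # eligibility-trace time constant (ms); assumed value
lr = 0.01      # learning rate; assumed value
dt = 1.0       # simulation step (ms)

def stdp_window(dt_spike, a_plus=1.0, a_minus=1.0, tau=20.0):
    """Classic exponential STDP window: pre-before-post (dt >= 0) potentiates,
    post-before-pre (dt < 0) depresses."""
    return np.where(dt_spike >= 0,
                    a_plus * np.exp(-dt_spike / tau),
                    -a_minus * np.exp(dt_spike / tau))

def r_stdp_step(w, elig, pre_post_dt, reward):
    """One R-STDP update: STDP accumulates into the eligibility trace,
    and the reward signal gates the actual weight change."""
    elig = elig * np.exp(-dt / tau_e) + stdp_window(pre_post_dt)
    w = np.clip(w + lr * reward * elig, 0.0, 1.0)
    return w, elig

# Example: every pre neuron fires 5 ms before its post neuron, reward is positive,
# so causal timing plus positive reward strengthens all weights.
pre_post_dt = np.full((n_pre, n_post), 5.0)
w_new, elig = r_stdp_step(w, elig, pre_post_dt, reward=+1.0)
assert np.all(w_new >= w)
```

With a negative reward the same causal spike timing would weaken the weights instead, which is what lets a task-level error signal (e.g. lateral deviation from the lane center) steer purely local STDP toward the desired behavior.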