Lee Choongseop, Park Yuntae, Yoon Sungmin, Lee Jiwoon, Cho Youngho, Park Cheolsoo
Department of Computer Engineering, Kwangwoon University, Seoul, 01897 Republic of Korea.
Department of Electrical and Communication Engineering, Daelim University College, Anyang, 13916 Republic of Korea.
Biomed Eng Lett. 2024 Dec 2;15(1):37-55. doi: 10.1007/s13534-024-00436-6. eCollection 2025 Jan.
Robotic systems rely on spatio-temporal information to solve control tasks. With advancements in deep neural networks, reinforcement learning has significantly enhanced the performance of control tasks by leveraging deep learning techniques. However, as deep neural networks grow in complexity, they consume more energy and introduce greater latency. This complexity hampers their application in robotic systems that require real-time data processing. To address this issue, spiking neural networks, which emulate the biological brain by transmitting spatio-temporal information through spikes, have been developed alongside neuromorphic hardware that supports their operation. This paper reviews brain-inspired learning rules and examines the application of spiking neural networks in control tasks. We begin by exploring the features and implementations of biologically plausible spike-timing-dependent plasticity. Subsequently, we investigate the integration of a global third factor with spike-timing-dependent plasticity and its utilization and enhancements in both theoretical and applied research. We also discuss a method for locally applying a third factor that sophisticatedly modifies each synaptic weight through weight-based backpropagation. Finally, we review studies utilizing these learning rules to solve control tasks using spiking neural networks.
机器人系统依靠时空信息来解决控制任务。随着深度神经网络的发展,强化学习通过利用深度学习技术显著提高了控制任务的性能。然而,随着深度神经网络复杂度的增加,它们消耗更多能量并引入更大延迟。这种复杂性阻碍了它们在需要实时数据处理的机器人系统中的应用。为了解决这个问题,尖峰神经网络应运而生,它通过尖峰传输时空信息来模拟生物大脑,同时还开发了支持其运行的神经形态硬件。本文回顾了受大脑启发的学习规则,并研究了尖峰神经网络在控制任务中的应用。我们首先探讨生物学上合理的基于尖峰时间的可塑性的特征和实现。随后,我们研究了全局第三因素与基于尖峰时间的可塑性的整合及其在理论和应用研究中的利用和增强。我们还讨论了一种通过基于权重的反向传播来局部应用第三因素以精细修改每个突触权重的方法。最后,我们回顾了利用这些学习规则使用尖峰神经网络解决控制任务的研究。