IEEE Trans Biomed Circuits Syst. 2019 Jun;13(3):481-492. doi: 10.1109/TBCAS.2019.2907882. Epub 2019 Mar 27.
This paper presents an efficient hardware architecture for the design and implementation of a spike sorting system using online template matching. Over the past decade, various spike sorting algorithms have been proposed; however, due to their computational complexity, they may not be suitable for implantable devices that have stringent area and power consumption requirements. We first developed a software-based spike sorting system in both floating-point and fixed-point representations. Then, we used our developed software-based spike sorting system for: 1) studying various neural signal processing algorithms and assessing their feasibility for efficient hardware implementations; and 2) offline processing of previously recorded neural data and extracting the threshold data and spike templates for configuring our spike sorting hardware architecture. The characteristics and implementation results of the designed spike sorting system on a Xilinx Artix-7 A200TFBG676-2 field-programmable gate array are presented. The application-specific integrated circuit (ASIC) implementation of the designed spike sorting system is estimated to occupy 0.3 mm. Postlayout synthesis and simulation shows that the ASIC implementation will dissipate 64 nW from a 0.25-V supply, while operating at a 24-kHz frequency in a standard 45-nm CMOS technology. Compared to the previously published work, our ASIC implementation consumes 96.8% less power, while maintaining a comparable sorting accuracy. Moreover, our design can run at a higher clock frequency and uses fewer hardware resources, while achieving a 168% reduction in output data rate when comparing the raw data sampling rate and the sorted spike output rate and, yet, offers comparable spike sorting accuracy.
本文提出了一种高效的硬件架构,用于设计和实现使用在线模板匹配的 Spike 排序系统。在过去的十年中,已经提出了各种 Spike 排序算法;然而,由于它们的计算复杂性,它们可能不适合具有严格面积和功耗要求的植入式设备。我们首先在浮点和定点表示中开发了一个基于软件的 Spike 排序系统。然后,我们使用我们开发的基于软件的 Spike 排序系统来:1)研究各种神经信号处理算法,并评估它们在高效硬件实现中的可行性;2)对以前记录的神经数据进行离线处理,并提取阈值数据和 Spike 模板,用于配置我们的 Spike 排序硬件架构。本文介绍了在 Xilinx Artix-7 A200TFBG676-2 现场可编程门阵列上设计的 Spike 排序系统的特点和实现结果。设计的 Spike 排序系统的专用集成电路 (ASIC) 实现预计将占用 0.3 毫米。布局后综合和模拟表明,ASIC 实现将从 0.25V 电源消耗 64nW,而在标准 45nm CMOS 技术中以 24kHz 的频率运行。与以前发表的工作相比,我们的 ASIC 实现消耗的功率减少了 96.8%,同时保持了可比的排序准确性。此外,我们的设计可以运行在更高的时钟频率,并使用更少的硬件资源,同时在比较原始数据采样率和排序 Spike 输出率时,输出数据率降低了 168%,并且仍然提供可比的 Spike 排序准确性。