Sharma Anal Prakash, Rao K Akhilesh, Somappa Laxmeesha
IEEE Trans Biomed Circuits Syst. 2025 Apr;19(2):244-256. doi: 10.1109/TBCAS.2024.3505423. Epub 2025 Apr 2.
This work presents the development of on-chip machine learning (ML) classifiers for implantable neuromodulation system-on-chips (SoCs), aimed at detecting epileptic seizures for closed-loop neuromodulation applications. Tree-based classifiers have gained prominence due to low on-chip memory requirements for binary classification. This work focuses on optimizing hardware performance and associated trade-offs from two fronts, namely, (a) implementation of the Neural Tree (NT) classifier using model compression techniques and (b) design of feature extraction engine (FEE) using FIR filters and time-division multiplexed hardware optimizations. We provide insights into how model compression techniques of Neural Networks like weight pruning and weight sharing can be exploited to reduce the memory requirement of Neural Tree inference hardware. Both these techniques effectively reduce non-zero weights and therefore help to reduce memory requirements. We also detail the choice of feature extraction engine (FEE) hardware to extract temporal and spectral features and the relevant area-power-attenuation trade-offs for spectral feature extraction. The end-to-end hardware comprising the FEE, the Neural Tree classifier and serial peripherals are tested on a Zynq-7000 series SoC using pre-recorded patient data. The SoC-based evaluation platform allows rapid testing of various model optimizations on hardware using AXI protocol. The entire system, trained on data from the CHB-MIT scalp EEG database, achieved a sensitivity of 95.7% and a specificity of 94.3%, with an on-chip memory of 0.59 kB. Implementing the design in a 65nm CMOS process resulted in a worst-case power of 174 W and an area of 0.16 mm. These findings along with the optimizations mark significant progress toward energy-efficient, scalable neuromodulation systems capable of real-time neurological disorder prediction.
这项工作展示了用于可植入神经调节片上系统(SoC)的片上机器学习(ML)分类器的开发,旨在检测癫痫发作以用于闭环神经调节应用。基于树的分类器由于二进制分类的片上内存需求低而备受关注。这项工作专注于从两个方面优化硬件性能及相关权衡,即:(a)使用模型压缩技术实现神经树(NT)分类器,以及(b)使用FIR滤波器和时分复用硬件优化设计特征提取引擎(FEE)。我们深入探讨了如何利用神经网络的模型压缩技术,如权重修剪和权重共享,来降低神经树推理硬件的内存需求。这两种技术都有效地减少了非零权重,因此有助于降低内存需求。我们还详细介绍了用于提取时间和频谱特征的特征提取引擎(FEE)硬件的选择,以及频谱特征提取的相关面积 - 功耗 - 衰减权衡。使用预先记录的患者数据在Zynq - 7000系列SoC上测试了包括FEE、神经树分类器和串行外设的端到端硬件。基于SoC的评估平台允许使用AXI协议在硬件上快速测试各种模型优化。整个系统在来自CHB - MIT头皮脑电图数据库的数据上进行训练,实现了95.7%的灵敏度和94.3%的特异性,片上内存为0.59 kB。在65nm CMOS工艺中实现该设计,最坏情况下功耗为174 W,面积为0.16 mm²。这些发现以及优化标志着朝着能够实时预测神经疾病的节能、可扩展神经调节系统取得了重大进展。