Chen Jiadong, Wu Yincheng, Yang Yin, Wen Shiping, Shi Kaibo, Bermak Amine, Huang Tingwen
IEEE Trans Neural Netw Learn Syst. 2022 Apr;33(4):1779-1790. doi: 10.1109/TNNLS.2020.3044047. Epub 2022 Apr 4.
Recently, there has been a surge of interest in applying memristors to hardware implementations of deep neural networks due to various desirable properties of the memristor, such as nonvolativity, multivalue, and nanosize. Most existing neural network circuit designs, however, are based on generic frameworks that are not optimized for memristors. Furthermore, to the best of our knowledge, there are no existing efficient memristor-based implementations of complex neural network operators, such as deconvolutions and squeeze-and-excitation (SE) blocks, which are critical for achieving high accuracy in common medical image analysis applications, such as semantic segmentation. This article proposes convolution-kernel first (CKF), an efficient scheme for designing memristor-based fully convolutional neural networks (FCNs). Compared with existing neural network circuits, CKF enables effective parameter pruning, which significantly reduces circuit power consumption. Furthermore, CKF includes the novel, memristor-optimized implementations of deconvolution layers and SE blocks. Simulation results on real medical image segmentation tasks confirm that CKF obtains up to 56.2% reduction in terms of computations and 33.62-W reduction in terms of power consumption in the circuit after weight pruning while retaining high accuracy on the test set. Moreover, the pruning results can be applied directly to existing circuits without any modification for the corresponding system.
最近,由于忆阻器具有诸如非易失性、多值性和纳米尺寸等各种理想特性,将忆阻器应用于深度神经网络的硬件实现引起了人们极大的兴趣。然而,大多数现有的神经网络电路设计都是基于未针对忆阻器进行优化的通用框架。此外,据我们所知,目前还没有基于忆阻器的高效实现复杂神经网络算子的方法,如反卷积和挤压激励(SE)模块,而这些对于在诸如语义分割等常见医学图像分析应用中实现高精度至关重要。本文提出了卷积核优先(CKF),这是一种设计基于忆阻器的全卷积神经网络(FCN)的有效方案。与现有的神经网络电路相比,CKF能够进行有效的参数修剪,从而显著降低电路功耗。此外,CKF还包括反卷积层和SE模块的新颖的、针对忆阻器优化的实现。在实际医学图像分割任务上的仿真结果证实,在权值修剪后,CKF在计算量方面最多可减少56.2%,在电路功耗方面可减少33.62瓦,同时在测试集上保持高精度。此外,修剪结果可以直接应用于现有电路,而无需对相应系统进行任何修改。