用于低功耗混合信号卷积神经网络处理芯片的模拟卷积算子电路

Asghar Malik Summair, Arslan Saad, Kim HyungWon

Department of Electronics, College of Electrical and Computer Engineering, Chungbuk National University, Cheongju 28644, Republic of Korea.

Department of Electrical and Computer Engineering, COMSATS University Islamabad, Abbottabad Campus, University Road, Tobe Camp, Abbottabad 22044, Pakistan.

Sensors (Basel). 2023 Dec 4;23(23):9612. doi: 10.3390/s23239612.

In this paper, we propose a compact and low-power mixed-signal approach to implementing convolutional operators that are often responsible for most of the chip area and power consumption of Convolutional Neural Network (CNN) processing chips. The convolutional operators consist of several multiply-and-accumulate (MAC) units. MAC units are the primary components that process convolutional layers and fully connected layers of CNN models. Analog implementation of MAC units opens a new paradigm for realizing low-power CNN processing chips, benefiting from less power and area consumption. The proposed mixed-signal convolutional operator comprises low-power binary-weighted current steering digital-to-analog conversion (DAC) circuits and accumulation capacitors. Compared with a conventional binary-weighted DAC, the proposed circuit benefits from optimum accuracy, smaller area, and lower power consumption due to its symmetric design. The proposed convolutional operator takes as input a set of 9-bit digital input feature data and weight parameters of the convolutional filter. It then calculates the convolutional filter's result and accumulates the resulting voltage on capacitors. In addition, the convolutional operator employs a novel charge-sharing technique to process negative MAC results. We propose an analog max-pooling circuit that instantly selects the maximum input voltage. To demonstrate the performance of the proposed mixed-signal convolutional operator, we implemented a CNN processing chip consisting of 3 analog convolutional operators, with each operator processing a 3 × 3 kernel. This chip contains 27 MAC circuits, an analog max-pooling, and an analog-to-digital conversion (ADC) circuit. The mixed-signal CNN processing chip is implemented using a CMOS 55 nm process, which occupies a silicon area of 0.0559 mm and consumes an average power of 540.6 μW. The proposed mixed-signal CNN processing chip offers an area reduction of 84.21% and an energy reduction of 91.85% compared with a conventional digital CNN processing chip. Moreover, another CNN processing chip is implemented with more analog convolutional operators to demonstrate the operation and structure of an example convolutional layer of a CNN model. Therefore, the proposed analog convolutional operator can be adapted in various CNN models as an alternative to digital counterparts.

在本文中，我们提出了一种紧凑且低功耗的混合信号方法来实现卷积算子，这些卷积算子通常占据卷积神经网络（CNN）处理芯片的大部分芯片面积和功耗。卷积算子由多个乘法累加（MAC）单元组成。MAC单元是处理CNN模型卷积层和全连接层的主要组件。MAC单元的模拟实现为实现低功耗CNN处理芯片开辟了一种新范式，因为其功耗和面积消耗更低。所提出的混合信号卷积算子包括低功耗二进制加权电流转向数模转换（DAC）电路和累加电容。与传统的二进制加权DAC相比，所提出的电路因其对称设计而具有最佳精度、更小面积和更低功耗的优势。所提出的卷积算子将一组9位数字输入特征数据和卷积滤波器的权重参数作为输入。然后，它计算卷积滤波器的结果，并将产生的电压累加到电容器上。此外，卷积算子采用一种新颖的电荷共享技术来处理负MAC结果。我们提出了一种模拟最大池化电路，可即时选择最大输入电压。为了展示所提出的混合信号卷积算子的性能，我们实现了一个由3个模拟卷积算子组成的CNN处理芯片，每个算子处理一个3×3内核。该芯片包含27个MAC电路、一个模拟最大池化电路和一个模数转换（ADC）电路。该混合信号CNN处理芯片采用CMOS 55纳米工艺实现，占用硅面积0.0559平方毫米，平均功耗为540.6微瓦。与传统数字CNN处理芯片相比，所提出的混合信号CNN处理芯片的面积减少了84.21%，能耗降低了91.85%。此外，还实现了另一个具有更多模拟卷积算子的CNN处理芯片，以展示CNN模型示例卷积层的操作和结构。因此，所提出的模拟卷积算子可适用于各种CNN模型，作为数字对应算子的替代方案。