Yook Chan-Gi, Kim Jung Nam, Kim Yoon, Shim Wonbo
Department of Electrical and Information Engineering, Seoul National University of Science and Technology, Seoul 01811, Republic of Korea.
School of Electrical and Computer Engineering, University of Seoul, Seoul 02504, Republic of Korea.
Micromachines (Basel). 2023 Sep 7;14(9):1753. doi: 10.3390/mi14091753.
The existing von Neumann architecture for artificial intelligence (AI) computations suffers from excessive power consumption and memory bottlenecks. As an alternative, compute-in-memory (CIM) technology has been emerging. Among various CIM device candidates, split-gate NOR flash offers advantages such as a high density and low on-state current, enabling low-power operation, and benefiting from a high level of technological maturity. To achieve high energy efficiency and high accuracy in CIM inference chips, it is necessary to optimize device design by targeting low power consumption at the device level and surpassing baseline accuracy at the system level. In split-gate NOR flash, significant factors that can cause CIM inference accuracy drop are the device conductance variation, caused by floating gate charge variation, and a low on-off current ratio. Conductance variation generally has a trade-off relationship with the on-current, which greatly affects CIM dynamic power consumption. In this paper, we propose strategies for designing optimal devices by adjusting oxide thickness and other structural parameters. As a result of setting to 13.4 nm, to 4.6 nm and setting other parameters to optimal points, the design achieves erase on-current below 2 μA, program on-current below 10 pA, and off-current below 1 pA, while maintaining an inference accuracy of over 92%.
现有的用于人工智能(AI)计算的冯·诺依曼架构存在功耗过高和内存瓶颈问题。作为一种替代方案,内存计算(CIM)技术正在兴起。在各种CIM设备候选方案中,分裂栅极NOR闪存具有诸如高密度和低导通状态电流等优点,能够实现低功耗运行,并受益于高度的技术成熟度。为了在CIM推理芯片中实现高能效和高精度,有必要通过在器件层面以低功耗为目标以及在系统层面超越基线精度来优化器件设计。在分裂栅极NOR闪存中,可能导致CIM推理精度下降的重要因素是由浮栅电荷变化引起的器件电导变化以及低开关电流比。电导变化通常与导通电流存在权衡关系,这对CIM动态功耗有很大影响。在本文中,我们提出了通过调整氧化物厚度和其他结构参数来设计最优器件的策略。通过将设置为13.4纳米,设置为4.6纳米并将其他参数设置为最佳点,该设计实现了擦除导通电流低于2微安,编程导通电流低于10皮安,关断电流低于1皮安,同时保持超过92%的推理精度。