• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于人工智能应用的FinFET 6T-SRAM全数字内存计算:概述与分析

FinFET 6T-SRAM All-Digital Compute-in-Memory for Artificial Intelligence Applications: An Overview and Analysis.

作者信息

Gul Waqas, Shams Maitham, Al-Khalili Dhamin

机构信息

Department of Electronics, Carleton University, 1125 Colonel Bay Drive, Ottawa, ON K1S 5B6, Canada.

出版信息

Micromachines (Basel). 2023 Jul 31;14(8):1535. doi: 10.3390/mi14081535.

DOI:10.3390/mi14081535
PMID:37630071
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10456776/
Abstract

Artificial intelligence (AI) has revolutionized present-day life through automation and independent decision-making capabilities. For AI hardware implementations, the 6T-SRAM cell is a suitable candidate due to its performance edge over its counterparts. However, modern AI hardware such as neural networks (NNs) access off-chip data quite often, degrading the overall system performance. Compute-in-memory (CIM) reduces off-chip data access transactions. One CIM approach is based on the mixed-signal domain, but it suffers from limited bit precision and signal margin issues. An alternate emerging approach uses the all-digital signal domain that provides better signal margins and bit precision; however, it will be at the expense of hardware overhead. We have analyzed digital signal domain CIM silicon-verified 6T-SRAM CIM solutions, after classifying them as SRAM-based accelerators, i.e., near-memory computing (NMC), and custom SRAM-based CIM, i.e., in-memory-computing (IMC). We have focused on multiply and accumulate (MAC) as the most frequent operation in convolution neural networks (CNNs) and compared state-of-the-art implementations. Neural networks with low weight precision, i.e., <12b, show lower accuracy but higher power efficiency. An input precision of 8b achieves implementation requirements. The maximum performance reported is 7.49 TOPS at 330 MHz, while custom SRAM-based performance has shown a maximum of 5.6 GOPS at 100 MHz. The second part of this article analyzes the FinFET 6T-SRAM as one of the critical components in determining overall performance of an AI computing system. We have investigated the FinFET 6T-SRAM cell performance and limitations as dictated by the FinFET technology-specific parameters, such as sizing, threshold voltage (V), supply voltage (V), and process and environmental variations. The HD FinFET 6T-SRAM cell shows 32% lower read access time and 1.09 times better leakage power as compared with the HC cell configuration. The minimum achievable supply voltage is 600 mV without utilization of any read- or write-assist scheme for all cell configurations, while temperature variations show noise margin deviation of up to 22% of the nominal values.

摘要

人工智能(AI)通过自动化和独立决策能力彻底改变了当今的生活。对于人工智能硬件实现而言,6T-SRAM单元因其相较于同类产品的性能优势而成为合适的选择。然而,诸如神经网络(NN)等现代人工智能硬件经常访问片外数据,这会降低整个系统的性能。内存计算(CIM)减少了片外数据访问事务。一种CIM方法基于混合信号域,但它存在位精度有限和信号裕度问题。另一种新兴方法使用全数字信号域,该方法提供了更好的信号裕度和位精度;然而,这将以硬件开销为代价。在将数字信号域CIM硅验证的6T-SRAM CIM解决方案分类为基于SRAM的加速器(即近内存计算(NMC))和基于定制SRAM的CIM(即内存内计算(IMC))之后,我们对其进行了分析。我们将重点放在乘法累加(MAC)上,因为它是卷积神经网络(CNN)中最频繁的操作,并比较了最先进的实现方式。权重精度较低(即<12b)的神经网络显示出较低的精度,但具有较高的功率效率。8b的输入精度可满足实现要求。报告的最高性能在330 MHz时为7.49 TOPS,而基于定制SRAM的性能在100 MHz时最高为5.6 GOPS。本文的第二部分分析了FinFET 6T-SRAM,它是决定人工智能计算系统整体性能的关键组件之一。我们研究了由FinFET技术特定参数(如尺寸、阈值电压(V)、电源电压(V)以及工艺和环境变化)所决定的FinFET 6T-SRAM单元的性能和局限性。与HC单元配置相比,HD FinFET 6T-SRAM单元的读取访问时间降低了32%,泄漏功率提高了1.09倍。对于所有单元配置,在不使用任何读取或写入辅助方案的情况下,可实现的最低电源电压为600 mV,而温度变化会使噪声裕度偏差高达标称值的22%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/76367210caae/micromachines-14-01535-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/eaff291c8a21/micromachines-14-01535-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/42419f1f13af/micromachines-14-01535-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/268c905a7b2e/micromachines-14-01535-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/209f578e138a/micromachines-14-01535-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/7cbe3fc0722b/micromachines-14-01535-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/3d922d5eb373/micromachines-14-01535-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/b60b0211dc7c/micromachines-14-01535-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/c0d015d7e2a3/micromachines-14-01535-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/5d6ac34c3717/micromachines-14-01535-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/9c4356bdb705/micromachines-14-01535-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/4d2da9b2c78a/micromachines-14-01535-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/5a90ce23ddee/micromachines-14-01535-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/d1db803c51da/micromachines-14-01535-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/b7f6be913207/micromachines-14-01535-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/76367210caae/micromachines-14-01535-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/eaff291c8a21/micromachines-14-01535-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/42419f1f13af/micromachines-14-01535-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/268c905a7b2e/micromachines-14-01535-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/209f578e138a/micromachines-14-01535-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/7cbe3fc0722b/micromachines-14-01535-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/3d922d5eb373/micromachines-14-01535-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/b60b0211dc7c/micromachines-14-01535-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/c0d015d7e2a3/micromachines-14-01535-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/5d6ac34c3717/micromachines-14-01535-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/9c4356bdb705/micromachines-14-01535-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/4d2da9b2c78a/micromachines-14-01535-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/5a90ce23ddee/micromachines-14-01535-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/d1db803c51da/micromachines-14-01535-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/b7f6be913207/micromachines-14-01535-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b37/10456776/76367210caae/micromachines-14-01535-g015.jpg

相似文献

1
FinFET 6T-SRAM All-Digital Compute-in-Memory for Artificial Intelligence Applications: An Overview and Analysis.用于人工智能应用的FinFET 6T-SRAM全数字内存计算:概述与分析
Micromachines (Basel). 2023 Jul 31;14(8):1535. doi: 10.3390/mi14081535.
2
SRAM Cell Design Challenges in Modern Deep Sub-Micron Technologies: An Overview.现代深亚微米技术中的静态随机存取存储器(SRAM)单元设计挑战:概述
Micromachines (Basel). 2022 Aug 17;13(8):1332. doi: 10.3390/mi13081332.
3
SRAM-Based CIM Architecture Design for Event Detection.基于静态随机存储器的事件检测 CIM 体系结构设计。
Sensors (Basel). 2022 Oct 16;22(20):7854. doi: 10.3390/s22207854.
4
DAM SRAM CORE: An Efficient High-Speed and Low-Power CIM SRAM CORE Design for Feature Extraction Convolutional Layers in Binary Neural Networks.DAM SRAM核心:一种用于二进制神经网络中特征提取卷积层的高效高速低功耗CIM SRAM核心设计。
Micromachines (Basel). 2024 Apr 30;15(5):617. doi: 10.3390/mi15050617.
5
Fusion of memristor and digital compute-in-memory processing for energy-efficient edge computing.用于节能边缘计算的忆阻器与数字内存计算处理融合
Science. 2024 Apr 19;384(6693):325-332. doi: 10.1126/science.adf5538. Epub 2024 Apr 18.
6
A High-Precision Voltage-Quantization-Based Current-Mode Computing-in-Memory SRAM.一种基于高精度电压量化的电流模式内存计算SRAM。
Micromachines (Basel). 2023 Nov 29;14(12):2180. doi: 10.3390/mi14122180.
7
A 0.3 V PNN Based 10T SRAM with Pulse Control Based Read-Assist and Write Data-Aware Schemes for Low Power Applications.一种基于0.3V脉冲神经网络的10T静态随机存取存储器,采用基于脉冲控制的读辅助和写数据感知方案,用于低功耗应用。
Sensors (Basel). 2021 Oct 2;21(19):6591. doi: 10.3390/s21196591.
8
Hybrid Silicon Substrate FinFET-Metal Insulator Metal (MIM) Memristor Based Sense Amplifier Design for the Non-Volatile SRAM Cell.用于非易失性静态随机存取存储器(SRAM)单元的基于混合硅衬底鳍式场效应晶体管-金属绝缘体金属(MIM)忆阻器的读出放大器设计
Micromachines (Basel). 2023 Jan 17;14(2):232. doi: 10.3390/mi14020232.
9
Simulation of a Fully Digital Computing-in-Memory for Non-Volatile Memory for Artificial Intelligence Edge Applications.用于人工智能边缘应用的非易失性存储器全数字内存计算模拟
Micromachines (Basel). 2023 May 31;14(6):1175. doi: 10.3390/mi14061175.
10
Performance Degradation in Static Random Access Memory of 10 nm Node FinFET Owing to Displacement Defects.由于位移缺陷导致的10纳米节点鳍式场效应晶体管静态随机存取存储器性能退化
Micromachines (Basel). 2023 May 22;14(5):1090. doi: 10.3390/mi14051090.

本文引用的文献

1
SRAM Cell Design Challenges in Modern Deep Sub-Micron Technologies: An Overview.现代深亚微米技术中的静态随机存取存储器(SRAM)单元设计挑战:概述
Micromachines (Basel). 2022 Aug 17;13(8):1332. doi: 10.3390/mi13081332.