
Training high-performance and large-scale deep neural networks with full 8-bit integers.

Affiliations

Department of Precision Instrument, Center for Brain Inspired Computing Research, Tsinghua University, Beijing 100084, China; Beijing Innovation Center for Future Chip, Tsinghua University, Beijing 100084, China.

Department of Electrical and Computer Engineering, University of California, Santa Barbara, CA 93106, USA.

Publication Information

Neural Netw. 2020 May;125:70-82. doi: 10.1016/j.neunet.2019.12.027. Epub 2020 Jan 15.

DOI: 10.1016/j.neunet.2019.12.027
PMID: 32070857
Abstract

Deep neural network (DNN) quantization, which converts floating-point (FP) data in the network to integers (INT), is an effective way to shrink the model size for memory saving and to simplify the operations for compute acceleration. Recently, research on DNN quantization has developed from inference to training, laying a foundation for online training on accelerators. However, existing schemes, which leave batch normalization (BN) untouched during training, are mostly incomplete quantizations that still adopt high-precision FP in some parts of the data paths. Currently, there is no solution that uses only low-bit-width INT data throughout the whole training process of large-scale DNNs with acceptable accuracy. In this work, by decomposing all the computation steps in DNNs and fusing three special quantization functions to satisfy the different precision requirements, we propose a unified complete quantization framework termed "WAGEUBN" that quantizes all data paths in DNNs, including W (Weights), A (Activation), G (Gradient), E (Error), U (Update), and BN. Moreover, the Momentum optimizer is also quantized to realize a completely quantized framework. Experiments on ResNet18/34/50 models demonstrate that WAGEUBN can achieve competitive accuracy on the ImageNet dataset. For the first time, the study of quantization in large-scale DNNs is advanced to the full 8-bit INT level. In this way, all the operations in training and inference can be bit-wise operations, pushing towards faster processing speed, decreased memory cost, and higher energy efficiency. Our thorough quantization framework has great potential for future efficient portable devices with online learning ability.
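As a rough illustration of what the abstract means by converting FP data to INT (and not the paper's actual WAGEUBN quantization functions, which use separate schemes for W, A, G, E, U, and BN), the sketch below shows a generic symmetric linear quantizer: a float tensor is scaled, rounded, and clipped into signed 8-bit integers, and a single scale factor allows approximate recovery. The scale choice and rounding mode here are illustrative assumptions only.

```python
import numpy as np

def quantize_int8(x: np.ndarray):
    """Symmetric linear quantization of a float tensor to signed INT8.

    Generic illustration of an FP -> INT8 mapping; not the specific
    quantization functions defined in the WAGEUBN paper.
    """
    # Pick the scale so the largest magnitude maps to the INT8 limit (127).
    max_abs = float(np.max(np.abs(x)))
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = np.clip(np.round(x / scale), -128, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover an approximate float tensor from INT8 values and the scale."""
    return q.astype(np.float32) * scale

# Usage: quantize a small random weight matrix and inspect the rounding error.
w = np.random.randn(4, 4).astype(np.float32)
q, s = quantize_int8(w)
w_hat = dequantize_int8(q, s)
print("max abs error:", np.max(np.abs(w - w_hat)))
```

In the framework described above, this basic idea is extended so that weights, activations, gradients, errors, updates, BN, and the momentum buffers each get quantization functions matched to their precision requirements; the sketch only covers the elementary FP-to-INT8 step.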


Similar Articles

1. Training high-performance and large-scale deep neural networks with full 8-bit integers.
Neural Netw. 2020 May;125:70-82. doi: 10.1016/j.neunet.2019.12.027. Epub 2020 Jan 15.
2. Quantization Friendly MobileNet (QF-MobileNet) Architecture for Vision Based Applications on Embedded Platforms.
Neural Netw. 2021 Apr;136:28-39. doi: 10.1016/j.neunet.2020.12.022. Epub 2020 Dec 29.
3. Exploiting Retraining-Based Mixed-Precision Quantization for Low-Cost DNN Accelerator Design.
IEEE Trans Neural Netw Learn Syst. 2021 Jul;32(7):2925-2938. doi: 10.1109/TNNLS.2020.3008996. Epub 2021 Jul 6.
4. L1-Norm Batch Normalization for Efficient Training of Deep Neural Networks.
IEEE Trans Neural Netw Learn Syst. 2019 Jul;30(7):2043-2051. doi: 10.1109/TNNLS.2018.2876179. Epub 2018 Nov 9.
5. Unsupervised Network Quantization via Fixed-Point Factorization.
IEEE Trans Neural Netw Learn Syst. 2021 Jun;32(6):2706-2720. doi: 10.1109/TNNLS.2020.3007749. Epub 2021 Jun 2.
6. Random and Adversarial Bit Error Robustness: Energy-Efficient and Secure DNN Accelerators.
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3632-3647. doi: 10.1109/TPAMI.2022.3181972.
7. A Novel Low-Bit Quantization Strategy for Compressing Deep Neural Networks.
Comput Intell Neurosci. 2020 Feb 18;2020:7839064. doi: 10.1155/2020/7839064. eCollection 2020.
8. Diverse Sample Generation: Pushing the Limit of Generative Data-Free Quantization.
IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):11689-11706. doi: 10.1109/TPAMI.2023.3272925. Epub 2023 Sep 5.
9. Optimization-Based Post-Training Quantization With Bit-Split and Stitching.
IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):2119-2135. doi: 10.1109/TPAMI.2022.3159369. Epub 2023 Jan 6.
10. Rethinking the Importance of Quantization Bias, Toward Full Low-Bit Training.
IEEE Trans Image Process. 2022;31:7006-7019. doi: 10.1109/TIP.2022.3216776. Epub 2022 Nov 14.

Cited By

1. Speed up integer-arithmetic-only inference via bit-shifting.
Sci Rep. 2025 May 28;15(1):18765. doi: 10.1038/s41598-025-02544-4.
2. Continual Learning with Deep Neural Networks in Physiological Signal Data: A Survey.
Healthcare (Basel). 2024 Jan 9;12(2):155. doi: 10.3390/healthcare12020155.
3. End-to-End Implementation of Various Hybrid Neural Networks on a Cross-Paradigm Neuromorphic Chip.
Front Neurosci. 2021 Feb 2;15:615279. doi: 10.3389/fnins.2021.615279. eCollection 2021.