School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, China.
School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, China; Shenzhen Research Institute of Big Data, Shenzhen, China.
Neural Netw. 2023 Sep;166:683-691. doi: 10.1016/j.neunet.2023.07.042. Epub 2023 Aug 5.
Quantization approximates a floating-point deep network model with a low-bit-width counterpart, thereby accelerating inference and reducing computation. Zero-shot quantization, which aims to quantize a model without access to the original data, can be achieved by synthesizing data that fits the real data distribution. However, zero-shot quantization has been observed to underperform post-training quantization with real data, for two primary reasons: 1) an ordinary generator struggles to produce diverse synthetic data because it lacks the long-range information needed to attend to global features, and 2) synthetic images are optimized to match the statistics of real data, which yields weak intra-class heterogeneity and limited feature richness. To overcome these problems, we propose a novel deep network quantizer called long-range zero-shot generative deep network quantization (LRQ). Technically, we propose a long-range generator (LRG) that learns long-range information rather than only simple local features; to incorporate more global features into the synthetic data, the generator uses long-range attention built on large-kernel convolution. In addition, we present an adversarial margin add (AMA) module that enforces intra-class angular enlargement between each feature vector and its class center. The AMA module forms an adversarial process that increases the convergence difficulty of the loss function, opposing the training objective of the original loss. Furthermore, to transfer knowledge from the full-precision network, we employ decoupled knowledge distillation. Extensive experiments demonstrate that LRQ outperforms competing methods.
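The abstract does not detail the internals of the LRG; the following is a minimal PyTorch sketch of one plausible reading, assuming a large-kernel-attention design in which a depthwise convolution with a wide kernel produces a spatial attention map that modulates generator features. The class name LargeKernelAttention, the kernel size 13, and the 1x1 projection layers are illustrative assumptions, not the paper's specification.

```python
import torch
import torch.nn as nn

class LargeKernelAttention(nn.Module):
    """Hypothetical long-range attention block for a generator: a large-kernel
    depthwise convolution gives every position a wide receptive field
    (long-range information), and its output modulates the input features."""

    def __init__(self, channels: int, kernel_size: int = 13):
        super().__init__()
        self.proj_in = nn.Conv2d(channels, channels, kernel_size=1)
        # Depthwise large-kernel convolution: long-range spatial context at a
        # fraction of the cost of a dense convolution with the same kernel.
        self.dw_large = nn.Conv2d(channels, channels, kernel_size,
                                  padding=kernel_size // 2, groups=channels)
        self.proj_out = nn.Conv2d(channels, channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        attn = self.proj_out(self.dw_large(self.proj_in(x)))
        return x * attn  # attention-style feature modulation


if __name__ == "__main__":
    block = LargeKernelAttention(64)
    feat = torch.randn(2, 64, 32, 32)  # stand-in for a generator feature map
    print(block(feat).shape)           # torch.Size([2, 64, 32, 32])
```

The depthwise factorization is a common way to make large kernels affordable; it preserves the per-channel wide receptive field the abstract attributes to long-range attention while keeping parameter count near that of an ordinary 3x3 block.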
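The abstract describes AMA only at a high level: it enlarges the intra-class angle between a feature vector and its class center, making the classification loss harder to satisfy. Below is a minimal sketch of one way to realize that effect, assuming an additive angular margin in the style of margin-based softmax losses. The function name ama_logits and the margin and scale values are hypothetical, not the paper's definition.

```python
import torch
import torch.nn.functional as F

def ama_logits(features: torch.Tensor, class_centers: torch.Tensor,
               labels: torch.Tensor, margin: float = 0.2,
               scale: float = 16.0) -> torch.Tensor:
    """Hypothetical adversarial-margin-add step: add an angular margin to the
    angle between each feature and its own class center. This enlarges the
    intra-class angle, so the subsequent cross-entropy loss becomes harder to
    minimize -- the adversarial effect described in the abstract."""
    f = F.normalize(features, dim=1)       # unit-norm features, shape (B, D)
    w = F.normalize(class_centers, dim=1)  # unit-norm centers,  shape (C, D)
    cos = f @ w.t()                        # cosine(theta) to every center
    theta = torch.acos(cos.clamp(-1 + 1e-7, 1 - 1e-7))
    # Enlarge only the angle to the ground-truth center: cos(theta + margin).
    onehot = F.one_hot(labels, num_classes=w.size(0)).bool()
    cos_margined = torch.where(onehot, torch.cos(theta + margin), cos)
    return scale * cos_margined            # feed into F.cross_entropy

# Usage sketch: loss = F.cross_entropy(ama_logits(feats, centers, y), y)
```

Because the margin is applied only to the ground-truth class, the network must pull each feature closer to its center than the margin-free objective would require, which matches the abstract's claim that AMA raises the convergence difficulty of the original loss.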