Wang Peisong, He Xiangyu, Cheng Jian
IEEE Trans Neural Netw Learn Syst. 2022 May 27;PP. doi: 10.1109/TNNLS.2022.3173498.
While binarized neural networks (BNNs) have attracted great interest, popular approaches proposed so far mainly exploit the symmetric sign function for feature binarization, i.e., they binarize activations into -1 and +1 with a fixed threshold of 0. However, whether this choice is optimal has been largely overlooked. In this work, we propose the Sparsity-inducing BNN (Si-BNN), which quantizes activations to either 0 or +1, better approximating ReLU with a single bit. We further introduce trainable thresholds into the backward function of binarization to guide gradient propagation. Our method dramatically outperforms the current state-of-the-art, narrowing the performance gap between full-precision networks and BNNs on mainstream architectures and achieving new state-of-the-art results with binarized AlexNet (Top-1 50.5%), ResNet-18 (Top-1 62.2%), and ResNet-50 (Top-1 68.3%). At inference time, Si-BNN still enjoys the high efficiency of bit-wise operations. In our implementation, the running time of binary AlexNet on a CPU is competitive with popular GPU-based deep learning frameworks.
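The core idea from the abstract can be sketched as follows: a forward pass that binarizes activations to {0, +1} (rather than the symmetric {-1, +1} of the sign function), and a backward pass that passes gradients only inside a threshold window. This is a minimal illustrative sketch, not the authors' exact formulation; the window bounds `lower` and `upper` stand in for the paper's trainable thresholds, and the function names are hypothetical.

```python
def si_binarize_forward(xs, threshold=0.0):
    """Forward pass: quantize each activation to 0 or +1,
    approximating ReLU with a single bit."""
    return [1.0 if x > threshold else 0.0 for x in xs]

def si_binarize_backward(xs, grad_out, lower=0.0, upper=1.0):
    """Backward pass sketch: a straight-through-style estimator that
    propagates the incoming gradient only where the pre-activation
    lies inside the window [lower, upper]. In the paper these bounds
    are trainable; here they are fixed for illustration."""
    return [g if lower <= x <= upper else 0.0
            for x, g in zip(xs, grad_out)]

# Example: activations outside the window get zero gradient.
acts = [-0.7, 0.2, 0.9, 1.5]
out = si_binarize_forward(acts)                      # [0.0, 1.0, 1.0, 1.0]
grads = si_binarize_backward(acts, [1.0] * 4)        # [0.0, 1.0, 1.0, 0.0]
```

In a real BNN training loop these two functions would form a custom autograd operation, with the forward producing the 0/+1 codes used by bit-wise kernels at inference.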