IEEE Trans Pattern Anal Mach Intell. 2022 Aug;44(8):4035-4051. doi: 10.1109/TPAMI.2021.3066410. Epub 2022 Jul 1.
We study network pruning, which aims to remove redundant channels/kernels and hence speed up the inference of deep networks. Existing pruning methods either train from scratch with sparsity constraints or minimize the reconstruction error between the feature maps of the pre-trained models and the compressed ones. Both strategies suffer from limitations: the former is computationally expensive and difficult to converge, while the latter optimizes the reconstruction error but ignores the discriminative power of channels. In this paper, we propose a simple yet effective method called discrimination-aware channel pruning (DCP) to choose the channels that actually contribute to the discriminative power. To this end, we first introduce additional discrimination-aware losses into the network to increase the discriminative power of the intermediate layers. Next, we select the most discriminative channels for each layer by considering the discrimination-aware loss and the reconstruction error simultaneously. We then formulate channel pruning as a sparsity-inducing optimization problem with a convex objective and propose a greedy algorithm to solve the resultant problem. Note that a channel (a 3D tensor) often consists of a set of kernels (each a 2D matrix). Besides the redundancy among channels, some kernels within a channel may also be redundant and fail to contribute to the discriminative power of the network, resulting in kernel-level redundancy. To address this issue, we propose a discrimination-aware kernel pruning (DKP) method to further compress deep networks by removing redundant kernels. To avoid manually determining the pruning rate for each layer, we propose two adaptive stopping conditions to automatically determine the number of selected channels/kernels. In practice, the proposed adaptive stopping conditions tend to yield more efficient models with better performance.
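The core idea above — greedily selecting channels that reduce a joint objective combining the reconstruction error and an auxiliary discrimination-aware loss, with an adaptive stopping condition — can be illustrated with a minimal sketch. This is not the authors' implementation: the joint objective is simplified to two least-squares targets (`y_feat` for reconstruction, `y_disc` as a stand-in for the discrimination-aware loss), and all names and the tolerance-based stopping rule are illustrative assumptions.

```python
import numpy as np

def greedy_channel_selection(X, y_feat, y_disc, lam=1.0, max_channels=None, tol=1e-4):
    """Greedy sketch of discrimination-aware channel selection.

    X        : (n, C) per-channel feature contributions (toy stand-in for feature maps)
    y_feat   : (n,) reconstruction target (pre-trained model's response)
    y_disc   : (n,) target of the auxiliary discrimination-aware loss
    lam      : trade-off between reconstruction and discrimination terms
    tol      : adaptive stopping condition -- stop when the relative
               improvement of the joint objective falls below tol

    At each step, the channel whose inclusion most reduces
        ||X_S w - y_feat||^2 + lam * ||X_S w - y_disc||^2
    is added to the selected set S.
    """
    n, C = X.shape
    if max_channels is None:
        max_channels = C

    def joint_obj(S):
        # Joint objective for a candidate channel subset S,
        # solved in closed form via stacked least squares.
        if not S:
            return float(y_feat @ y_feat + lam * (y_disc @ y_disc))
        Xs = X[:, S]
        A = np.vstack([Xs, np.sqrt(lam) * Xs])
        b = np.concatenate([y_feat, np.sqrt(lam) * y_disc])
        w, *_ = np.linalg.lstsq(A, b, rcond=None)
        r = A @ w - b
        return float(r @ r)

    selected = []
    prev = joint_obj(selected)
    while len(selected) < max_channels:
        # Greedy step: try every unselected channel, keep the best.
        best_c, best_val = None, prev
        for c in range(C):
            if c in selected:
                continue
            val = joint_obj(selected + [c])
            if val < best_val:
                best_c, best_val = c, val
        # Adaptive stopping: quit when no channel helps enough.
        if best_c is None or (prev - best_val) < tol * max(prev, 1e-12):
            break
        selected.append(best_c)
        prev = best_val
    return selected
```

In this toy setting, if only a few channels actually carry the signal, the greedy loop picks exactly those and the stopping condition halts selection before redundant channels are added — mirroring how the adaptive conditions avoid a hand-tuned per-layer pruning rate.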
Extensive experiments on both image classification and face recognition demonstrate the effectiveness of our methods. For example, on ILSVRC-12, the resultant ResNet-50 model with a 30 percent reduction in channels even outperforms the baseline model by 0.36 percent in Top-1 accuracy. We also deploy the pruned models on a smartphone equipped with a Qualcomm Snapdragon 845 processor: the pruned MobileNetV1 and MobileNetV2 achieve 1.93× and 1.42× inference acceleration on the device, respectively, with negligible performance degradation. The source code and the pre-trained models are available at https://github.com/SCUT-AILab/DCP.