Geng Xu, Gao Jinxiong, Zhang Yonghui, Xu Dingtan
School of Information and Communication Engineering, Hainan University, Haikou, 570228, China.
Sci Rep. 2024 Mar 6;14(1):5570. doi: 10.1038/s41598-024-55942-5.
The increasing interest in filter pruning of convolutional neural networks stems from its inherent ability to effectively compress and accelerate these networks. Current filter pruning methods fall mainly into two schools: norm-based and relation-based. Both aim to selectively remove the least important filters according to predefined rules, but they inadequately account for filter diversity and for the impact of batch normalization (BN) layers on the input to the next layer, which can degrade performance. To address these limitations of norm-based and relation-based methods, this study conducts empirical analyses that expose their drawbacks and then introduces a groundbreaking complex hybrid weighted pruning method. By jointly evaluating the norms of individual filters, the correlations between them, and the parameters of the BN layer, our method robustly identifies and prunes the most redundant filters, thereby avoiding significant decreases in network performance. We conducted comprehensive, direct pruning experiments on ResNets of different depths using the publicly available image classification datasets ImageNet and CIFAR-10. The results demonstrate the significant efficacy of our approach: in particular, when applied to ResNet-50 on the ImageNet dataset, it reduces floating-point operations by 53.5% with a performance loss of only 0.6%.
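To make the hybrid scoring idea concrete, the following is a minimal PyTorch sketch of how per-filter norms, inter-filter correlations, and BN scaling factors might be combined into a single importance score. The function name hybrid_filter_scores, the alpha weighting, and the exact combination formula are illustrative assumptions, not the paper's actual criterion.

# Hypothetical sketch: combine norm-based importance, inter-filter
# redundancy (cosine similarity), and the BN scaling factor (gamma)
# into one per-filter score. Filters with the smallest scores would
# be pruned. The weighting scheme here is an assumption for
# illustration, not the authors' published formula.
import torch

def hybrid_filter_scores(conv_weight: torch.Tensor,
                         bn_gamma: torch.Tensor,
                         alpha: float = 0.5) -> torch.Tensor:
    # conv_weight: (out_channels, in_channels, kH, kW)
    # bn_gamma:    (out_channels,) scaling factors of the following BN layer
    flat = conv_weight.flatten(start_dim=1)        # one row per filter
    norms = flat.norm(p=2, dim=1)                  # norm-based importance
    unit = flat / (norms.unsqueeze(1) + 1e-8)
    cos = unit @ unit.t()                          # pairwise cosine similarity
    # Mean absolute similarity to the other filters: a filter that is
    # highly similar to the rest carries little diverse information.
    redundancy = (cos.abs().sum(dim=1) - 1.0) / (flat.size(0) - 1)
    diversity = 1.0 - redundancy
    # Fold in |gamma|, since BN scaling gates the filter's contribution
    # to the next layer's input.
    score = (alpha * norms / norms.max()
             + (1.0 - alpha) * diversity) * bn_gamma.abs() / bn_gamma.abs().max()
    return score

# Usage: keep the top-k filters of a layer by score (e.g. prune 25%).
w = torch.randn(64, 32, 3, 3)
g = torch.randn(64)
keep = hybrid_filter_scores(w, g).topk(48).indices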