Suppr超能文献

神经网络剪枝的跨层重要性评估。

Cross-layer importance evaluation for neural network pruning.

机构信息

Department of Control Science and Engineering, College of Electronic and Information Engineering, Tongji University, Shanghai 201804, China.

出版信息

Neural Netw. 2024 Nov;179:106496. doi: 10.1016/j.neunet.2024.106496. Epub 2024 Jul 3.

Abstract

Filter pruning has achieved remarkable success in reducing memory consumption and speeding up inference for convolutional neural networks (CNNs). Some prior works, such as heuristic methods, attempted to search for suitable sparse structures during the pruning process, which may be expensive and time-consuming. In this paper, an efficient cross-layer importance evaluation (CIE) method is proposed to automatically calculate proportional relationships among convolutional layers. Firstly, every layer is pruned separately by grid sampling way to obtain the accuracy of the model for all sampling points. And then, contribution matrices are built to describe the importance of each layer to model accuracy. Finally, the binary search algorithm is used to search the optimal sparse structure under a target pruned value. Extensive experiments on multiple representative image classification tasks demonstrate that proposed method acquires better compression performance under a little time cost compared to existing pruning algorithms. For instance, it reduces more than 50% FLOPs with only a small loss of 0.93% and 0.43% in the top-1 and top-5 accuracy for ResNet50, respectively. At the cost of only 0.24% accuracy loss, the pruned VGG19 model parameters are successfully compressed by 27.23× and the throughput has increased by 2.46×. On the whole, CIE has an excellent effect on the deployment and application of the CNNs model in edge device in terms of efficiency and accuracy.

摘要

滤波器剪枝在减少卷积神经网络(CNN)的内存消耗和加速推理方面取得了显著的成功。一些先前的工作,如启发式方法,试图在剪枝过程中搜索合适的稀疏结构,这可能是昂贵和耗时的。在本文中,提出了一种有效的跨层重要性评估(CIE)方法,用于自动计算卷积层之间的比例关系。首先,通过网格采样方法对每个层进行单独剪枝,以获得模型在所有采样点的准确性。然后,构建贡献矩阵来描述每个层对模型准确性的重要性。最后,使用二进制搜索算法在目标剪枝值下搜索最佳稀疏结构。在多个具有代表性的图像分类任务上的广泛实验表明,与现有剪枝算法相比,所提出的方法在花费很少的时间成本下获得了更好的压缩性能。例如,对于 ResNet50,它将 FLOPs 减少了 50%以上,而在 top-1 和 top-5 准确性方面仅损失了 0.93%和 0.43%。在仅损失 0.24%准确性的情况下,成功地将剪枝后的 VGG19 模型参数压缩了 27.23 倍,吞吐量增加了 2.46 倍。总的来说,CIE 在效率和准确性方面对 CNNs 模型在边缘设备中的部署和应用具有极好的效果。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验