Suppr超能文献

基于梯度动量联合优化的结构先验驱动特征提取卷积神经网络图像分类

Structural prior-driven feature extraction with gradient-momentum combined optimization for convolutional neural network image classification.

机构信息

School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing, 210023, Jiangsu, China.

School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing, 210023, Jiangsu, China; Jiangsu High Technology Research Key Laboratory for Wireless Sensor Networks, Nanjing, 210023, Jiangsu, China.

出版信息

Neural Netw. 2024 Nov;179:106511. doi: 10.1016/j.neunet.2024.106511. Epub 2024 Jul 9.

Abstract

Recent image classification efforts have achieved certain success by incorporating prior information such as labels and logical rules to learn discriminative features. However, these methods overlook the variability of features, resulting in feature inconsistency and fluctuations in model parameter updates, which further contribute to decreased image classification accuracy and model instability. To address this issue, this paper proposes a novel method combining structural prior-driven feature extraction with gradient-momentum (SPGM), from the perspectives of consistent feature learning and precise parameter updates, to enhance the accuracy and stability of image classification. Specifically, SPGM leverages a structural prior-driven feature extraction (SPFE) approach to calculate gradients of multi-level features and original images to construct structural information, which is then transformed into prior knowledge to drive the network to learn features consistent with the original images. Additionally, an optimization strategy integrating gradients and momentum (GMO) is introduced, dynamically adjusting the direction and step size of parameter updates based on the angle and norm of the sum of gradients and momentum, enabling precise model parameter updates. Extensive experiments on CIFAR10 and CIFAR100 datasets demonstrate that the SPGM method significantly reduces the top-1 error rate in image classification, enhances the classification performance, and outperforms state-of-the-art methods.

摘要

最近的图像分类工作通过结合标签和逻辑规则等先验信息来学习判别特征,取得了一定的成功。然而,这些方法忽略了特征的可变性,导致特征不一致和模型参数更新的波动,从而进一步降低了图像分类的准确性和模型的不稳定性。针对这个问题,本文提出了一种新的方法,将结构先验驱动的特征提取与梯度动量(SPGM)相结合,从一致的特征学习和精确的参数更新的角度出发,提高图像分类的准确性和稳定性。具体来说,SPGM 利用结构先验驱动的特征提取(SPFE)方法来计算多层次特征和原始图像的梯度,构建结构信息,然后将其转换为先验知识,以驱动网络学习与原始图像一致的特征。此外,还引入了一种集成梯度和动量的优化策略(GMO),根据梯度和动量之和的角度和范数,动态调整参数更新的方向和步长,实现精确的模型参数更新。在 CIFAR10 和 CIFAR100 数据集上的大量实验表明,SPGM 方法显著降低了图像分类的 top-1 错误率,提高了分类性能,优于最先进的方法。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验