School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing, 210023, Jiangsu, China.
School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing, 210023, Jiangsu, China; Jiangsu High Technology Research Key Laboratory for Wireless Sensor Networks, Nanjing, 210023, Jiangsu, China.
Neural Netw. 2024 Nov;179:106511. doi: 10.1016/j.neunet.2024.106511. Epub 2024 Jul 9.
Recent image classification efforts have achieved certain success by incorporating prior information such as labels and logical rules to learn discriminative features. However, these methods overlook the variability of features, resulting in feature inconsistency and fluctuations in model parameter updates, which further contribute to decreased image classification accuracy and model instability. To address this issue, this paper proposes a novel method combining structural prior-driven feature extraction with gradient-momentum (SPGM), from the perspectives of consistent feature learning and precise parameter updates, to enhance the accuracy and stability of image classification. Specifically, SPGM leverages a structural prior-driven feature extraction (SPFE) approach to calculate gradients of multi-level features and original images to construct structural information, which is then transformed into prior knowledge to drive the network to learn features consistent with the original images. Additionally, an optimization strategy integrating gradients and momentum (GMO) is introduced, dynamically adjusting the direction and step size of parameter updates based on the angle and norm of the sum of gradients and momentum, enabling precise model parameter updates. Extensive experiments on CIFAR10 and CIFAR100 datasets demonstrate that the SPGM method significantly reduces the top-1 error rate in image classification, enhances the classification performance, and outperforms state-of-the-art methods.
最近的图像分类工作通过结合标签和逻辑规则等先验信息来学习判别特征,取得了一定的成功。然而,这些方法忽略了特征的可变性,导致特征不一致和模型参数更新的波动,从而进一步降低了图像分类的准确性和模型的不稳定性。针对这个问题,本文提出了一种新的方法,将结构先验驱动的特征提取与梯度动量(SPGM)相结合,从一致的特征学习和精确的参数更新的角度出发,提高图像分类的准确性和稳定性。具体来说,SPGM 利用结构先验驱动的特征提取(SPFE)方法来计算多层次特征和原始图像的梯度,构建结构信息,然后将其转换为先验知识,以驱动网络学习与原始图像一致的特征。此外,还引入了一种集成梯度和动量的优化策略(GMO),根据梯度和动量之和的角度和范数,动态调整参数更新的方向和步长,实现精确的模型参数更新。在 CIFAR10 和 CIFAR100 数据集上的大量实验表明,SPGM 方法显著降低了图像分类的 top-1 错误率,提高了分类性能,优于最先进的方法。