基于梯度动量联合优化的结构先验驱动特征提取卷积神经网络图像分类

Structural prior-driven feature extraction with gradient-momentum combined optimization for convolutional neural network image classification.

机构信息

School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing, 210023, Jiangsu, China.

School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing, 210023, Jiangsu, China; Jiangsu High Technology Research Key Laboratory for Wireless Sensor Networks, Nanjing, 210023, Jiangsu, China.

出版信息

Neural Netw. 2024 Nov;179:106511. doi: 10.1016/j.neunet.2024.106511. Epub 2024 Jul 9.

DOI:10.1016/j.neunet.2024.106511

PMID:39146718

Abstract

Recent image classification efforts have achieved certain success by incorporating prior information such as labels and logical rules to learn discriminative features. However, these methods overlook the variability of features, resulting in feature inconsistency and fluctuations in model parameter updates, which further contribute to decreased image classification accuracy and model instability. To address this issue, this paper proposes a novel method combining structural prior-driven feature extraction with gradient-momentum (SPGM), from the perspectives of consistent feature learning and precise parameter updates, to enhance the accuracy and stability of image classification. Specifically, SPGM leverages a structural prior-driven feature extraction (SPFE) approach to calculate gradients of multi-level features and original images to construct structural information, which is then transformed into prior knowledge to drive the network to learn features consistent with the original images. Additionally, an optimization strategy integrating gradients and momentum (GMO) is introduced, dynamically adjusting the direction and step size of parameter updates based on the angle and norm of the sum of gradients and momentum, enabling precise model parameter updates. Extensive experiments on CIFAR10 and CIFAR100 datasets demonstrate that the SPGM method significantly reduces the top-1 error rate in image classification, enhances the classification performance, and outperforms state-of-the-art methods.

摘要

最近的图像分类工作通过结合标签和逻辑规则等先验信息来学习判别特征，取得了一定的成功。然而，这些方法忽略了特征的可变性，导致特征不一致和模型参数更新的波动，从而进一步降低了图像分类的准确性和模型的不稳定性。针对这个问题，本文提出了一种新的方法，将结构先验驱动的特征提取与梯度动量（SPGM）相结合，从一致的特征学习和精确的参数更新的角度出发，提高图像分类的准确性和稳定性。具体来说，SPGM 利用结构先验驱动的特征提取（SPFE）方法来计算多层次特征和原始图像的梯度，构建结构信息，然后将其转换为先验知识，以驱动网络学习与原始图像一致的特征。此外，还引入了一种集成梯度和动量的优化策略（GMO），根据梯度和动量之和的角度和范数，动态调整参数更新的方向和步长，实现精确的模型参数更新。在 CIFAR10 和 CIFAR100 数据集上的大量实验表明，SPGM 方法显著降低了图像分类的 top-1 错误率，提高了分类性能，优于最先进的方法。

相似文献

Structural prior-driven feature extraction with gradient-momentum combined optimization for convolutional neural network image classification.基于梯度动量联合优化的结构先验驱动特征提取卷积神经网络图像分类

Neural Netw. 2024 Nov;179:106511. doi: 10.1016/j.neunet.2024.106511. Epub 2024 Jul 9.

Brain tumor classification for MRI images using dual-discriminator conditional generative adversarial network.基于双鉴别器条件生成对抗网络的 MRI 图像脑肿瘤分类。

Electromagn Biol Med. 2024 Apr 2;43(1-2):81-94. doi: 10.1080/15368378.2024.2321352. Epub 2024 Mar 10.

A deep image classification model based on prior feature knowledge embedding and application in medical diagnosis.基于先验特征知识嵌入的深度图像分类模型及其在医学诊断中的应用。

Sci Rep. 2024 Jun 9;14(1):13244. doi: 10.1038/s41598-024-63818-x.

A novel adaptive momentum method for medical image classification using convolutional neural network.基于卷积神经网络的医学图像分类自适应动量方法

BMC Med Imaging. 2022 Mar 1;22(1):34. doi: 10.1186/s12880-022-00755-z.

Deep Convolution Neural Network for Malignancy Detection and Classification in Microscopic Uterine Cervix Cell Images.用于子宫颈细胞显微图像中恶性肿瘤检测与分类的深度卷积神经网络

Asian Pac J Cancer Prev. 2019 Nov 1;20(11):3447-3456. doi: 10.31557/APJCP.2019.20.11.3447.

Deep compressed sensing MRI via a gradient-enhanced fusion model.基于梯度增强融合模型的深度压缩感知磁共振成像

Med Phys. 2023 Mar;50(3):1390-1405. doi: 10.1002/mp.16164. Epub 2023 Feb 6.

A novel biomedical image indexing and retrieval system via deep preference learning.一种基于深度偏好学习的新型生物医学图像索引和检索系统。

Comput Methods Programs Biomed. 2018 May;158:53-69. doi: 10.1016/j.cmpb.2018.02.003. Epub 2018 Feb 6.

Medical Image Classification Algorithm Based on Visual Attention Mechanism-MCNN.基于视觉注意力机制的医学图像分类算法——多列卷积神经网络（MCNN）

Oxid Med Cell Longev. 2021 Feb 19;2021:6280690. doi: 10.1155/2021/6280690. eCollection 2021.

Integrating neural networks with advanced optimization techniques for accurate kidney disease diagnosis.将神经网络与先进的优化技术相结合，实现准确的肾病诊断。

Sci Rep. 2024 Sep 18;14(1):21740. doi: 10.1038/s41598-024-71410-6.

White blood cells detection and classification based on regional convolutional neural networks.基于区域卷积神经网络的白细胞检测与分类。

Med Hypotheses. 2020 Feb;135:109472. doi: 10.1016/j.mehy.2019.109472. Epub 2019 Nov 4.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于梯度动量联合优化的结构先验驱动特征提取卷积神经网络图像分类

Structural prior-driven feature extraction with gradient-momentum combined optimization for convolutional neural network image classification.

机构信息

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献