基于连续多尺度特征学习的图像分类模型。

Consecutive multiscale feature learning-based image classification model.

机构信息

AI Department, IT Convergence R &D Center, Vitasoft, Seoul, South Korea.

School of Computer Science and Engineering, Kyungpook National University, Daegu, 41586, South Korea.

出版信息

Sci Rep. 2023 Mar 3;13(1):3595. doi: 10.1038/s41598-023-30480-8.

DOI:10.1038/s41598-023-30480-8

PMID:36869132

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9984458/

Abstract

Extracting useful features at multiple scales is a crucial task in computer vision. The emergence of deep-learning techniques and the advancements in convolutional neural networks (CNNs) have facilitated effective multiscale feature extraction that results in stable performance improvements in numerous real-life applications. However, currently available state-of-the-art methods primarily rely on a parallel multiscale feature extraction approach, and despite exhibiting competitive accuracy, the models lead to poor results in efficient computation and low generalization on small-scale images. Moreover, efficient and lightweight networks cannot appropriately learn useful features, and this causes underfitting when training with small-scale images or datasets with a limited number of samples. To address these problems, we propose a novel image classification system based on elaborate data preprocessing steps and a carefully designed CNN model architecture. Specifically, we present a consecutive multiscale feature-learning network (CMSFL-Net) that employs a consecutive feature-learning approach based on the usage of various feature maps with different receptive fields to achieve faster training/inference and higher accuracy. In the conducted experiments using six real-life image classification datasets, including small-scale, large-scale, and limited data, the CMSFL-Net exhibits an accuracy comparable with those of existing state-of-the-art efficient networks. Moreover, the proposed system outperforms them in terms of efficiency and speed and achieves the best results in accuracy-efficiency trade-off.

摘要

在计算机视觉中，从多个尺度提取有用特征是一项关键任务。深度学习技术的出现和卷积神经网络（CNN）的进步促进了有效的多尺度特征提取，从而在许多实际应用中实现了稳定的性能提升。然而，目前现有的最先进的方法主要依赖于并行多尺度特征提取方法，尽管表现出了竞争准确性，但这些模型在高效计算和小图像的低泛化方面的效果较差。此外，高效和轻量级的网络无法适当地学习有用的特征，这导致在使用小图像或样本数量有限的数据集进行训练时出现欠拟合。为了解决这些问题，我们提出了一种基于精心设计的数据预处理步骤和 CNN 模型架构的新型图像分类系统。具体来说，我们提出了一种连续多尺度特征学习网络（CMSFL-Net），它采用基于使用具有不同感受野的各种特征图的连续特征学习方法，以实现更快的训练/推理和更高的准确性。在使用包括小、大、小数据集的六个真实图像分类数据集进行的实验中，CMSFL-Net 表现出的准确性可与现有的最先进的高效网络相媲美。此外，与其他高效系统相比，该系统在效率和速度方面表现更好，并在准确性-效率权衡中取得了最佳结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/221b/9984458/8f69393521e5/41598_2023_30480_Fig1_HTML.jpg

相似文献

Consecutive multiscale feature learning-based image classification model.

Sci Rep. 2023 Mar 3;13(1):3595. doi: 10.1038/s41598-023-30480-8.

CEModule: A Computation Efficient Module for Lightweight Convolutional Neural Networks.

IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):6069-6080. doi: 10.1109/TNNLS.2021.3133127. Epub 2023 Sep 1.

A novel end-to-end classifier using domain transferred deep convolutional neural networks for biomedical images.

Comput Methods Programs Biomed. 2017 Mar;140:283-293. doi: 10.1016/j.cmpb.2016.12.019. Epub 2017 Jan 6.

ARTD-Net: Anchor-Free Based Recyclable Trash Detection Net Using Edgeless Module.

Sensors (Basel). 2023 Mar 7;23(6):2907. doi: 10.3390/s23062907.

MFL-Net: An Efficient Lightweight Multi-Scale Feature Learning CNN for COVID-19 Diagnosis From CT Images.

IEEE J Biomed Health Inform. 2022 Nov;26(11):5355-5363. doi: 10.1109/JBHI.2022.3196489. Epub 2022 Nov 10.

Multiscale space-time-frequency feature-guided multitask learning CNN for motor imagery EEG classification.

J Neural Eng. 2021 Feb 24;18(2). doi: 10.1088/1741-2552/abd82b.

GhoMR: Multi-Receptive Lightweight Residual Modules for Hyperspectral Classification.

Sensors (Basel). 2020 Nov 29;20(23):6823. doi: 10.3390/s20236823.

Multiscale Feature-Learning with a Unified Model for Hyperspectral Image Classification.

Sensors (Basel). 2023 Sep 3;23(17):7628. doi: 10.3390/s23177628.

TBUnet: A Pure Convolutional U-Net Capable of Multifaceted Feature Extraction for Medical Image Segmentation.

J Med Syst. 2023 Nov 17;47(1):122. doi: 10.1007/s10916-023-02014-2.

Detection, segmentation, and 3D pose estimation of surgical tools using convolutional neural networks and algebraic geometry.

Med Image Anal. 2021 May;70:101994. doi: 10.1016/j.media.2021.101994. Epub 2021 Feb 7.

本文引用的文献

Exploiting dynamic spatio-temporal graph convolutional neural networks for citywide traffic flows prediction.

Neural Netw. 2022 Jan;145:233-247. doi: 10.1016/j.neunet.2021.10.021. Epub 2021 Oct 28.

One-stage CNN detector-based benthonic organisms detection with limited training dataset.

Neural Netw. 2021 Dec;144:247-259. doi: 10.1016/j.neunet.2021.08.014. Epub 2021 Aug 28.

Text Data Augmentation for Deep Learning.

J Big Data. 2021;8(1):101. doi: 10.1186/s40537-021-00492-0. Epub 2021 Jul 19.

A semi-supervised zero-shot image classification method based on soft-target.

Neural Netw. 2021 Nov;143:88-96. doi: 10.1016/j.neunet.2021.05.019. Epub 2021 May 25.

Residual wide-kernel deep convolutional auto-encoder for intelligent rotating machinery fault diagnosis with limited samples.

Neural Netw. 2021 Sep;141:133-144. doi: 10.1016/j.neunet.2021.04.003. Epub 2021 Apr 9.

PyDiNet: Pyramid Dilated Network for medical image segmentation.

Neural Netw. 2021 Aug;140:274-281. doi: 10.1016/j.neunet.2021.03.023. Epub 2021 Mar 26.

MRI-Based Brain Tumor Classification Using Ensemble of Deep Features and Machine Learning Classifiers.

Sensors (Basel). 2021 Mar 22;21(6):2222. doi: 10.3390/s21062222.

Improved deep CNNs based on Nonlinear Hybrid Attention Module for image classification.

Neural Netw. 2021 Aug;140:158-166. doi: 10.1016/j.neunet.2021.01.005. Epub 2021 Feb 12.

Generating photo-realistic training data to improve face recognition accuracy.

Neural Netw. 2021 Feb;134:86-94. doi: 10.1016/j.neunet.2020.11.008. Epub 2020 Nov 27.

nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation.

Nat Methods. 2021 Feb;18(2):203-211. doi: 10.1038/s41592-020-01008-z. Epub 2020 Dec 7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于连续多尺度特征学习的图像分类模型。

Consecutive multiscale feature learning-based image classification model.

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献