Department of Electrical, Electronic and Systems Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia.
Department of Electrical and Computer Engineering, Faculty of Engineering, Universitas Syiah Kuala, Kopelma Darussalam 23111, Indonesia.
Sensors (Basel). 2022 Sep 28;22(19):7384. doi: 10.3390/s22197384.
In general, most existing convolutional neural network (CNN)-based deep-learning models suffer from spatial-information loss and inadequate feature representation, because they cannot capture multiscale contextual information and because pooling operations discard semantic information. In the early layers of a CNN, the network encodes simple semantic representations, such as edges and corners, while in the later layers it encodes more complex semantic features, such as complex geometric shapes. In theory, a CNN should extract features from several levels of semantic representation, because tasks such as classification and segmentation perform better when both simple and complex feature maps are utilized. Hence, it is also crucial to embed multiscale capability throughout the network so that features at various scales can be captured optimally for the intended task. Multiscale representation enables the network to fuse low-level and high-level features, even from a restricted receptive field, to enhance the deep model's performance. The main novelty of this review is a comprehensive new taxonomy of multiscale deep-learning methods, which details several architectures implemented in existing works and their strengths. Broadly, multiscale approaches in deep-learning networks can be classified into two categories: multiscale feature learning and multiscale feature fusion. Multiscale feature learning derives feature maps by applying kernels of several sizes, so as to collect a wider range of relevant features and predict the spatial mapping of the input images. Multiscale feature fusion combines features at different resolutions to find patterns over short and long distances without requiring a very deep network. Additionally, several examples of these techniques are discussed according to their applications in satellite imagery, medical imaging, agriculture, and industrial and manufacturing systems.
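As a minimal sketch of the first category (assuming a PyTorch setting; the module and parameter names here are illustrative, not taken from the review), multiscale feature learning can be realized by running parallel convolutions with different kernel sizes over the same input and concatenating the results, in the spirit of Inception-style blocks:

```python
import torch
import torch.nn as nn

class MultiScaleConvBlock(nn.Module):
    """Multiscale feature learning: parallel convolutions with different
    kernel sizes over the same input, concatenated so the block captures
    fine and coarse context simultaneously."""

    def __init__(self, in_channels: int, branch_channels: int):
        super().__init__()
        # One branch per kernel size; padding k // 2 keeps the spatial size
        # equal so branch outputs can be concatenated along channels.
        self.branches = nn.ModuleList([
            nn.Conv2d(in_channels, branch_channels, kernel_size=k, padding=k // 2)
            for k in (1, 3, 5, 7)
        ])

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return torch.cat([branch(x) for branch in self.branches], dim=1)

x = torch.randn(1, 64, 56, 56)       # one 64-channel feature map
block = MultiScaleConvBlock(64, 32)  # four branches -> 128 output channels
print(block(x).shape)                # torch.Size([1, 128, 56, 56])
```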
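The second category, multiscale feature fusion, can likewise be sketched as a top-down merge of a coarse, semantically rich map with a finer, spatially detailed one, in the spirit of feature-pyramid networks; again, this is an illustrative assumption of the setting, not an architecture prescribed by the review:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopDownFusion(nn.Module):
    """Multiscale feature fusion: upsample a low-resolution, high-level map
    to the resolution of a high-resolution, low-level map, then sum it with
    a 1x1-projected lateral connection and smooth the result."""

    def __init__(self, low_channels: int, high_channels: int, out_channels: int):
        super().__init__()
        self.lateral = nn.Conv2d(low_channels, out_channels, kernel_size=1)
        self.project = nn.Conv2d(high_channels, out_channels, kernel_size=1)
        self.smooth = nn.Conv2d(out_channels, out_channels, kernel_size=3, padding=1)

    def forward(self, low: torch.Tensor, high: torch.Tensor) -> torch.Tensor:
        # Bring the coarse map to the fine map's resolution, then fuse.
        high_up = F.interpolate(self.project(high), size=low.shape[-2:], mode="nearest")
        return self.smooth(self.lateral(low) + high_up)

low = torch.randn(1, 256, 64, 64)   # early-layer map: fine spatial detail
high = torch.randn(1, 512, 32, 32)  # late-layer map: coarse semantics
fused = TopDownFusion(256, 512, 256)(low, high)
print(fused.shape)                  # torch.Size([1, 256, 64, 64])
```

The fused map combines short-range detail from the early layers with long-range semantic context from the late layers, which is why such fusion can match patterns over both distances without requiring a very deep network.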