更快的 SCDNet：具有分割连接和灵活空洞卷积的实时语义分割网络。

Faster SCDNet: Real-Time Semantic Segmentation Network with Split Connection and Flexible Dilated Convolution.

机构信息

School of Computer & Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China.

出版信息

Sensors (Basel). 2023 Mar 14;23(6):3112. doi: 10.3390/s23063112.

DOI:10.3390/s23063112

PMID:36991823

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10057038/

Abstract

Recently, semantic segmentation has been widely applied in various realistic scenarios. Many semantic segmentation backbone networks use various forms of dense connection to improve the efficiency of gradient propagation in the network. They achieve excellent segmentation accuracy but lack inference speed. Therefore, we propose a backbone network SCDNet with a dual path structure and higher speed and accuracy. Firstly, we propose a split connection structure, which is a streamlined lightweight backbone with a parallel structure to increase inference speed. Secondly, we introduce a flexible dilated convolution using different dilation rates so that the network can have richer receptive fields to perceive objects. Then, we propose a three-level hierarchical module to effectively balance the feature maps with multiple resolutions. Finally, a refined flexible and lightweight decoder is utilized. Our work achieves a trade-off of accuracy and speed on the Cityscapes and Camvid datasets. Specifically, we obtain a 36% improvement in FPS and a 0.7% improvement in mIoU on the Cityscapes test set.

摘要

最近，语义分割在各种现实场景中得到了广泛应用。许多语义分割骨干网络使用各种形式的密集连接来提高网络中梯度传播的效率。它们实现了优异的分割精度，但缺乏推理速度。因此，我们提出了一种具有双路径结构的骨干网络 SCDNet，以实现更高的速度和精度。首先，我们提出了一种分割连接结构，这是一种流线型的轻量级骨干网络，具有并行结构，可提高推理速度。其次，我们引入了一种灵活的扩张卷积，使用不同的扩张率，使网络能够具有更丰富的感受野来感知物体。然后，我们提出了一个三级分层模块，以有效地平衡具有多个分辨率的特征图。最后，使用了一个精细化的灵活轻量级解码器。我们的工作在 Cityscapes 和 Camvid 数据集上实现了准确性和速度之间的权衡。具体来说，我们在 Cityscapes 测试集上获得了 36%的帧率提高和 0.7%的 mIoU 提高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6156/10057038/f44d1d462731/sensors-23-03112-g001.jpg

相似文献

Faster SCDNet: Real-Time Semantic Segmentation Network with Split Connection and Flexible Dilated Convolution.

Sensors (Basel). 2023 Mar 14;23(6):3112. doi: 10.3390/s23063112.

MFAFNet: A Lightweight and Efficient Network with Multi-Level Feature Adaptive Fusion for Real-Time Semantic Segmentation.

Sensors (Basel). 2023 Jul 13;23(14):6382. doi: 10.3390/s23146382.

A lightweight multi-dimension dynamic convolutional network for real-time semantic segmentation.

Front Neurorobot. 2022 Dec 15;16:1075520. doi: 10.3389/fnbot.2022.1075520. eCollection 2022.

A Fast Attention-Guided Hierarchical Decoding Network for Real-Time Semantic Segmentation.

Sensors (Basel). 2023 Dec 24;24(1):95. doi: 10.3390/s24010095.

Multi-Level and Multi-Scale Feature Aggregation Network for Semantic Segmentation in Vehicle-Mounted Scenes.

Sensors (Basel). 2021 May 9;21(9):3270. doi: 10.3390/s21093270.

Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation.

Neural Netw. 2021 May;137:188-199. doi: 10.1016/j.neunet.2021.01.021. Epub 2021 Jan 30.

Rethinking 1D convolution for lightweight semantic segmentation.

Front Neurorobot. 2023 Feb 9;17:1119231. doi: 10.3389/fnbot.2023.1119231. eCollection 2023.

Lightweight semantic segmentation network with configurable context and small object attention.

Front Comput Neurosci. 2023 Oct 23;17:1280640. doi: 10.3389/fncom.2023.1280640. eCollection 2023.

Based on cross-scale fusion attention mechanism network for semantic segmentation for street scenes.

Front Neurorobot. 2023 Aug 31;17:1204418. doi: 10.3389/fnbot.2023.1204418. eCollection 2023.

LMFFNet: A Well-Balanced Lightweight Network for Fast and Accurate Semantic Segmentation.

IEEE Trans Neural Netw Learn Syst. 2023 Jun;34(6):3205-3219. doi: 10.1109/TNNLS.2022.3176493. Epub 2023 Jun 1.

本文引用的文献

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.

IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.

IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

更快的 SCDNet：具有分割连接和灵活空洞卷积的实时语义分割网络。

Faster SCDNet: Real-Time Semantic Segmentation Network with Split Connection and Flexible Dilated Convolution.

机构信息

School of Computer & Communication Engineering, University of Science and Technology Beijing, Beijing 100083, China.

出版信息

Sensors (Basel). 2023 Mar 14;23(6):3112. doi: 10.3390/s23063112.

DOI:10.3390/s23063112

PMID:36991823

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10057038/

Abstract

摘要

更快的 SCDNet：具有分割连接和灵活空洞卷积的实时语义分割网络。

Faster SCDNet: Real-Time Semantic Segmentation Network with Split Connection and Flexible Dilated Convolution.

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

更快的 SCDNet：具有分割连接和灵活空洞卷积的实时语义分割网络。

Faster SCDNet: Real-Time Semantic Segmentation Network with Split Connection and Flexible Dilated Convolution.

机构信息

出版信息

相似文献

本文引用的文献