基于编解码器架构的水下图像语义分割方法。

Semantic segmentation method of underwater images based on encoder-decoder architecture.

机构信息

Department of Mechanical Engineering, College of Field Engineering and Army Engineering University, PLA, Nanjing, China.

出版信息

PLoS One. 2022 Aug 25;17(8):e0272666. doi: 10.1371/journal.pone.0272666. eCollection 2022.

DOI:10.1371/journal.pone.0272666

PMID:36006956

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9409518/

Abstract

With the exploration and development of marine resources, deep learning is more and more widely used in underwater image processing. However, the quality of the original underwater images is so low that traditional semantic segmentation methods obtain poor segmentation results, such as blurred target edges, insufficient segmentation accuracy, and poor regional boundary segmentation effects. To solve these problems, this paper proposes a semantic segmentation method for underwater images. Firstly, the image enhancement based on multi-spatial transformation is performed to improve the quality of the original images, which is not common in other advanced semantic segmentation methods. Then, the densely connected hybrid atrous convolution effectively expands the receptive field and slows down the speed of resolution reduction. Next, the cascaded atrous convolutional spatial pyramid pooling module integrates boundary features of different scales to enrich target details. Finally, the context information aggregation decoder fuses the features of the shallow network and the deep network to extract rich contextual information, which greatly reduces information loss. The proposed method was evaluated on RUIE, HabCam UID, and UIEBD. Compared with the state-of-the-art semantic segmentation algorithms, the proposed method has advantages in segmentation integrity, location accuracy, boundary clarity, and detail in subjective perception. On the objective data, the proposed method achieves the highest MIOU of 68.3 and OA of 79.4, and it has a low resource consumption. Besides, the ablation experiment also verifies the effectiveness of our method.

摘要

随着海洋资源的开发和利用，深度学习在水下图像处理中得到了越来越广泛的应用。然而，原始水下图像的质量很低，传统的语义分割方法得到的分割结果较差，例如目标边缘模糊、分割精度不足、区域边界分割效果差等。为了解决这些问题，本文提出了一种水下图像语义分割方法。首先，对图像进行基于多空间变换的增强处理，以提高原始图像的质量，这在其他先进的语义分割方法中并不常见。然后，密集连接混合空洞卷积有效地扩大了感受野，减缓了分辨率降低的速度。接下来，级联空洞卷积空间金字塔池化模块集成了不同尺度的边界特征，以丰富目标细节。最后，上下文信息聚合解码器融合浅层网络和深层网络的特征，提取丰富的上下文信息，从而大大减少了信息的丢失。该方法在 RUIE、HabCam UID 和 UIEBD 数据集上进行了评估。与现有的语义分割算法相比，该方法在分割完整性、位置准确性、边界清晰度和细节主观感知方面具有优势。在客观数据上，该方法的 MIOU 最高可达 68.3，OA 最高可达 79.4，资源消耗较低。此外，消融实验也验证了该方法的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b0d/9409518/9732c7ed6ec7/pone.0272666.g001.jpg

相似文献

Semantic segmentation method of underwater images based on encoder-decoder architecture.

PLoS One. 2022 Aug 25;17(8):e0272666. doi: 10.1371/journal.pone.0272666. eCollection 2022.

A multiple-channel and atrous convolution network for ultrasound image segmentation.

Med Phys. 2020 Dec;47(12):6270-6285. doi: 10.1002/mp.14512. Epub 2020 Oct 18.

Deep Neural Network-Based Semantic Segmentation of Microvascular Decompression Images.

Sensors (Basel). 2021 Feb 7;21(4):1167. doi: 10.3390/s21041167.

Multi-Scale Deep Neural Network Based on Dilated Convolution for Spacecraft Image Segmentation.

Sensors (Basel). 2022 Jun 1;22(11):4222. doi: 10.3390/s22114222.

D-SAT: dual semantic aggregation transformer with dual attention for medical image segmentation.

Phys Med Biol. 2023 Dec 22;69(1). doi: 10.1088/1361-6560/acf2e5.

Fusion network based on the dual attention mechanism and atrous spatial pyramid pooling for automatic segmentation in retinal vessel images.

J Opt Soc Am A Opt Image Sci Vis. 2022 Aug 1;39(8):1393-1402. doi: 10.1364/JOSAA.459912.

TGDAUNet: Transformer and GCNN based dual-branch attention UNet for medical image segmentation.

Comput Biol Med. 2023 Dec;167:107583. doi: 10.1016/j.compbiomed.2023.107583. Epub 2023 Oct 21.

SC-Net: Symmetrical conical network for colorectal pathology image segmentation.

Comput Methods Programs Biomed. 2024 May;248:108119. doi: 10.1016/j.cmpb.2024.108119. Epub 2024 Mar 13.

A multibranch and multiscale neural network based on semantic perception for multimodal medical image fusion.

Sci Rep. 2024 Jul 30;14(1):17609. doi: 10.1038/s41598-024-68183-3.

Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation.

Neural Netw. 2024 Oct;178:106546. doi: 10.1016/j.neunet.2024.106546. Epub 2024 Jul 17.

引用本文的文献

Underwater Fish Segmentation Algorithm Based on Improved PSPNet Network.

Sensors (Basel). 2023 Sep 25;23(19):8072. doi: 10.3390/s23198072.

本文引用的文献

An Underwater Image Enhancement Benchmark Dataset and Beyond.

IEEE Trans Image Process. 2019 Nov 28. doi: 10.1109/TIP.2019.2955241.

Color Balance and Fusion for Underwater Image Enhancement.

IEEE Trans Image Process. 2018 Jan;27(1):379-393. doi: 10.1109/TIP.2017.2759252. Epub 2017 Oct 5.

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.

IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.

IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.

IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.

Guided image filtering.

IEEE Trans Pattern Anal Mach Intell. 2013 Jun;35(6):1397-409. doi: 10.1109/TPAMI.2012.213.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于编解码器架构的水下图像语义分割方法。

Semantic segmentation method of underwater images based on encoder-decoder architecture.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献