Department of Mechanical Engineering, College of Field Engineering and Army Engineering University, PLA, Nanjing, China.
PLoS One. 2022 Aug 25;17(8):e0272666. doi: 10.1371/journal.pone.0272666. eCollection 2022.
With the exploration and development of marine resources, deep learning is more and more widely used in underwater image processing. However, the quality of the original underwater images is so low that traditional semantic segmentation methods obtain poor segmentation results, such as blurred target edges, insufficient segmentation accuracy, and poor regional boundary segmentation effects. To solve these problems, this paper proposes a semantic segmentation method for underwater images. Firstly, the image enhancement based on multi-spatial transformation is performed to improve the quality of the original images, which is not common in other advanced semantic segmentation methods. Then, the densely connected hybrid atrous convolution effectively expands the receptive field and slows down the speed of resolution reduction. Next, the cascaded atrous convolutional spatial pyramid pooling module integrates boundary features of different scales to enrich target details. Finally, the context information aggregation decoder fuses the features of the shallow network and the deep network to extract rich contextual information, which greatly reduces information loss. The proposed method was evaluated on RUIE, HabCam UID, and UIEBD. Compared with the state-of-the-art semantic segmentation algorithms, the proposed method has advantages in segmentation integrity, location accuracy, boundary clarity, and detail in subjective perception. On the objective data, the proposed method achieves the highest MIOU of 68.3 and OA of 79.4, and it has a low resource consumption. Besides, the ablation experiment also verifies the effectiveness of our method.
随着海洋资源的开发和利用,深度学习在水下图像处理中得到了越来越广泛的应用。然而,原始水下图像的质量很低,传统的语义分割方法得到的分割结果较差,例如目标边缘模糊、分割精度不足、区域边界分割效果差等。为了解决这些问题,本文提出了一种水下图像语义分割方法。首先,对图像进行基于多空间变换的增强处理,以提高原始图像的质量,这在其他先进的语义分割方法中并不常见。然后,密集连接混合空洞卷积有效地扩大了感受野,减缓了分辨率降低的速度。接下来,级联空洞卷积空间金字塔池化模块集成了不同尺度的边界特征,以丰富目标细节。最后,上下文信息聚合解码器融合浅层网络和深层网络的特征,提取丰富的上下文信息,从而大大减少了信息的丢失。该方法在 RUIE、HabCam UID 和 UIEBD 数据集上进行了评估。与现有的语义分割算法相比,该方法在分割完整性、位置准确性、边界清晰度和细节主观感知方面具有优势。在客观数据上,该方法的 MIOU 最高可达 68.3,OA 最高可达 79.4,资源消耗较低。此外,消融实验也验证了该方法的有效性。