Graduate School of Information Science and Technology, Hokkaido University, N-14, W-9, Kita-ku, Sapporo 060-0814, Japan.
Faculty of Information Science and Technology, Hokkaido University, N-14, W-9, Kita-ku, Sapporo 060-0814, Japan.
Sensors (Basel). 2022 Mar 17;22(6):2330. doi: 10.3390/s22062330.
In this paper, we present a novel defect detection model based on an improved U-Net architecture. As a semantic segmentation task, the defect detection task has the problems of background-foreground imbalance, multi-scale targets, and feature similarity between the background and defects in the real-world data. Conventionally, general convolutional neural network (CNN)-based networks mainly focus on natural image tasks, which are insensitive to the problems in our task. The proposed method has a network design for multi-scale segmentation based on the U-Net architecture including an atrous spatial pyramid pooling (ASPP) module and an inception module, and can detect various types of defects compared to conventional simple CNN-based methods. Through the experiments using a real-world subway tunnel image dataset, the proposed method showed a better performance than that of general semantic segmentation including state-of-the-art methods. Additionally, we showed that our method can achieve excellent detection balance among multi-scale defects.
在本文中,我们提出了一种基于改进的 U-Net 架构的新型缺陷检测模型。作为语义分割任务,缺陷检测任务具有背景-前景不平衡、多尺度目标以及真实数据中背景和缺陷之间特征相似性等问题。传统上,基于通用卷积神经网络(CNN)的网络主要侧重于自然图像任务,而对我们任务中的问题不敏感。所提出的方法具有基于 U-Net 架构的多尺度分割网络设计,包括空洞空间金字塔池化(ASPP)模块和 inception 模块,与传统的简单基于 CNN 的方法相比,可以检测各种类型的缺陷。通过使用真实的地铁隧道图像数据集进行的实验,所提出的方法比包括最先进方法在内的一般语义分割方法表现出更好的性能。此外,我们表明我们的方法可以在多尺度缺陷之间实现出色的检测平衡。