Department of Computer Engineering, Inha University, Incheon. 082, Republic of Korea.
Comput Intell Neurosci. 2019 Aug 1;2019:8527819. doi: 10.1155/2019/8527819. eCollection 2019.
With the widespread use of deep learning methods, semantic segmentation has achieved great improvements in recent years. However, many researchers have pointed out that with multiple uses of convolution and pooling operations, great information loss would occur in the extraction processes. To solve this problem, various operations or network architectures have been suggested to make up for the loss of information. We observed a trend in many studies to design a network as a symmetric type, with both parts representing the "encoding" and "decoding" stages. By "upsampling" operations in the "decoding" stage, feature maps are constructed in a certain way that would more or less make up for the losses in previous layers. In this paper, we focus on upsampling operations, make a detailed analysis, and compare current methods used in several famous neural networks. We also combine the knowledge on image restoration and design a new upsampled layer (or operation) named the TGV upsampling algorithm. We successfully replaced upsampling layers in the previous research with our new method. We found that our model can better preserve detailed textures and edges of feature maps and can, on average, achieve 1.4-2.3% improved accuracy compared to the original models.
随着深度学习方法的广泛应用,语义分割在近年来取得了巨大的进展。然而,许多研究人员指出,在卷积和池化操作的多次使用过程中,信息会大量丢失。为了解决这个问题,人们提出了各种操作或网络架构来弥补信息的损失。我们在许多研究中观察到一种趋势,即将网络设计为对称类型,其中两部分分别代表“编码”和“解码”阶段。通过“解码”阶段的“上采样”操作,以某种方式构建特征图,在一定程度上可以弥补前几层的损失。在本文中,我们专注于上采样操作,进行了详细的分析,并比较了几个著名神经网络中当前使用的方法。我们还结合图像恢复的知识,设计了一种新的上采样层(或操作),称为 TGV 上采样算法。我们成功地用新方法替换了之前研究中的上采样层。我们发现,我们的模型可以更好地保留特征图的细节纹理和边缘,与原始模型相比,平均可以提高 1.4-2.3%的精度。