Yu Lintao, Yao Anni, Duan Jin
College of Electronic Information Engineering, Changchun University of Science and Technology, Changchun 130022, China.
Entropy (Basel). 2023 Jun 2;25(6):891. doi: 10.3390/e25060891.
In this paper, we propose a method that uses the idea of decoupling and unites edge information for semantic segmentation. We build a new dual-stream CNN architecture that fully considers the interaction between the body and the edge of the object, and our method significantly improves the segmentation performance of small objects and object boundaries. The dual-stream CNN architecture mainly consists of a body-stream module and an edge-stream module, which process the feature map of the segmented object into two parts with low coupling: body features and edge features. The body stream warps the image features by learning the flow-field offset, warps the body pixels toward object inner parts, completes the generation of the body features, and enhances the object's inner consistency. In the generation of edge features, the current state-of-the-art model processes information such as color, shape, and texture under a single network, which will ignore the recognition of important information. Our method separates the edge-processing branch in the network, i.e., the edge stream. The edge stream processes information in parallel with the body stream and effectively eliminates the noise of useless information by introducing a non-edge suppression layer to emphasize the importance of edge information. We validate our method on the large-scale public dataset Cityscapes, and our method greatly improves the segmentation performance of hard-to-segment objects and achieves state-of-the-art result. Notably, the method in this paper can achieve 82.6% mIoU on the Cityscapes with only fine-annotated data.
在本文中,我们提出了一种利用解耦思想并结合边缘信息进行语义分割的方法。我们构建了一种新的双流卷积神经网络(CNN)架构,该架构充分考虑了物体主体与边缘之间的相互作用,并且我们的方法显著提高了小物体和物体边界的分割性能。双流CNN架构主要由主体流模块和边缘流模块组成,它们将分割物体的特征图处理为耦合度低的两部分:主体特征和边缘特征。主体流通过学习流场偏移来扭曲图像特征,将主体像素向物体内部扭曲,完成主体特征的生成,并增强物体的内部一致性。在边缘特征生成方面,当前最先进的模型在单个网络下处理颜色、形状和纹理等信息,这会忽略对重要信息的识别。我们的方法在网络中分离出边缘处理分支,即边缘流。边缘流与主体流并行处理信息,并通过引入非边缘抑制层来强调边缘信息的重要性,从而有效消除无用信息的噪声。我们在大规模公共数据集Cityscapes上验证了我们的方法,并且我们的方法极大地提高了难分割物体的分割性能,并取得了当前最优的结果。值得注意的是,本文的方法仅使用精细标注的数据就能在Cityscapes上达到82.6%的平均交并比(mIoU)。