College of Oceanography and Space Informatics, China University of Petroleum (East China), Qingdao 266580, China.
Sensors (Basel). 2023 Jun 3;23(11):5323. doi: 10.3390/s23115323.
The demand for semantic segmentation of ultra-high-resolution remote sensing images is becoming increasingly stronger in various fields, posing a great challenge with concern to the accuracy requirement. Most of the existing methods process ultra-high-resolution images using downsampling or cropping, but using this approach could result in a decline in the accuracy of segmenting data, as it may cause the omission of local details or global contextual information. Some scholars have proposed the two-branch structure, but the noise introduced by the global image will interfere with the result of semantic segmentation and reduce the segmentation accuracy. Therefore, we propose a model that can achieve ultra-high-precision semantic segmentation. The model consists of a local branch, a surrounding branch, and a global branch. To achieve high precision, the model is designed with a two-level fusion mechanism. The high-resolution fine structures are captured through the local and surrounding branches in the low-level fusion process, and the global contextual information is captured from downsampled inputs in the high-level fusion process. We conducted extensive experiments and analyses using the Potsdam and Vaihingen datasets of the ISPRS. The results show that our model has extremely high precision.
对超高分辨率遥感图像进行语义分割的需求在各个领域越来越强烈,这对准确性要求提出了巨大挑战。大多数现有的方法使用下采样或裁剪来处理超高分辨率图像,但这种方法可能会导致数据分割的准确性下降,因为它可能会导致局部细节或全局上下文信息的丢失。一些学者提出了两分支结构,但全局图像引入的噪声会干扰语义分割的结果,降低分割精度。因此,我们提出了一种可以实现超高精度语义分割的模型。该模型由局部分支、环绕分支和全局分支组成。为了实现高精度,模型采用了两级融合机制。通过低水平融合过程中的局部和环绕分支捕获高分辨率的精细结构,并通过高水平融合过程中的下采样输入捕获全局上下文信息。我们使用 ISPRS 的波茨坦和瓦兴根数据集进行了广泛的实验和分析。结果表明,我们的模型具有极高的精度。