Li Gongyang, Liu Zhi, Zeng Dan, Lin Weisi, Ling Haibin
IEEE Trans Cybern. 2023 Jan;53(1):526-538. doi: 10.1109/TCYB.2022.3162945. Epub 2022 Dec 23.
Salient object detection (SOD) in optical remote sensing images (RSIs), or RSI-SOD, is an emerging topic in understanding optical RSIs. However, because optical RSIs differ from natural scene images (NSIs), directly applying NSI-SOD methods to optical RSIs does not achieve satisfactory results. In this article, we propose a novel adjacent context coordination network (ACCoNet) to explore the coordination of adjacent features in an encoder-decoder architecture for RSI-SOD. Specifically, ACCoNet consists of three parts: 1) an encoder; 2) adjacent context coordination modules (ACCoMs); and 3) a decoder. As the key component of ACCoNet, ACCoM activates the salient regions of the encoder's output features and transmits them to the decoder. ACCoM contains a local branch and two adjacent branches to coordinate the multilevel features simultaneously. The local branch highlights the salient regions in an adaptive way, while the adjacent branches introduce global information from adjacent levels to enhance salient regions. In addition, to strengthen the classic decoder block (i.e., several cascaded convolutional layers), we add two bifurcations to it and propose a bifurcation-aggregation block (BAB) to capture contextual information in the decoder. Extensive experiments on two benchmark datasets demonstrate that the proposed ACCoNet outperforms 22 state-of-the-art methods under nine evaluation metrics, and runs at up to 81 fps on a single NVIDIA Titan X GPU. The code and results of our method are available at https://github.com/MathLee/ACCoNet.
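To make the data flow described for ACCoM concrete, the following is a minimal toy sketch in plain Python; it is not the authors' implementation, which is available at the repository above. Everything in it is an assumption made for illustration: the function names (`local_branch`, `adjacent_branch`, `coordinate`), the use of flat lists in place of feature maps, the sigmoid self-gating standing in for the adaptive local branch, the mean of an adjacent level standing in for its global information, the summation used to combine branches, and the handling of the first and last levels with a single adjacent branch.

```python
import math


def sigmoid(x):
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def local_branch(feature):
    # Assumed stand-in for adaptive highlighting:
    # each value is scaled by its own sigmoid (self-gating).
    return [v * sigmoid(v) for v in feature]


def adjacent_branch(local_feature, adjacent_feature):
    # Assumed stand-in for "global information of an adjacent level":
    # the mean of the adjacent feature is turned into a scalar gate
    # that modulates the local feature.
    context = sum(adjacent_feature) / len(adjacent_feature)
    gate = sigmoid(context)
    return [v * gate for v in local_feature]


def coordinate(features):
    """Combine each level with its previous and next levels.

    `features` is a list of per-level feature lists (lengths may differ,
    mimicking different resolutions). Boundary levels have only one
    neighbour, so they use a single adjacent branch (an assumption).
    """
    outputs = []
    for i, feature in enumerate(features):
        out = local_branch(feature)
        for j in (i - 1, i + 1):
            if 0 <= j < len(features):
                enhanced = adjacent_branch(feature, features[j])
                # Branch outputs are summed here purely for illustration.
                out = [a + b for a, b in zip(out, enhanced)]
        outputs.append(out)
    return outputs


# Three toy "encoder levels" with different lengths.
features = [[1.0, -1.0], [0.0, 0.0, 0.0, 0.0], [2.0]]
outputs = coordinate(features)
```

Each output keeps the length of its own level, so in this sketch the coordinated features could be handed level by level to a decoder, matching the encoder, ACCoM, decoder ordering given in the abstract.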