Global-Guided Selective Context Network for Scene Parsing.

Publication information

IEEE Trans Neural Netw Learn Syst. 2022 Apr;33(4):1752-1764. doi: 10.1109/TNNLS.2020.3043808. Epub 2022 Apr 4.

DOI:10.1109/TNNLS.2020.3043808

Abstract

Recent studies on semantic segmentation exploit contextual information to address inconsistent parsing predictions within large objects and missed small objects. However, they apply multilevel contextual information equally across pixels, overlooking that different pixels may demand different levels of context. Motivated by this intuition, we propose a novel global-guided selective context network (GSCNet) that adaptively selects contextual information to improve scene parsing. Specifically, we introduce two global-guided modules, the global-guided global module (GGM) and the global-guided local module (GLM), which select global context (GC) and local context (LC) for pixels, respectively. Given an input feature map, GGM jointly uses the feature map and its globally pooled feature to learn a global contextual demand, based on which per-pixel GC is selected. GLM, in turn, adopts the low-level feature from the adjacent stage as LC and jointly models the input feature map, its globally pooled feature, and LC to generate a local contextual demand, based on which per-pixel LC is selected. Furthermore, we combine these two modules into a selective context block (SCB) and insert SCBs at different levels of the network to propagate contextual information in a coarse-to-fine manner. Finally, we conduct extensive experiments to verify the effectiveness of the proposed model and achieve state-of-the-art performance on four challenging scene parsing data sets: Cityscapes, ADE20K, PASCAL Context, and COCO Stuff. In particular, GSCNet-101 obtains 82.6% on the Cityscapes test set without using coarse data and 56.22% on the ADE20K test set.
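The abstract does not give the exact operations inside GGM and GLM, so the following is only a minimal sketch of the general idea of per-pixel context selection, under stated assumptions. It assumes the "contextual demand" is a scalar sigmoid gate per pixel, that the gate is a linear function of the pixel feature, the globally pooled feature, and (for the local case) the low-level feature, and that the selected context is added back to the pixel feature. The function names, the weight vectors, and the additive fusion are all hypothetical and are not taken from the paper.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def dot(weights, values):
    return sum(w * v for w, v in zip(weights, values))


def global_pool(feature_map):
    """Average every channel over all pixels.

    feature_map is a list of pixels; each pixel is a list of C floats.
    """
    num_pixels = len(feature_map)
    num_channels = len(feature_map[0])
    return [sum(p[c] for p in feature_map) / num_pixels
            for c in range(num_channels)]


def select_global_context(feature_map, w_pixel, w_global, bias=0.0):
    """Hypothetical GGM-like gate (assumption, not the paper's layer).

    Each pixel gets a demand in (0, 1) computed from its own feature and
    the globally pooled feature; the pooled feature is then added to the
    pixel, scaled by that demand.
    """
    pooled = global_pool(feature_map)
    global_term = dot(w_global, pooled)  # identical for every pixel
    outputs, demands = [], []
    for pixel in feature_map:
        demand = sigmoid(dot(w_pixel, pixel) + global_term + bias)
        demands.append(demand)
        outputs.append([x + demand * g for x, g in zip(pixel, pooled)])
    return outputs, demands


def select_local_context(feature_map, local_map, w_pixel, w_global,
                         w_local, bias=0.0):
    """Hypothetical GLM-like gate (assumption, not the paper's layer).

    local_map holds a low-level feature per pixel, assumed to be already
    resized to the same number of pixels and channels as feature_map.
    The demand also depends on that local feature, which is what gets
    added to the pixel.
    """
    pooled = global_pool(feature_map)
    global_term = dot(w_global, pooled)
    outputs, demands = [], []
    for pixel, local in zip(feature_map, local_map):
        score = dot(w_pixel, pixel) + global_term + dot(w_local, local) + bias
        demand = sigmoid(score)
        demands.append(demand)
        outputs.append([x + demand * l for x, l in zip(pixel, local)])
    return outputs, demands
```

In this sketch a pixel whose gate is close to 0 keeps its own feature almost unchanged, while a pixel whose gate is close to 1 receives nearly the full context vector, which is one simple way to realize "different pixels may demand different levels of context". A real implementation would typically learn the gate with convolutional layers over C×H×W tensors rather than fixed weight vectors over flat pixel lists.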

Similar articles

Global Aggregation Then Local Distribution for Scene Parsing.

IEEE Trans Image Process. 2021;30:6829-6842. doi: 10.1109/TIP.2021.3099366.

Scene Segmentation With Dual Relation-Aware Attention Network.

IEEE Trans Neural Netw Learn Syst. 2021 Jun;32(6):2547-2560. doi: 10.1109/TNNLS.2020.3006524. Epub 2021 Jun 2.

Robust Scene Parsing by Mining Supportive Knowledge From Dataset.

IEEE Trans Neural Netw Learn Syst. 2023 May;34(5):2633-2646. doi: 10.1109/TNNLS.2021.3107194. Epub 2023 May 2.

CTNet: Context-Based Tandem Network for Semantic Segmentation.

IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9904-9917. doi: 10.1109/TPAMI.2021.3132068. Epub 2022 Nov 7.

AlignSeg: Feature-Aligned Segmentation Networks.

IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):550-557. doi: 10.1109/TPAMI.2021.3062772. Epub 2021 Dec 7.

Global-and-Local Context Network for Semantic Segmentation of Street View Images.

Sensors (Basel). 2020 May 21;20(10):2907. doi: 10.3390/s20102907.

CCNet: Criss-Cross Attention for Semantic Segmentation.

IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):6896-6908. doi: 10.1109/TPAMI.2020.3007032. Epub 2023 May 5.

Perspective-Adaptive Convolutions for Scene Parsing.

IEEE Trans Pattern Anal Mach Intell. 2020 Apr;42(4):909-924. doi: 10.1109/TPAMI.2018.2890637. Epub 2019 Jan 1.

An Efficient Sampling-Based Attention Network for Semantic Segmentation.

IEEE Trans Image Process. 2022;31:2850-2863. doi: 10.1109/TIP.2022.3162101. Epub 2022 Apr 5.

PMID:33378265
