Scene Segmentation With Dual Relation-Aware Attention Network.

Publication information

IEEE Trans Neural Netw Learn Syst. 2021 Jun;32(6):2547-2560. doi: 10.1109/TNNLS.2020.3006524. Epub 2021 Jun 2.

DOI: 10.1109/TNNLS.2020.3006524
PMID: 32745005
Abstract

In this article, we propose a Dual Relation-aware Attention Network (DRANet) for scene segmentation. Exploiting context efficiently is essential for pixel-level recognition. To address this, we adaptively capture contextual information with a relation-aware attention mechanism. Specifically, we append two types of attention modules on top of a dilated fully convolutional network (FCN); they model contextual dependencies in the spatial and channel dimensions, respectively. In the attention modules, we adopt a self-attention mechanism to model semantic associations between any two pixels or channels, so each pixel or channel can adaptively aggregate context from all pixels or channels according to their correlations. To reduce the high computation and memory cost of this pairwise association computation, we further design two types of compact attention modules. In the compact attention modules, each pixel or channel is associated with only a small number of gathering centers and obtains its context aggregation over these gathering centers. In addition, we add a cross-level gating decoder that selectively enhances spatial details, which boosts the performance of the network. We conduct extensive experiments to validate the effectiveness of our network and achieve new state-of-the-art segmentation performance on four challenging scene segmentation data sets: Cityscapes, ADE20K, PASCAL Context, and COCO Stuff. In particular, we achieve a mean IoU score of 82.9% on the Cityscapes test set without using extra coarse annotated data.
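The difference between pairwise attention and compact attention over gathering centers can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation. It assumes plain dot-product similarity with no learned query/key/value projections, and the `gather_centers` helper (averaging contiguous groups of pixels) is a hypothetical stand-in for however the paper actually builds its gathering centers. All function names and the toy data are illustrative.

```python
import math

def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def attend(queries, keys):
    """Each query aggregates the keys, weighted by softmax of dot-product similarity.

    Assumption: no learned projections and no residual connection.
    """
    out = []
    for q in queries:
        weights = softmax([dot(q, k) for k in keys])
        out.append([sum(w * k[d] for w, k in zip(weights, keys))
                    for d in range(len(q))])
    return out

def gather_centers(features, num_centers):
    """Hypothetical gathering step: average contiguous groups of feature vectors."""
    size = math.ceil(len(features) / num_centers)
    centers = []
    for start in range(0, len(features), size):
        group = features[start:start + size]
        centers.append([sum(col) / len(group) for col in zip(*group)])
    return centers

def transpose(matrix):
    """Swap rows and columns of a list-of-lists matrix."""
    return [list(col) for col in zip(*matrix)]

# Toy feature map: N = 6 pixels, C = 2 channels, forming two obvious groups.
pixels = [[1.0, 0.0], [0.9, 0.1], [1.1, 0.0],
          [0.0, 1.0], [0.1, 0.9], [0.0, 1.1]]

# Full spatial attention: every pixel is compared with every pixel (N x N).
full_spatial = attend(pixels, pixels)

# Compact spatial attention: every pixel is compared with K centers only (N x K).
centers = gather_centers(pixels, 2)
compact_spatial = attend(pixels, centers)

# Full channel attention: the same operation with channels as the items (C x C).
channels = transpose(pixels)
full_channel = transpose(attend(channels, channels))

print(len(pixels) * len(pixels))   # similarities for full attention: 36
print(len(pixels) * len(centers))  # similarities for compact attention: 12
```

With N items and K gathering centers, the compact variant computes N x K similarities instead of N x N, which is where the saving in computation and memory described in the abstract comes from when K is much smaller than N.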

Similar articles

1. Scene Segmentation With Dual Relation-Aware Attention Network.
   IEEE Trans Neural Netw Learn Syst. 2021 Jun;32(6):2547-2560. doi: 10.1109/TNNLS.2020.3006524. Epub 2021 Jun 2.
2. Global-Guided Selective Context Network for Scene Parsing.
   IEEE Trans Neural Netw Learn Syst. 2022 Apr;33(4):1752-1764. doi: 10.1109/TNNLS.2020.3043808. Epub 2022 Apr 4.
3. Global Aggregation Then Local Distribution for Scene Parsing.
   IEEE Trans Image Process. 2021;30:6829-6842. doi: 10.1109/TIP.2021.3099366.
4. Scene Segmentation with DAG-Recurrent Neural Networks.
   IEEE Trans Pattern Anal Mach Intell. 2018 Jun;40(6):1480-1493. doi: 10.1109/TPAMI.2017.2712691. Epub 2017 Jun 6.
5. CTNet: Context-Based Tandem Network for Semantic Segmentation.
   IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9904-9917. doi: 10.1109/TPAMI.2021.3132068. Epub 2022 Nov 7.
6. Cross-Image Pixel Contrasting for Semantic Segmentation.
   IEEE Trans Pattern Anal Mach Intell. 2024 Aug;46(8):5398-5412. doi: 10.1109/TPAMI.2024.3367952. Epub 2024 Jul 2.
7. An Efficient Sampling-Based Attention Network for Semantic Segmentation.
   IEEE Trans Image Process. 2022;31:2850-2863. doi: 10.1109/TIP.2022.3162101. Epub 2022 Apr 5.
8. CCNet: Criss-Cross Attention for Semantic Segmentation.
   IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):6896-6908. doi: 10.1109/TPAMI.2020.3007032. Epub 2023 May 5.
9. Multiple-Attention Mechanism Network for Semantic Segmentation.
   Sensors (Basel). 2022 Jun 13;22(12):4477. doi: 10.3390/s22124477.
10. Double Similarity Distillation for Semantic Image Segmentation.
    IEEE Trans Image Process. 2021;30:5363-5376. doi: 10.1109/TIP.2021.3083113. Epub 2021 Jun 3.

Cited by

1. Fault diagnosis methods for imbalanced samples of hydraulic pumps based on DA-DCGAN.
   Sci Rep. 2025 Jul 1;15(1):21216. doi: 10.1038/s41598-025-04909-1.
2. CTH-Net: A CNN and Transformer hybrid network for skin lesion segmentation.
   iScience. 2024 Mar 6;27(4):109442. doi: 10.1016/j.isci.2024.109442. eCollection 2024 Apr 19.
3. A Heart Image Segmentation Method Based on Position Attention Mechanism and Inverted Pyramid.
   Sensors (Basel). 2023 Nov 23;23(23):9366. doi: 10.3390/s23239366.
4. Boosting Semantic Segmentation by Conditioning the Backbone with Semantic Boundaries.
   Sensors (Basel). 2023 Aug 6;23(15):6980. doi: 10.3390/s23156980.
5. W-Net: Convolutional neural network for segmenting remote sensing images by dual path semantics.
   PLoS One. 2023 Jul 27;18(7):e0288311. doi: 10.1371/journal.pone.0288311. eCollection 2023.
6. AEAU-Net: an unsupervised end-to-end registration network by combining affine transformation and deformable medical image registration.
   Med Biol Eng Comput. 2023 Nov;61(11):2859-2873. doi: 10.1007/s11517-023-02887-y. Epub 2023 Jul 27.
7. Automatic Detection of Secundum Atrial Septal Defect in Children Based on Color Doppler Echocardiographic Images Using Convolutional Neural Networks.
   Front Cardiovasc Med. 2022 Apr 6;9:834285. doi: 10.3389/fcvm.2022.834285. eCollection 2022.
8. RDCTrans U-Net: A Hybrid Variable Architecture for Liver CT Image Segmentation.
   Sensors (Basel). 2022 Mar 23;22(7):2452. doi: 10.3390/s22072452.
9. A Cascade Attention Based Facial Expression Recognition Network by Fusing Multi-Scale Spatio-Temporal Features.
   Sensors (Basel). 2022 Feb 10;22(4):1350. doi: 10.3390/s22041350.
10. An improved Deeplabv3+ semantic segmentation algorithm with multiple loss constraints.
    PLoS One. 2022 Jan 19;17(1):e0261582. doi: 10.1371/journal.pone.0261582. eCollection 2022.