• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AlignSeg:特征对齐分割网络。

AlignSeg: Feature-Aligned Segmentation Networks.

作者信息

Huang Zilong, Wei Yunchao, Wang Xinggang, Liu Wenyu, Huang Thomas S, Shi Humphrey

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):550-557. doi: 10.1109/TPAMI.2021.3062772. Epub 2021 Dec 7.

DOI:10.1109/TPAMI.2021.3062772
PMID:33646946
Abstract

Aggregating features in terms of different convolutional blocks or contextual embeddings has been proven to be an effective way to strengthen feature representations for semantic segmentation. However, most of the current popular network architectures tend to ignore the misalignment issues during the feature aggregation process caused by step-by-step downsampling operations and indiscriminate contextual information fusion. In this paper, we explore the principles in addressing such feature misalignment issues and inventively propose Feature-Aligned Segmentation Networks (AlignSeg). AlignSeg consists of two primary modules, i.e., the Aligned Feature Aggregation (AlignFA) module and the Aligned Context Modeling (AlignCM) module. First, AlignFA adopts a simple learnable interpolation strategy to learn transformation offsets of pixels, which can effectively relieve the feature misalignment issue caused by multi-resolution feature aggregation. Second, with the contextual embeddings in hand, AlignCM enables each pixel to choose private custom contextual information adaptively, making the contextual embeddings be better aligned. We validate the effectiveness of our AlignSeg network with extensive experiments on Cityscapes and ADE20K, achieving new state-of-the-art mIoU scores of 82.6 and 45.95 percent, respectively. Our source code is available at https://github.com/speedinghzl/AlignSeg.

摘要

根据不同的卷积块或上下文嵌入聚合特征已被证明是一种有效的方法,可以加强语义分割的特征表示。然而,当前大多数流行的网络架构往往忽略了在特征聚合过程中由于逐步下采样操作和不加区分的上下文信息融合而导致的特征错位问题。在本文中,我们探索了解决此类特征错位问题的原理,并创造性地提出了特征对齐分割网络(AlignSeg)。AlignSeg由两个主要模块组成,即对齐特征聚合(AlignFA)模块和对齐上下文建模(AlignCM)模块。首先,AlignFA采用一种简单的可学习插值策略来学习像素的变换偏移,这可以有效地缓解多分辨率特征聚合导致的特征错位问题。其次,借助上下文嵌入,AlignCM使每个像素能够自适应地选择私有定制上下文信息,从而使上下文嵌入更好地对齐。我们通过在Cityscapes和ADE20K上进行的大量实验验证了我们的AlignSeg网络的有效性,分别达到了82.6%和45.95%的新的最优平均交并比分数。我们的源代码可在https://github.com/speedinghzl/AlignSeg获取。

相似文献

1
AlignSeg: Feature-Aligned Segmentation Networks.AlignSeg:特征对齐分割网络。
IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):550-557. doi: 10.1109/TPAMI.2021.3062772. Epub 2021 Dec 7.
2
CTNet: Context-Based Tandem Network for Semantic Segmentation.CTNet:用于语义分割的基于上下文的串联网络
IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9904-9917. doi: 10.1109/TPAMI.2021.3132068. Epub 2022 Nov 7.
3
Context and Spatial Feature Calibration for Real-Time Semantic Segmentation.用于实时语义分割的上下文和空间特征校准
IEEE Trans Image Process. 2023;32:5465-5477. doi: 10.1109/TIP.2023.3318967. Epub 2023 Oct 25.
4
CCNet: Criss-Cross Attention for Semantic Segmentation.CCNet:用于语义分割的交叉注意力。
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):6896-6908. doi: 10.1109/TPAMI.2020.3007032. Epub 2023 May 5.
5
Global-Guided Selective Context Network for Scene Parsing.基于全局引导的选择性上下文网络的场景解析。
IEEE Trans Neural Netw Learn Syst. 2022 Apr;33(4):1752-1764. doi: 10.1109/TNNLS.2020.3043808. Epub 2022 Apr 4.
6
Global Aggregation Then Local Distribution for Scene Parsing.用于场景解析的全局聚合然后局部分布
IEEE Trans Image Process. 2021;30:6829-6842. doi: 10.1109/TIP.2021.3099366.
7
BASeg: Boundary aware semantic segmentation for autonomous driving.BASeg:用于自动驾驶的边界感知语义分割。
Neural Netw. 2023 Jan;157:460-470. doi: 10.1016/j.neunet.2022.10.034. Epub 2022 Nov 9.
8
Scene Segmentation With Dual Relation-Aware Attention Network.基于双重关系感知注意力网络的场景分割。
IEEE Trans Neural Netw Learn Syst. 2021 Jun;32(6):2547-2560. doi: 10.1109/TNNLS.2020.3006524. Epub 2021 Jun 2.
9
Multiple-Attention Mechanism Network for Semantic Segmentation.多注意力机制网络的语义分割。
Sensors (Basel). 2022 Jun 13;22(12):4477. doi: 10.3390/s22124477.
10
Denoised Non-Local Neural Network for Semantic Segmentation.
IEEE Trans Neural Netw Learn Syst. 2024 May;35(5):7162-7174. doi: 10.1109/TNNLS.2022.3214216. Epub 2024 May 2.

引用本文的文献

1
Dual-stream hybrid architecture with adaptive multi-scale boundary-aware mechanisms for robust urban change detection in smart cities.具有自适应多尺度边界感知机制的双流混合架构用于智慧城市中的稳健城市变化检测
Sci Rep. 2025 Aug 21;15(1):30729. doi: 10.1038/s41598-025-16148-5.
2
UAV-DETR: An Enhanced RT-DETR Architecture for Efficient Small Object Detection in UAV Imagery.无人机检测Transformer(UAV-DETR):一种用于无人机图像中高效小目标检测的增强型实时检测Transformer(RT-DETR)架构。
Sensors (Basel). 2025 Jul 24;25(15):4582. doi: 10.3390/s25154582.
3
Preserving spatial and quantitative information in unpaired biomedical image-to-image translation.
在非配对生物医学图像到图像转换中保留空间和定量信息。
Cell Rep Methods. 2025 Jun 16;5(6):101074. doi: 10.1016/j.crmeth.2025.101074. Epub 2025 Jun 9.
4
HDB-Net: hierarchical dual-branch network for retinal layer segmentation in diseased OCT images.HDB-Net:用于病变光学相干断层扫描(OCT)图像视网膜层分割的分层双分支网络。
Biomed Opt Express. 2024 Aug 19;15(9):5359-5383. doi: 10.1364/BOE.530469. eCollection 2024 Sep 1.
5
Shared-Weight-Based Multi-Dimensional Feature Alignment Network for Oriented Object Detection in Remote Sensing Imagery.基于共享权值的多维特征对齐网络在遥感图像目标检测中的应用。
Sensors (Basel). 2022 Dec 25;23(1):207. doi: 10.3390/s23010207.
6
Blind Deblurring of Remote-Sensing Single Images Based on Feature Alignment.基于特征对齐的遥感单图像盲去模糊。
Sensors (Basel). 2022 Oct 17;22(20):7894. doi: 10.3390/s22207894.
7
Local Label Point Correction for Edge Detection of Overlapping Cervical Cells.用于重叠宫颈细胞边缘检测的局部标记点校正
Front Neuroinform. 2022 May 12;16:895290. doi: 10.3389/fninf.2022.895290. eCollection 2022.