• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于实时语义分割的上下文和空间特征校准

Context and Spatial Feature Calibration for Real-Time Semantic Segmentation.

作者信息

Li Kaige, Geng Qichuan, Wan Maoxian, Cao Xiaochun, Zhou Zhong

出版信息

IEEE Trans Image Process. 2023;32:5465-5477. doi: 10.1109/TIP.2023.3318967. Epub 2023 Oct 25.

DOI:10.1109/TIP.2023.3318967
PMID:37773909
Abstract

Context modeling or multi-level feature fusion methods have been proved to be effective in improving semantic segmentation performance. However, they are not specialized to deal with the problems of pixel-context mismatch and spatial feature misalignment, and the high computational complexity hinders their widespread application in real-time scenarios. In this work, we propose a lightweight Context and Spatial Feature Calibration Network (CSFCN) to address the above issues with pooling-based and sampling-based attention mechanisms. CSFCN contains two core modules: Context Feature Calibration (CFC) module and Spatial Feature Calibration (SFC) module. CFC adopts a cascaded pyramid pooling module to efficiently capture nested contexts, and then aggregates private contexts for each pixel based on pixel-context similarity to realize context feature calibration. SFC splits features into multiple groups of sub-features along the channel dimension and propagates sub-features therein by the learnable sampling to achieve spatial feature calibration. Extensive experiments on the Cityscapes and CamVid datasets illustrate that our method achieves a state-of-the-art trade-off between speed and accuracy. Concretely, our method achieves 78.7% mIoU with 70.0 FPS and 77.8% mIoU with 179.2 FPS on the Cityscapes and CamVid test sets, respectively. The code is available at https://nave.vr3i.com/ and https://github.com/kaigelee/CSFCN.

摘要

上下文建模或多级特征融合方法已被证明在提高语义分割性能方面是有效的。然而,它们并非专门用于处理像素上下文不匹配和空间特征未对齐的问题,并且高计算复杂度阻碍了它们在实时场景中的广泛应用。在这项工作中,我们提出了一种轻量级的上下文和空间特征校准网络(CSFCN),以通过基于池化和基于采样的注意力机制来解决上述问题。CSFCN包含两个核心模块:上下文特征校准(CFC)模块和空间特征校准(SFC)模块。CFC采用级联金字塔池化模块来有效地捕获嵌套上下文,然后基于像素上下文相似度为每个像素聚合私有上下文,以实现上下文特征校准。SFC沿着通道维度将特征拆分为多组子特征,并通过可学习采样在其中传播子特征,以实现空间特征校准。在Cityscapes和CamVid数据集上进行的大量实验表明,我们的方法在速度和准确性之间实现了最优平衡。具体而言,我们的方法在Cityscapes和CamVid测试集上分别以70.0 FPS的速度实现了78.7%的平均交并比(mIoU)和以179.2 FPS的速度实现了77.8%的mIoU。代码可在https://nave.vr3i.com/和https://github.com/kaigelee/CSFCN获取。

相似文献

1
Context and Spatial Feature Calibration for Real-Time Semantic Segmentation.用于实时语义分割的上下文和空间特征校准
IEEE Trans Image Process. 2023;32:5465-5477. doi: 10.1109/TIP.2023.3318967. Epub 2023 Oct 25.
2
BMSeNet: Multiscale Context Pyramid Pooling and Spatial Detail Enhancement Network for Real-Time Semantic Segmentation.BMSeNet:用于实时语义分割的多尺度上下文金字塔池化与空间细节增强网络
Sensors (Basel). 2024 Aug 9;24(16):5145. doi: 10.3390/s24165145.
3
Rethinking 1D convolution for lightweight semantic segmentation.重新思考用于轻量级语义分割的一维卷积
Front Neurorobot. 2023 Feb 9;17:1119231. doi: 10.3389/fnbot.2023.1119231. eCollection 2023.
4
Lightweight semantic segmentation network with configurable context and small object attention.具有可配置上下文和小目标注意力的轻量级语义分割网络。
Front Comput Neurosci. 2023 Oct 23;17:1280640. doi: 10.3389/fncom.2023.1280640. eCollection 2023.
5
AlignSeg: Feature-Aligned Segmentation Networks.AlignSeg:特征对齐分割网络。
IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):550-557. doi: 10.1109/TPAMI.2021.3062772. Epub 2021 Dec 7.
6
A lightweight multi-dimension dynamic convolutional network for real-time semantic segmentation.一种用于实时语义分割的轻量级多维动态卷积网络。
Front Neurorobot. 2022 Dec 15;16:1075520. doi: 10.3389/fnbot.2022.1075520. eCollection 2022.
7
MFAFNet: A Lightweight and Efficient Network with Multi-Level Feature Adaptive Fusion for Real-Time Semantic Segmentation.MFAFNet:一种用于实时语义分割的具有多级特征自适应融合的轻量级高效网络。
Sensors (Basel). 2023 Jul 13;23(14):6382. doi: 10.3390/s23146382.
8
Multi-Level and Multi-Scale Feature Aggregation Network for Semantic Segmentation in Vehicle-Mounted Scenes.车载场景语义分割的多层次多尺度特征聚合网络。
Sensors (Basel). 2021 May 9;21(9):3270. doi: 10.3390/s21093270.
9
HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation.HAFormer:释放层次感知特征在轻量级语义分割中的力量
IEEE Trans Image Process. 2024;33:4202-4214. doi: 10.1109/TIP.2024.3425048. Epub 2024 Jul 22.
10
Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation.双边注意解码器:用于实时语义分割的轻量级解码器。
Neural Netw. 2021 May;137:188-199. doi: 10.1016/j.neunet.2021.01.021. Epub 2021 Jan 30.

引用本文的文献

1
UAV-DETR: An Enhanced RT-DETR Architecture for Efficient Small Object Detection in UAV Imagery.无人机检测Transformer(UAV-DETR):一种用于无人机图像中高效小目标检测的增强型实时检测Transformer(RT-DETR)架构。
Sensors (Basel). 2025 Jul 24;25(15):4582. doi: 10.3390/s25154582.
2
LN-DETR: cross-scale feature fusion and re-weighting for lung nodule detection.LN-DETR:用于肺结节检测的跨尺度特征融合与重新加权
Sci Rep. 2025 May 3;15(1):15543. doi: 10.1038/s41598-025-00309-7.
3
Multi-scale feature fusion and feature calibration with edge information enhancement for remote sensing object detection.
用于遥感目标检测的多尺度特征融合与带边缘信息增强的特征校准
Sci Rep. 2025 May 2;15(1):15371. doi: 10.1038/s41598-025-99835-7.