• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BMSeNet:用于实时语义分割的多尺度上下文金字塔池化与空间细节增强网络

BMSeNet: Multiscale Context Pyramid Pooling and Spatial Detail Enhancement Network for Real-Time Semantic Segmentation.

作者信息

Zhao Shan, Zhao Xin, Huo Zhanqiang, Zhang Fukai

机构信息

School of Software, Henan Polytechnic University, Jiaozuo 454000, China.

出版信息

Sensors (Basel). 2024 Aug 9;24(16):5145. doi: 10.3390/s24165145.

DOI:10.3390/s24165145
PMID:39204841
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11360195/
Abstract

Most real-time semantic segmentation networks use shallow architectures to achieve fast inference speeds. This approach, however, limits a network's receptive field. Concurrently, feature information extraction is restricted to a single scale, which reduces the network's ability to generalize and maintain robustness. Furthermore, loss of image spatial details negatively impacts segmentation accuracy. To address these limitations, this paper proposes a Multiscale Context Pyramid Pooling and Spatial Detail Enhancement Network (BMSeNet). First, to address the limitation of singular semantic feature scales, a Multiscale Context Pyramid Pooling Module (MSCPPM) is introduced. By leveraging various pooling operations, this module efficiently enlarges the receptive field and better aggregates multiscale contextual information. Moreover, a Spatial Detail Enhancement Module (SDEM) is designed, to effectively compensate for lost spatial detail information and significantly enhance the perception of spatial details. Finally, a Bilateral Attention Fusion Module (BAFM) is proposed. This module leverages pixel positional correlations to guide the network in assigning appropriate weights to the features extracted from the two branches, effectively merging the feature information of both branches. Extensive experiments were conducted on the Cityscapes and CamVid datasets. Experimental results show that the proposed BMSeNet achieves a good balance between inference speed and segmentation accuracy, outperforming some state-of-the-art real-time semantic segmentation methods.

摘要

大多数实时语义分割网络使用浅层架构来实现快速推理速度。然而,这种方法限制了网络的感受野。同时,特征信息提取被限制在单一尺度,这降低了网络的泛化能力和稳健性。此外,图像空间细节的丢失对分割精度产生负面影响。为了解决这些限制,本文提出了一种多尺度上下文金字塔池化和空间细节增强网络(BMSeNet)。首先,为了解决单一语义特征尺度的限制,引入了多尺度上下文金字塔池化模块(MSCPPM)。通过利用各种池化操作,该模块有效地扩大了感受野,并更好地聚合了多尺度上下文信息。此外,设计了一个空间细节增强模块(SDEM),以有效补偿丢失的空间细节信息,并显著增强对空间细节的感知。最后,提出了一个双边注意力融合模块(BAFM)。该模块利用像素位置相关性来指导网络为从两个分支提取的特征分配适当的权重,有效地融合两个分支的特征信息。在Cityscapes和CamVid数据集上进行了大量实验。实验结果表明,所提出的BMSeNet在推理速度和分割精度之间取得了良好的平衡,优于一些现有的实时语义分割方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/4b1e5786c05c/sensors-24-05145-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/d6272a316219/sensors-24-05145-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/a22e3363b996/sensors-24-05145-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/ddc69ceea6fe/sensors-24-05145-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/b0c5f19d6345/sensors-24-05145-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/b9e233d8c4ae/sensors-24-05145-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/05c8b1a56553/sensors-24-05145-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/4012e4931eec/sensors-24-05145-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/43aed3936792/sensors-24-05145-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/61b056b674ca/sensors-24-05145-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/00ce1f55b323/sensors-24-05145-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/cd0fa4bdbd16/sensors-24-05145-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/62cd3608c300/sensors-24-05145-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/4b1e5786c05c/sensors-24-05145-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/d6272a316219/sensors-24-05145-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/a22e3363b996/sensors-24-05145-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/ddc69ceea6fe/sensors-24-05145-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/b0c5f19d6345/sensors-24-05145-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/b9e233d8c4ae/sensors-24-05145-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/05c8b1a56553/sensors-24-05145-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/4012e4931eec/sensors-24-05145-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/43aed3936792/sensors-24-05145-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/61b056b674ca/sensors-24-05145-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/00ce1f55b323/sensors-24-05145-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/cd0fa4bdbd16/sensors-24-05145-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/62cd3608c300/sensors-24-05145-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e3a/11360195/4b1e5786c05c/sensors-24-05145-g013.jpg

相似文献

1
BMSeNet: Multiscale Context Pyramid Pooling and Spatial Detail Enhancement Network for Real-Time Semantic Segmentation.BMSeNet:用于实时语义分割的多尺度上下文金字塔池化与空间细节增强网络
Sensors (Basel). 2024 Aug 9;24(16):5145. doi: 10.3390/s24165145.
2
Context and Spatial Feature Calibration for Real-Time Semantic Segmentation.用于实时语义分割的上下文和空间特征校准
IEEE Trans Image Process. 2023;32:5465-5477. doi: 10.1109/TIP.2023.3318967. Epub 2023 Oct 25.
3
Automatic segmentation of spine x-ray images based on multiscale feature enhancement network.基于多尺度特征增强网络的脊柱 X 射线图像自动分割。
Med Phys. 2024 Oct;51(10):7282-7294. doi: 10.1002/mp.17278. Epub 2024 Jun 30.
4
MFAFNet: A Lightweight and Efficient Network with Multi-Level Feature Adaptive Fusion for Real-Time Semantic Segmentation.MFAFNet:一种用于实时语义分割的具有多级特征自适应融合的轻量级高效网络。
Sensors (Basel). 2023 Jul 13;23(14):6382. doi: 10.3390/s23146382.
5
Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation.双边注意解码器:用于实时语义分割的轻量级解码器。
Neural Netw. 2021 May;137:188-199. doi: 10.1016/j.neunet.2021.01.021. Epub 2021 Jan 30.
6
MFEAFN: Multi-scale feature enhanced adaptive fusion network for image semantic segmentation.MFEAFN:用于图像语义分割的多尺度特征增强自适应融合网络。
PLoS One. 2022 Sep 30;17(9):e0274249. doi: 10.1371/journal.pone.0274249. eCollection 2022.
7
Lightweight semantic segmentation network with configurable context and small object attention.具有可配置上下文和小目标注意力的轻量级语义分割网络。
Front Comput Neurosci. 2023 Oct 23;17:1280640. doi: 10.3389/fncom.2023.1280640. eCollection 2023.
8
Rethinking 1D convolution for lightweight semantic segmentation.重新思考用于轻量级语义分割的一维卷积
Front Neurorobot. 2023 Feb 9;17:1119231. doi: 10.3389/fnbot.2023.1119231. eCollection 2023.
9
Real-Time Semantic Segmentation via a Densely Aggregated Bilateral Network.通过密集聚合双边网络实现实时语义分割
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):381-392. doi: 10.1109/TNNLS.2023.3326665. Epub 2025 Jan 7.
10
PS5-Net: a medical image segmentation network with multiscale resolution.PS5-Net:一种具有多尺度分辨率的医学图像分割网络。
J Med Imaging (Bellingham). 2024 Jan;11(1):014008. doi: 10.1117/1.JMI.11.1.014008. Epub 2024 Feb 19.

本文引用的文献

1
CGNet: A Light-Weight Context Guided Network for Semantic Segmentation.CGNet:用于语义分割的轻量级上下文引导网络
IEEE Trans Image Process. 2021;30:1169-1179. doi: 10.1109/TIP.2020.3042065. Epub 2020 Dec 17.
2
Deep High-Resolution Representation Learning for Visual Recognition.用于视觉识别的深度高分辨率表征学习
IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3349-3364. doi: 10.1109/TPAMI.2020.2983686. Epub 2021 Sep 2.
3
Her2Net: A Deep Framework for Semantic Segmentation and Classification of Cell Membranes and Nuclei in Breast Cancer Evaluation.
Her2Net:用于乳腺癌评估中细胞膜和细胞核语义分割与分类的深度框架。
IEEE Trans Image Process. 2018 May;27(5):2189-2200. doi: 10.1109/TIP.2018.2795742.
4
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
5
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.