MFEAFN: Multi-scale feature enhanced adaptive fusion network for image semantic segmentation.

Affiliation

State Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang, Guizhou, China.

Publication information

PLoS One. 2022 Sep 30;17(9):e0274249. doi: 10.1371/journal.pone.0274249. eCollection 2022.

DOI: 10.1371/journal.pone.0274249
PMID: 36178906
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9524699/

Abstract

Low-level features contain spatial detail information, and high-level features contain rich semantic information. Semantic segmentation research focuses on fully acquiring and effectively fusing spatial detail with semantic information. This paper proposes a multiscale feature-enhanced adaptive fusion network named MFEAFN to improve semantic segmentation performance. First, we designed a Double Spatial Pyramid Module named DSPM to extract more high-level semantic information. Second, we designed a Focusing Selective Fusion Module named FSFM to fuse different scales and levels of feature maps. Specifically, the feature maps are enhanced to adaptively fuse these features by generating attention weights through a spatial attention mechanism and a two-dimensional discrete cosine transform, respectively. To validate the effectiveness of FSFM, we designed different fusion modules for comparison and ablation experiments. MFEAFN achieved 82.64% and 78.46% mIoU on the PASCAL VOC2012 and Cityscapes datasets. In addition, our method has better segmentation results than state-of-the-art methods.
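
The abstract describes FSFM only at a high level, so the following is a minimal sketch of the general idea and not the authors' implementation. It assumes that low-level features are reweighted per pixel by a spatial attention map, that high-level features are reweighted per channel using a two-dimensional DCT coefficient, and that the two results are summed. The function names (`dct2_coefficient`, `channel_weights`, `spatial_weights`, `fuse`), the omission of learned layers, and the assignment of each attention type to each branch are all assumptions made for illustration.

```python
import math

def dct2_coefficient(channel, u, v):
    """Unnormalized 2D DCT-II coefficient (u, v) of one H x W channel."""
    h, w = len(channel), len(channel[0])
    total = 0.0
    for i in range(h):
        for j in range(w):
            total += (channel[i][j]
                      * math.cos(math.pi * (i + 0.5) * u / h)
                      * math.cos(math.pi * (j + 0.5) * v / w))
    return total

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def channel_weights(features, u=0, v=0):
    """One weight in (0, 1) per channel, from a single DCT frequency.

    Assumption: a real module would pass the DCT descriptors through
    learned layers; here the scaled coefficient goes straight to a sigmoid.
    """
    h, w = len(features[0]), len(features[0][0])
    return [sigmoid(dct2_coefficient(ch, u, v) / (h * w)) for ch in features]

def spatial_weights(features):
    """One weight in (0, 1) per pixel, from the mean over channels."""
    c = len(features)
    h, w = len(features[0]), len(features[0][0])
    return [[sigmoid(sum(features[k][i][j] for k in range(c)) / c)
             for j in range(w)] for i in range(h)]

def fuse(low, high):
    """Weight low-level features per pixel and high-level features per
    channel, then add them. Both inputs must share the shape C x H x W."""
    s = spatial_weights(low)
    cw = channel_weights(high)
    return [[[s[i][j] * low[k][i][j] + cw[k] * high[k][i][j]
              for j in range(len(low[0][0]))]
             for i in range(len(low[0]))]
            for k in range(len(low))]

# Toy inputs: 2 channels, 2 x 2 pixels each.
low = [[[1.0, 0.0], [0.0, 1.0]], [[0.0, 2.0], [2.0, 0.0]]]
high = [[[0.5, 0.5], [0.5, 0.5]], [[-1.0, -1.0], [-1.0, -1.0]]]
fused = fuse(low, high)
```

With `u = v = 0` the DCT coefficient is simply the sum of the channel, so the channel weight depends only on the global average. Choosing other frequencies makes the weight sensitive to different spatial patterns within the channel.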


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5752/9524699/05cf2cb445f2/pone.0274249.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5752/9524699/4f63c2f22910/pone.0274249.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5752/9524699/481d87dd69aa/pone.0274249.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5752/9524699/9cdf2af4283e/pone.0274249.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5752/9524699/a5b94ce095de/pone.0274249.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5752/9524699/6c3c3df5402c/pone.0274249.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5752/9524699/657c9ee4d13e/pone.0274249.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5752/9524699/738fed38c118/pone.0274249.g008.jpg

Similar articles

1. MFEAFN: Multi-scale feature enhanced adaptive fusion network for image semantic segmentation.
PLoS One. 2022 Sep 30;17(9):e0274249. doi: 10.1371/journal.pone.0274249. eCollection 2022.
2. Multiple-Attention Mechanism Network for Semantic Segmentation.
Sensors (Basel). 2022 Jun 13;22(12):4477. doi: 10.3390/s22124477.
3. Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation.
Neural Netw. 2021 May;137:188-199. doi: 10.1016/j.neunet.2021.01.021. Epub 2021 Jan 30.
4. Inter-Level Feature Balanced Fusion Network for Street Scene Segmentation.
Sensors (Basel). 2021 Nov 25;21(23):7844. doi: 10.3390/s21237844.
5. BMSeNet: Multiscale Context Pyramid Pooling and Spatial Detail Enhancement Network for Real-Time Semantic Segmentation.
Sensors (Basel). 2024 Aug 9;24(16):5145. doi: 10.3390/s24165145.
6. Fusion network based on the dual attention mechanism and atrous spatial pyramid pooling for automatic segmentation in retinal vessel images.
J Opt Soc Am A Opt Image Sci Vis. 2022 Aug 1;39(8):1393-1402. doi: 10.1364/JOSAA.459912.
7. D-SAT: dual semantic aggregation transformer with dual attention for medical image segmentation.
Phys Med Biol. 2023 Dec 22;69(1). doi: 10.1088/1361-6560/acf2e5.
8. A multibranch and multiscale neural network based on semantic perception for multimodal medical image fusion.
Sci Rep. 2024 Jul 30;14(1):17609. doi: 10.1038/s41598-024-68183-3.
9. Semantic segmentation method of underwater images based on encoder-decoder architecture.
PLoS One. 2022 Aug 25;17(8):e0272666. doi: 10.1371/journal.pone.0272666. eCollection 2022.
10. An ENet Semantic Segmentation Method Combined with Attention Mechanism.
Comput Intell Neurosci. 2023 Feb 22;2023:6965259. doi: 10.1155/2023/6965259. eCollection 2023.
