• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于自适应注意力和深度融合的语义分割网络:利用多尺度扩张卷积金字塔

Semantic Segmentation Network Based on Adaptive Attention and Deep Fusion Utilizing a Multi-Scale Dilated Convolutional Pyramid.

作者信息

Zhao Shan, Wang Zihao, Huo Zhanqiang, Zhang Fukai

机构信息

School of Software, Henan Polytechnic University, Jiaozuo 454000, China.

出版信息

Sensors (Basel). 2024 Aug 16;24(16):5305. doi: 10.3390/s24165305.

DOI:10.3390/s24165305
PMID:39205000
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11359814/
Abstract

Deep learning has recently made significant progress in semantic segmentation. However, the current methods face critical challenges. The segmentation process often lacks sufficient contextual information and attention mechanisms, low-level features lack semantic richness, and high-level features suffer from poor resolution. These limitations reduce the model's ability to accurately understand and process scene details, particularly in complex scenarios, leading to segmentation outputs that may have inaccuracies in boundary delineation, misclassification of regions, and poor handling of small or overlapping objects. To address these challenges, this paper proposes a Semantic Segmentation Network Based on Adaptive Attention and Deep Fusion with the Multi-Scale Dilated Convolutional Pyramid (SDAMNet). Specifically, the Dilated Convolutional Atrous Spatial Pyramid Pooling (DCASPP) module is developed to enhance contextual information in semantic segmentation. Additionally, a Semantic Channel Space Details Module (SCSDM) is devised to improve the extraction of significant features through multi-scale feature fusion and adaptive feature selection, enhancing the model's perceptual capability for key regions and optimizing semantic understanding and segmentation performance. Furthermore, a Semantic Features Fusion Module (SFFM) is constructed to address the semantic deficiency in low-level features and the low resolution in high-level features. The effectiveness of SDAMNet is demonstrated on two datasets, revealing significant improvements in Mean Intersection over Union (MIOU) by 2.89% and 2.13%, respectively, compared to the Deeplabv3+ network.

摘要

深度学习最近在语义分割方面取得了重大进展。然而,当前的方法面临着严峻的挑战。分割过程往往缺乏足够的上下文信息和注意力机制,低级特征缺乏语义丰富性,高级特征的分辨率较差。这些限制降低了模型准确理解和处理场景细节的能力,特别是在复杂场景中,导致分割输出在边界划定、区域误分类以及对小物体或重叠物体的处理不佳等方面可能存在不准确之处。为了应对这些挑战,本文提出了一种基于自适应注意力和多尺度扩张卷积金字塔深度融合的语义分割网络(SDAMNet)。具体而言,开发了扩张卷积空洞空间金字塔池化(DCASPP)模块以增强语义分割中的上下文信息。此外,设计了一个语义通道空间细节模块(SCSDM),通过多尺度特征融合和自适应特征选择来改进显著特征的提取,增强模型对关键区域的感知能力,并优化语义理解和分割性能。此外,构建了一个语义特征融合模块(SFFM)来解决低级特征中的语义不足和高级特征中的低分辨率问题。在两个数据集上验证了SDAMNet的有效性,与Deeplabv3+网络相比,平均交并比(MIOU)分别显著提高了2.89%和2.13%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/6f9b9d2c0dbd/sensors-24-05305-g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/c8ea843878c3/sensors-24-05305-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/79c39efd8a79/sensors-24-05305-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/4fc3cf20fea8/sensors-24-05305-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/1608f11a5a79/sensors-24-05305-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/4e816cf37d5f/sensors-24-05305-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/6c75e80565cd/sensors-24-05305-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/398079fdfa28/sensors-24-05305-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/4a5f8eb9d4a0/sensors-24-05305-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/5beb7d79fc41/sensors-24-05305-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/5c9a1e20110a/sensors-24-05305-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/df49c1697789/sensors-24-05305-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/5bbd0dbd6d64/sensors-24-05305-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/3a838c168acb/sensors-24-05305-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/8d05f4f5a992/sensors-24-05305-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/faba9ca966d5/sensors-24-05305-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/65e9f1fef81c/sensors-24-05305-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/0502e50052af/sensors-24-05305-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/c4e93a04c5f4/sensors-24-05305-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/6f9b9d2c0dbd/sensors-24-05305-g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/c8ea843878c3/sensors-24-05305-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/79c39efd8a79/sensors-24-05305-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/4fc3cf20fea8/sensors-24-05305-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/1608f11a5a79/sensors-24-05305-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/4e816cf37d5f/sensors-24-05305-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/6c75e80565cd/sensors-24-05305-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/398079fdfa28/sensors-24-05305-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/4a5f8eb9d4a0/sensors-24-05305-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/5beb7d79fc41/sensors-24-05305-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/5c9a1e20110a/sensors-24-05305-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/df49c1697789/sensors-24-05305-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/5bbd0dbd6d64/sensors-24-05305-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/3a838c168acb/sensors-24-05305-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/8d05f4f5a992/sensors-24-05305-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/faba9ca966d5/sensors-24-05305-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/65e9f1fef81c/sensors-24-05305-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/0502e50052af/sensors-24-05305-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/c4e93a04c5f4/sensors-24-05305-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f00/11359814/6f9b9d2c0dbd/sensors-24-05305-g019.jpg

相似文献

1
Semantic Segmentation Network Based on Adaptive Attention and Deep Fusion Utilizing a Multi-Scale Dilated Convolutional Pyramid.基于自适应注意力和深度融合的语义分割网络:利用多尺度扩张卷积金字塔
Sensors (Basel). 2024 Aug 16;24(16):5305. doi: 10.3390/s24165305.
2
Cattle Target Segmentation Method in Multi-Scenes Using Improved DeepLabV3+ Method.基于改进的DeepLabV3+方法的多场景牛目标分割方法
Animals (Basel). 2023 Aug 4;13(15):2521. doi: 10.3390/ani13152521.
3
Multi-Scale Deep Neural Network Based on Dilated Convolution for Spacecraft Image Segmentation.基于扩张卷积的多尺度深度神经网络在航天器图像分割中的应用。
Sensors (Basel). 2022 Jun 1;22(11):4222. doi: 10.3390/s22114222.
4
Novel Method of Semantic Segmentation Applicable to Augmented Reality.适用于增强现实的语义分割新方法。
Sensors (Basel). 2020 Mar 20;20(6):1737. doi: 10.3390/s20061737.
5
FGCN: Image-Fused Point Cloud Semantic Segmentation with Fusion Graph Convolutional Network.FGCN:基于融合图卷积网络的图像融合点云语义分割
Sensors (Basel). 2023 Oct 9;23(19):8338. doi: 10.3390/s23198338.
6
RTC_TongueNet: An improved tongue image segmentation model based on DeepLabV3.RTC_TongueNet:一种基于DeepLabV3的改进型舌图像分割模型。
Digit Health. 2024 Mar 28;10:20552076241242773. doi: 10.1177/20552076241242773. eCollection 2024 Jan-Dec.
7
PMJAF-Net: Pyramidal multi-scale joint attention and adaptive fusion network for explainable skin lesion segmentation.PMJAF-Net:用于可解释皮肤病变分割的金字塔多尺度联合注意和自适应融合网络。
Comput Biol Med. 2023 Oct;165:107454. doi: 10.1016/j.compbiomed.2023.107454. Epub 2023 Sep 12.
8
DeepMDSCBA: An Improved Semantic Segmentation Model Based on DeepLabV3+ for Apple Images.深度MDSCBA:一种基于DeepLabV3+的用于苹果图像的改进语义分割模型。
Foods. 2022 Dec 10;11(24):3999. doi: 10.3390/foods11243999.
9
BMSeNet: Multiscale Context Pyramid Pooling and Spatial Detail Enhancement Network for Real-Time Semantic Segmentation.BMSeNet:用于实时语义分割的多尺度上下文金字塔池化与空间细节增强网络
Sensors (Basel). 2024 Aug 9;24(16):5145. doi: 10.3390/s24165145.
10
MFEAFN: Multi-scale feature enhanced adaptive fusion network for image semantic segmentation.MFEAFN:用于图像语义分割的多尺度特征增强自适应融合网络。
PLoS One. 2022 Sep 30;17(9):e0274249. doi: 10.1371/journal.pone.0274249. eCollection 2022.

本文引用的文献

1
A multi-branched semantic segmentation network based on twisted information sharing pattern for medical images.基于扭曲信息共享模式的医学图像多分支语义分割网络。
Comput Methods Programs Biomed. 2024 Jan;243:107914. doi: 10.1016/j.cmpb.2023.107914. Epub 2023 Nov 13.
2
Using AAEHS-Net as an Attention-Based Auxiliary Extraction and Hybrid Subsampled Network for Semantic Segmentation.基于 AAEHS-Net 的注意力辅助提取与混合子采样网络的语义分割。
Comput Intell Neurosci. 2022 Oct 14;2022:1536976. doi: 10.1155/2022/1536976. eCollection 2022.
3
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.
SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.
4
A computational approach to edge detection.一种基于计算的边缘检测方法。
IEEE Trans Pattern Anal Mach Intell. 1986 Jun;8(6):679-98.