• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GLE-net:用于遥感图像语义分割的全局-局部信息增强

GLE-net: global-local information enhancement for semantic segmentation of remote sensing images.

作者信息

Yang Junliang, Chen Guorong, Huang Jiaming, Ma Denglong, Liu Jingcheng, Zhu Huazheng

机构信息

Department of Intelligent Technology and Engineering, Chongqing University of Science and Technology, No.20, East University Town Road, Shapingba District, Chongqing, 401331, China.

School of Mechanical Engineering, Xi'an Jiaotong University, No. 28, Xianning West Road, Xi'an, 710049, Shaanxi, China.

出版信息

Sci Rep. 2024 Oct 25;14(1):25282. doi: 10.1038/s41598-024-76622-4.

DOI:10.1038/s41598-024-76622-4
PMID:39455717
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11512047/
Abstract

Remote sensing (RS) images contain a wealth of information with expansive potential for applications in image segmentation. However, Convolutional Neural Networks (CNN) face challenges in fully harnessing the global contextual information. Leveraging the formidable capabilities of global information modeling with Swin-Transformer, a novel RS images segmentation model with CNN (GLE-Net) was introduced. This integration gives rise to a revamped encoder structure. The subbranch initiates the process by extracting features at varying scales within the RS images using the Multiscale Feature Fusion Module (MFM), acquiring rich semantic information, discerning localized finer features, and adeptly handling occlusions. Subsequently, Feature Compression Module (FCM) is introduced in main branch to downsize the feature map, effectively reducing information loss while preserving finer details, enhancing segmentation accuracy for smaller targets. Finally, we integrate local features and global features through Spatial Information Enhancement Module (SIEM) for comprehensive feature modeling, augmenting the segmentation capabilities of model. We performed experiments on public datasets provided by ISPRS, yielding notably remarkable experimental outcomes. This underscores the substantial potential of our model in the realm of RS image segmentation within the context of scientific research.

摘要

遥感(RS)图像包含丰富的信息,在图像分割应用方面具有广阔的潜力。然而,卷积神经网络(CNN)在充分利用全局上下文信息方面面临挑战。利用Swin-Transformer强大的全局信息建模能力,引入了一种新型的基于CNN的RS图像分割模型(GLE-Net)。这种整合产生了一种改进的编码器结构。子分支通过使用多尺度特征融合模块(MFM)在RS图像中提取不同尺度的特征来启动该过程,获取丰富的语义信息,识别局部更精细的特征,并巧妙地处理遮挡。随后,在主分支中引入特征压缩模块(FCM)来缩小特征图的尺寸,有效减少信息损失,同时保留更精细的细节,提高对较小目标的分割精度。最后,我们通过空间信息增强模块(SIEM)整合局部特征和全局特征,进行全面的特征建模,增强模型的分割能力。我们在国际摄影测量与遥感学会(ISPRS)提供的公共数据集上进行了实验,取得了显著的实验结果。这凸显了我们的模型在科学研究背景下的RS图像分割领域的巨大潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/dc06c3c322b5/41598_2024_76622_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/a29c188ee3d0/41598_2024_76622_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/989cc3912592/41598_2024_76622_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/257ed2ae5079/41598_2024_76622_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/c63b70597ddf/41598_2024_76622_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/f054062f5f14/41598_2024_76622_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/fd5ee2e70b63/41598_2024_76622_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/f6e84e148326/41598_2024_76622_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/dc06c3c322b5/41598_2024_76622_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/a29c188ee3d0/41598_2024_76622_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/989cc3912592/41598_2024_76622_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/257ed2ae5079/41598_2024_76622_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/c63b70597ddf/41598_2024_76622_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/f054062f5f14/41598_2024_76622_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/fd5ee2e70b63/41598_2024_76622_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/f6e84e148326/41598_2024_76622_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/239b/11512047/dc06c3c322b5/41598_2024_76622_Fig8_HTML.jpg

相似文献

1
GLE-net: global-local information enhancement for semantic segmentation of remote sensing images.GLE-net:用于遥感图像语义分割的全局-局部信息增强
Sci Rep. 2024 Oct 25;14(1):25282. doi: 10.1038/s41598-024-76622-4.
2
TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images.TMNet:一种用于遥感图像的两分支多尺度语义分割网络。
Sensors (Basel). 2023 Jun 26;23(13):5909. doi: 10.3390/s23135909.
3
PS5-Net: a medical image segmentation network with multiscale resolution.PS5-Net:一种具有多尺度分辨率的医学图像分割网络。
J Med Imaging (Bellingham). 2024 Jan;11(1):014008. doi: 10.1117/1.JMI.11.1.014008. Epub 2024 Feb 19.
4
Swin-Net: A Swin-Transformer-Based Network Combing with Multi-Scale Features for Segmentation of Breast Tumor Ultrasound Images.Swin-Net:一种基于Swin-Transformer并结合多尺度特征的用于乳腺肿瘤超声图像分割的网络。
Diagnostics (Basel). 2024 Jan 26;14(3):269. doi: 10.3390/diagnostics14030269.
5
Transformer-Based Model with Dynamic Attention Pyramid Head for Semantic Segmentation of VHR Remote Sensing Imagery.基于Transformer且带有动态注意力金字塔头的甚高分辨率遥感影像语义分割模型
Entropy (Basel). 2022 Nov 6;24(11):1619. doi: 10.3390/e24111619.
6
Enhancing skin lesion segmentation with a fusion of convolutional neural networks and transformer models.通过融合卷积神经网络和Transformer模型增强皮肤病变分割
Heliyon. 2024 May 17;10(10):e31395. doi: 10.1016/j.heliyon.2024.e31395. eCollection 2024 May 30.
7
CPFTransformer: transformer fusion context pyramid medical image segmentation network.CPFTransformer:变换器融合上下文金字塔医学图像分割网络。
Front Neurosci. 2023 Dec 7;17:1288366. doi: 10.3389/fnins.2023.1288366. eCollection 2023.
8
TGDAUNet: Transformer and GCNN based dual-branch attention UNet for medical image segmentation.TGDAUNet:基于 Transformer 和 GCNN 的双分支注意力 U-Net 用于医学图像分割。
Comput Biol Med. 2023 Dec;167:107583. doi: 10.1016/j.compbiomed.2023.107583. Epub 2023 Oct 21.
9
SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.SwinCross:用于 PET/CT 图像中头颈部肿瘤分割的跨模态 Swin 变换器。
Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.
10
ST-Unet: Swin Transformer boosted U-Net with Cross-Layer Feature Enhancement for medical image segmentation.ST-Unet:具有跨层特征增强的 Swin Transformer 增强型 U-Net,用于医学图像分割。
Comput Biol Med. 2023 Feb;153:106516. doi: 10.1016/j.compbiomed.2022.106516. Epub 2023 Jan 6.

引用本文的文献

1
Enhancing cross view geo localization through global local quadrant interaction network.通过全局局部象限交互网络增强跨视图地理定位
Sci Rep. 2025 Sep 29;15(1):33431. doi: 10.1038/s41598-025-18935-6.
2
A Cross-Modal Attention-Driven Multi-Sensor Fusion Method for Semantic Segmentation of Point Clouds.一种用于点云语义分割的跨模态注意力驱动多传感器融合方法。
Sensors (Basel). 2025 Apr 14;25(8):2474. doi: 10.3390/s25082474.

本文引用的文献

1
GQE-Net: A Graph-Based Quality Enhancement Network for Point Cloud Color Attribute.GQE-Net:一种用于点云颜色属性的基于图的质量增强网络。
IEEE Trans Image Process. 2023;32:6303-6317. doi: 10.1109/TIP.2023.3330086. Epub 2023 Nov 20.
2
Exploiting multimodal CNN architecture for automated teeth segmentation on dental panoramic X-ray images.利用多模态卷积神经网络架构对牙科全景X射线图像进行牙齿自动分割。
Proc Inst Mech Eng H. 2023 Mar;237(3):395-405. doi: 10.1177/09544119231157137. Epub 2023 Feb 18.
3
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.
DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
4
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.
5
Fully Convolutional Networks for Semantic Segmentation.全卷积网络用于语义分割。
IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):640-651. doi: 10.1109/TPAMI.2016.2572683. Epub 2016 May 24.