• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RST-Net:一种基于双分支编码器结构的遥感图像语义分割网络。

RST-Net: A Semantic Segmentation Network for Remote Sensing Images Based on a Dual-Branch Encoder Structure.

作者信息

Yang Na, Tian Chuanzhao, Gu Xingfa, Zhang Yanting, Li Xuewen, Zhang Feng

机构信息

College of Remote Sensing and Information Engineering, North China Institute of Aerospace Engineering, Langfang 065000, China.

Collaborative Innovation Center of Aerospace Remote Sensing Information Processing and Application of Hebei Province, Langfang 065000, China.

出版信息

Sensors (Basel). 2025 Sep 5;25(17):5531. doi: 10.3390/s25175531.

DOI:10.3390/s25175531
PMID:40942960
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12431347/
Abstract

High-resolution remote sensing images often suffer from inadequate fusion between global and local features, leading to the loss of long-range dependencies and blurred spatial details, while also exhibiting limited adaptability to multi-scale object segmentation. To overcome these limitations, this study proposes RST-Net, a semantic segmentation network featuring a dual-branch encoder structure. The encoder integrates a ResNeXt-50-based CNN branch for extracting local spatial features and a Shunted Transformer (ST) branch for capturing global contextual information. To further enhance multi-scale representation, the multi-scale feature enhancement module (MSFEM) is embedded in the CNN branch, leveraging atrous and depthwise separable convolutions to dynamically aggregate features. Additionally, the residual dynamic feature fusion (RDFF) module is incorporated into skip connections to improve interactions between encoder and decoder features. Experiments on the Vaihingen and Potsdam datasets show that RST-Net achieves promising performance, with MIoU scores of 77.04% and 79.56%, respectively, validating its effectiveness in semantic segmentation tasks.

摘要

高分辨率遥感图像常常存在全局特征与局部特征融合不足的问题,导致长距离依赖关系丢失、空间细节模糊,同时在多尺度目标分割方面的适应性也有限。为克服这些局限性,本研究提出了RST-Net,这是一种具有双分支编码器结构的语义分割网络。该编码器集成了一个基于ResNeXt-50的卷积神经网络分支,用于提取局部空间特征,以及一个分流变压器(ST)分支,用于捕获全局上下文信息。为进一步增强多尺度表示,多尺度特征增强模块(MSFEM)被嵌入到卷积神经网络分支中,利用空洞卷积和深度可分离卷积动态聚合特征。此外,残余动态特征融合(RDFF)模块被纳入跳跃连接中,以改善编码器和解码器特征之间的交互。在Vaihingen和Potsdam数据集上的实验表明,RST-Net取得了良好的性能,平均交并比(MIoU)分数分别为77.04%和79.56%,验证了其在语义分割任务中的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/4f4893e27fda/sensors-25-05531-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/8f0009dd2868/sensors-25-05531-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/ddb66ad42b84/sensors-25-05531-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/3eda544b1217/sensors-25-05531-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/3a8781a24141/sensors-25-05531-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/96da30608e8c/sensors-25-05531-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/a02f820c3257/sensors-25-05531-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/4f4893e27fda/sensors-25-05531-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/8f0009dd2868/sensors-25-05531-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/ddb66ad42b84/sensors-25-05531-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/3eda544b1217/sensors-25-05531-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/3a8781a24141/sensors-25-05531-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/96da30608e8c/sensors-25-05531-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/a02f820c3257/sensors-25-05531-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fd6/12431347/4f4893e27fda/sensors-25-05531-g007.jpg

相似文献

1
RST-Net: A Semantic Segmentation Network for Remote Sensing Images Based on a Dual-Branch Encoder Structure.RST-Net:一种基于双分支编码器结构的遥感图像语义分割网络。
Sensors (Basel). 2025 Sep 5;25(17):5531. doi: 10.3390/s25175531.
2
DGCFNet: Dual Global Context Fusion Network for remote sensing image semantic segmentation.DGCFNet:用于遥感图像语义分割的双全局上下文融合网络
PeerJ Comput Sci. 2025 Mar 27;11:e2786. doi: 10.7717/peerj-cs.2786. eCollection 2025.
3
MFPI-Net: A Multi-Scale Feature Perception and Interaction Network for Semantic Segmentation of Urban Remote Sensing Images.MFPI-Net:一种用于城市遥感影像语义分割的多尺度特征感知与交互网络
Sensors (Basel). 2025 Jul 27;25(15):4660. doi: 10.3390/s25154660.
4
Multi-level channel-spatial attention and light-weight scale-fusion network (MCSLF-Net): multi-level channel-spatial attention and light-weight scale-fusion transformer for 3D brain tumor segmentation.多级通道空间注意力与轻量级尺度融合网络(MCSLF-Net):用于3D脑肿瘤分割的多级通道空间注意力与轻量级尺度融合变换器
Quant Imaging Med Surg. 2025 Jul 1;15(7):6301-6325. doi: 10.21037/qims-2025-354. Epub 2025 Jun 30.
5
DCMC-UNet: A Novel Segmentation Model for Carbon Traces in Oil-Immersed Transformers Improved with Dynamic Feature Fusion and Adaptive Illumination Enhancement.DCMC-UNet:一种通过动态特征融合和自适应光照增强改进的油浸式变压器碳痕分割新模型。
Sensors (Basel). 2025 Jun 23;25(13):3904. doi: 10.3390/s25133904.
6
Liver Semantic Segmentation Method Based on Multi-Channel Feature Extraction and Cross Fusion.基于多通道特征提取与交叉融合的肝脏语义分割方法
Bioengineering (Basel). 2025 Jun 11;12(6):636. doi: 10.3390/bioengineering12060636.
7
Multi-scale fusion semantic enhancement network for medical image segmentation.用于医学图像分割的多尺度融合语义增强网络。
Sci Rep. 2025 Jul 2;15(1):23018. doi: 10.1038/s41598-025-07806-9.
8
TLTNet: A novel transscale cascade layered transformer network for enhanced retinal blood vessel segmentation.TLTNet:一种新颖的跨尺度级联分层Transformer 网络,用于增强视网膜血管分割。
Comput Biol Med. 2024 Aug;178:108773. doi: 10.1016/j.compbiomed.2024.108773. Epub 2024 Jun 25.
9
VMDU-net: a dual encoder multi-scale fusion network for polyp segmentation with Vision Mamba and Cross-Shape Transformer integration.VMDU-net:一种用于息肉分割的双编码器多尺度融合网络,集成了视觉曼巴和十字形变换器
Front Artif Intell. 2025 Jun 18;8:1557508. doi: 10.3389/frai.2025.1557508. eCollection 2025.
10
Dynamic atrous attention and dual branch context fusion for cross scale Building segmentation in high resolution remote sensing imagery.用于高分辨率遥感影像跨尺度建筑物分割的动态空洞注意力与双分支上下文融合
Sci Rep. 2025 Aug 21;15(1):30800. doi: 10.1038/s41598-025-14751-0.

本文引用的文献

1
A semantic segmentation framework with UNet-pyramid for landslide prediction using remote sensing data.一种基于UNet金字塔的语义分割框架,用于利用遥感数据进行滑坡预测。
Sci Rep. 2024 Dec 3;14(1):30071. doi: 10.1038/s41598-024-79266-6.
2
MS-TCNet: An effective Transformer-CNN combined network using multi-scale feature learning for 3D medical image segmentation.MS-TCNet:一种基于多尺度特征学习的有效的 Transformer-CNN 组合网络,用于 3D 医学图像分割。
Comput Biol Med. 2024 Mar;170:108057. doi: 10.1016/j.compbiomed.2024.108057. Epub 2024 Jan 28.
3
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.
DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
4
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.