• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用前景感知和多尺度卷积注意力机制对卫星图像进行语义分割以检测山体滑坡

Semantic Segmentation of Satellite Images for Landslide Detection Using Foreground-Aware and Multi-Scale Convolutional Attention Mechanism.

作者信息

Yu Chih-Chang, Chen Yuan-Di, Cheng Hsu-Yung, Jiang Chi-Lun

机构信息

Department of Information and Computer Engineering, Chung Yuan Christian University, Taoyuan City 320, Taiwan.

Department of Computer Science and Information Engineering, National Central University, Taoyuan City 320, Taiwan.

出版信息

Sensors (Basel). 2024 Oct 10;24(20):6539. doi: 10.3390/s24206539.

DOI:10.3390/s24206539
PMID:39460020
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11511481/
Abstract

Advancements in satellite and aerial imagery technology have made it easier to obtain high-resolution remote sensing images, leading to widespread research and applications in various fields. Remote sensing image semantic segmentation is a crucial task that provides semantic and localization information for target objects. In addition to the large-scale variation issues common in most semantic segmentation datasets, aerial images present unique challenges, including high background complexity and imbalanced foreground-background ratios. However, general semantic segmentation methods primarily address scale variations in natural scenes and often neglect the specific challenges in remote sensing images, such as inadequate foreground modeling. In this paper, we present a foreground-aware remote sensing semantic segmentation model. The model introduces a multi-scale convolutional attention mechanism and utilizes a feature pyramid network architecture to extract multi-scale features, addressing the multi-scale problem. Additionally, we introduce a Foreground-Scene Relation Module to mitigate false alarms. The model enhances the foreground features by modeling the relationship between the foreground and the scene. In the loss function, a Soft Focal Loss is employed to focus on foreground samples during training, alleviating the foreground-background imbalance issue. Experimental results indicate that our proposed method outperforms current state-of-the-art general semantic segmentation methods and transformer-based methods on the LS dataset benchmark.

摘要

卫星和航空影像技术的进步使得获取高分辨率遥感影像变得更加容易,从而在各个领域引发了广泛的研究和应用。遥感影像语义分割是一项关键任务,它为目标物体提供语义和定位信息。除了大多数语义分割数据集中常见的大规模变化问题外,航空影像还存在独特的挑战,包括高背景复杂性和前景-背景比例失衡。然而,一般的语义分割方法主要解决自然场景中的尺度变化问题,往往忽略了遥感影像中的特定挑战,如前景建模不足。在本文中,我们提出了一种前景感知遥感语义分割模型。该模型引入了多尺度卷积注意力机制,并利用特征金字塔网络架构来提取多尺度特征,解决了多尺度问题。此外,我们引入了一个前景-场景关系模块来减少误报。该模型通过对前景与场景之间的关系进行建模来增强前景特征。在损失函数中,采用软焦损失在训练期间关注前景样本,缓解了前景-背景不平衡问题。实验结果表明,我们提出的方法在LS数据集基准上优于当前最先进的一般语义分割方法和基于Transformer的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/bbeeb1dff0c4/sensors-24-06539-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/bcf747abb31d/sensors-24-06539-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/141f34adc062/sensors-24-06539-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/dc97b2fe80db/sensors-24-06539-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/961a54e471f0/sensors-24-06539-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/55bbab7a2ff3/sensors-24-06539-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/523ae950410d/sensors-24-06539-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/2bb45242c602/sensors-24-06539-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/c5875ea1e385/sensors-24-06539-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/c5ff6ae2b51e/sensors-24-06539-g009a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/7be6216e4d5e/sensors-24-06539-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/bbeeb1dff0c4/sensors-24-06539-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/bcf747abb31d/sensors-24-06539-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/141f34adc062/sensors-24-06539-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/dc97b2fe80db/sensors-24-06539-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/961a54e471f0/sensors-24-06539-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/55bbab7a2ff3/sensors-24-06539-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/523ae950410d/sensors-24-06539-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/2bb45242c602/sensors-24-06539-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/c5875ea1e385/sensors-24-06539-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/c5ff6ae2b51e/sensors-24-06539-g009a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/7be6216e4d5e/sensors-24-06539-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30d7/11511481/bbeeb1dff0c4/sensors-24-06539-g011.jpg

相似文献

1
Semantic Segmentation of Satellite Images for Landslide Detection Using Foreground-Aware and Multi-Scale Convolutional Attention Mechanism.利用前景感知和多尺度卷积注意力机制对卫星图像进行语义分割以检测山体滑坡
Sensors (Basel). 2024 Oct 10;24(20):6539. doi: 10.3390/s24206539.
2
RSSFormer: Foreground Saliency Enhancement for Remote Sensing Land-Cover Segmentation.RSSFormer:用于遥感土地覆盖分割的前景显著增强
IEEE Trans Image Process. 2023;32:1052-1064. doi: 10.1109/TIP.2023.3238648. Epub 2023 Feb 3.
3
FarSeg++: Foreground-Aware Relation Network for Geospatial Object Segmentation in High Spatial Resolution Remote Sensing Imagery.FarSeg++:用于高空间分辨率遥感影像中地理空间目标分割的前景感知关系网络
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):13715-13729. doi: 10.1109/TPAMI.2023.3296757. Epub 2023 Oct 3.
4
EMR-HRNet: A Multi-Scale Feature Fusion Network for Landslide Segmentation from Remote Sensing Images.EMR-HRNet:一种用于从遥感图像中进行滑坡分割的多尺度特征融合网络。
Sensors (Basel). 2024 Jun 6;24(11):3677. doi: 10.3390/s24113677.
5
UNeXt: An Efficient Network for the Semantic Segmentation of High-Resolution Remote Sensing Images.UNeXt:一种用于高分辨率遥感影像语义分割的高效网络。
Sensors (Basel). 2024 Oct 16;24(20):6655. doi: 10.3390/s24206655.
6
Efficient Patch-Wise Semantic Segmentation for Large-Scale Remote Sensing Images.高效的基于补丁的大规模遥感图像语义分割。
Sensors (Basel). 2018 Sep 25;18(10):3232. doi: 10.3390/s18103232.
7
TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images.TMNet:一种用于遥感图像的两分支多尺度语义分割网络。
Sensors (Basel). 2023 Jun 26;23(13):5909. doi: 10.3390/s23135909.
8
High-Resolution Aerial Imagery Semantic Labeling with Dense Pyramid Network.高分辨率航空影像语义标注的密集金字塔网络方法
Sensors (Basel). 2018 Nov 5;18(11):3774. doi: 10.3390/s18113774.
9
A transformer-based approach empowered by a self-attention technique for semantic segmentation in remote sensing.一种基于自注意力技术的基于Transformer的方法用于遥感语义分割。
Heliyon. 2024 Apr 12;10(8):e29396. doi: 10.1016/j.heliyon.2024.e29396. eCollection 2024 Apr 30.
10
An Interactive Image Segmentation Method Based on Multi-Level Semantic Fusion.一种基于多级语义融合的交互式图像分割方法。
Sensors (Basel). 2023 Jul 14;23(14):6394. doi: 10.3390/s23146394.

本文引用的文献

1
Deep High-Resolution Representation Learning for Visual Recognition.用于视觉识别的深度高分辨率表征学习
IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3349-3364. doi: 10.1109/TPAMI.2020.2983686. Epub 2021 Sep 2.
2
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
3
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.
SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.
4
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.