• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

光谱GPT:光谱遥感基础模型。

SpectralGPT: Spectral Remote Sensing Foundation Model.

作者信息

Hong Danfeng, Zhang Bing, Li Xuyang, Li Yuxuan, Li Chenyu, Yao Jing, Yokoya Naoto, Li Hao, Ghamisi Pedram, Jia Xiuping, Plaza Antonio, Gamba Paolo, Benediktsson Jon Atli, Chanussot Jocelyn

出版信息

IEEE Trans Pattern Anal Mach Intell. 2024 Aug;46(8):5227-5244. doi: 10.1109/TPAMI.2024.3362475. Epub 2024 Jul 2.

DOI:10.1109/TPAMI.2024.3362475
PMID:38568772
Abstract

The foundation model has recently garnered significant attention due to its potential to revolutionize the field of visual representation learning in a self-supervised manner. While most foundation models are tailored to effectively process RGB images for various visual tasks, there is a noticeable gap in research focused on spectral data, which offers valuable information for scene understanding, especially in remote sensing (RS) applications. To fill this gap, we created for the first time a universal RS foundation model, named SpectralGPT, which is purpose-built to handle spectral RS images using a novel 3D generative pretrained transformer (GPT). Compared to existing foundation models, SpectralGPT 1) accommodates input images with varying sizes, resolutions, time series, and regions in a progressive training fashion, enabling full utilization of extensive RS Big Data; 2) leverages 3D token generation for spatial-spectral coupling; 3) captures spectrally sequential patterns via multi-target reconstruction; and 4) trains on one million spectral RS images, yielding models with over 600 million parameters. Our evaluation highlights significant performance improvements with pretrained SpectralGPT models, signifying substantial potential in advancing spectral RS Big Data applications within the field of geoscience across four downstream tasks: single/multi-label scene classification, semantic segmentation, and change detection.

摘要

基础模型最近因其有潜力以自监督方式彻底改变视觉表征学习领域而备受关注。虽然大多数基础模型都是为有效处理用于各种视觉任务的RGB图像而量身定制的,但在专注于光谱数据的研究方面存在明显差距,光谱数据为场景理解提供了有价值的信息,尤其是在遥感(RS)应用中。为了填补这一差距,我们首次创建了一个通用的RS基础模型,名为SpectralGPT,它专门用于使用新型3D生成预训练变压器(GPT)来处理光谱RS图像。与现有基础模型相比,SpectralGPT:1)以渐进训练方式适应具有不同大小、分辨率、时间序列和区域的输入图像,从而能够充分利用大量的RS大数据;2)利用3D令牌生成进行空间光谱耦合;3)通过多目标重建捕获光谱序列模式;4)在100万张光谱RS图像上进行训练,生成具有超过6亿个参数的模型。我们的评估突出了预训练的SpectralGPT模型在性能上的显著提升,这表明在推进地球科学领域内光谱RS大数据应用方面,在四个下游任务(单/多标签场景分类、语义分割和变化检测)中具有巨大潜力。

相似文献

1
SpectralGPT: Spectral Remote Sensing Foundation Model.光谱GPT:光谱遥感基础模型。
IEEE Trans Pattern Anal Mach Intell. 2024 Aug;46(8):5227-5244. doi: 10.1109/TPAMI.2024.3362475. Epub 2024 Jul 2.
2
Changen2: Multi-Temporal Remote Sensing Generative Change Foundation Model.ChangEn2:多时态遥感生成式变化基础模型
IEEE Trans Pattern Anal Mach Intell. 2025 Feb;47(2):725-741. doi: 10.1109/TPAMI.2024.3475824. Epub 2025 Jan 9.
3
Self-supervised learning for remote sensing scene classification under the few shot scenario.基于小样本场景的遥感场景分类的自监督学习。
Sci Rep. 2023 Jan 9;13(1):433. doi: 10.1038/s41598-022-27313-5.
4
A transformer-based approach empowered by a self-attention technique for semantic segmentation in remote sensing.一种基于自注意力技术的基于Transformer的方法用于遥感语义分割。
Heliyon. 2024 Apr 12;10(8):e29396. doi: 10.1016/j.heliyon.2024.e29396. eCollection 2024 Apr 30.
5
Semi-supervised bidirectional alignment for Remote Sensing cross-domain scene classification.用于遥感跨域场景分类的半监督双向对齐
ISPRS J Photogramm Remote Sens. 2023 Jan;195:192-203. doi: 10.1016/j.isprsjprs.2022.11.013.
6
SAGN: Semantic-Aware Graph Network for Remote Sensing Scene Classification.SAGN:用于遥感场景分类的语义感知图网络。
IEEE Trans Image Process. 2023;32:1011-1025. doi: 10.1109/TIP.2023.3238310. Epub 2023 Jan 31.
7
SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.SwinCross:用于 PET/CT 图像中头颈部肿瘤分割的跨模态 Swin 变换器。
Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.
8
A modality-collaborative convolution and transformer hybrid network for unpaired multi-modal medical image segmentation with limited annotations.一种用于具有有限标注的未配对多模态医学图像分割的模态协作卷积与Transformer混合网络。
Med Phys. 2023 Sep;50(9):5460-5478. doi: 10.1002/mp.16338. Epub 2023 Mar 15.
9
RS-SSKD: Self-Supervision Equipped with Knowledge Distillation for Few-Shot Remote Sensing Scene Classification.RS - SSKD:用于少样本遥感场景分类的知识蒸馏自监督方法
Sensors (Basel). 2021 Feb 24;21(5):1566. doi: 10.3390/s21051566.
10
Cross-Modal Remote Sensing Image-Text Retrieval via Context and Uncertainty-Aware Prompt.通过上下文和不确定性感知提示实现跨模态遥感图像-文本检索
IEEE Trans Neural Netw Learn Syst. 2025 Jun;36(6):11384-11398. doi: 10.1109/TNNLS.2024.3458898.

引用本文的文献

1
DBRSNet: a dual-branch remote sensing image segmentation model based on feature interaction and multi-scale feature fusion.DBRSNet:一种基于特征交互和多尺度特征融合的双分支遥感图像分割模型。
Sci Rep. 2025 Jul 30;15(1):27786. doi: 10.1038/s41598-025-13236-4.
2
Generative Artificial Intelligence for Synthetic Spectral Data Augmentation in Sensor-Based Plastic Recycling.用于基于传感器的塑料回收中合成光谱数据增强的生成式人工智能
Sensors (Basel). 2025 Jul 1;25(13):4114. doi: 10.3390/s25134114.
3
High-precision inversion of vegetation parameters in the AI era: Integrating hyperspectral remote sensing and deep learning.
人工智能时代植被参数的高精度反演:融合高光谱遥感与深度学习
Innovation (Camb). 2025 Mar 5;6(6):100868. doi: 10.1016/j.xinn.2025.100868. eCollection 2025 Jun 2.
4
A novel hybrid deep learning approach for super-resolution and objects detection in remote sensing.一种用于遥感超分辨率和目标检测的新型混合深度学习方法。
Sci Rep. 2025 May 17;15(1):17221. doi: 10.1038/s41598-025-01476-3.
5
Detection of Disease Features on Retinal OCT Scans Using RETFound.使用RETFound在视网膜光学相干断层扫描(OCT)上检测疾病特征
Bioengineering (Basel). 2024 Nov 25;11(12):1186. doi: 10.3390/bioengineering11121186.
6
Artificial intelligence for geoscience: Progress, challenges, and perspectives.地球科学中的人工智能:进展、挑战与展望。
Innovation (Camb). 2024 Aug 22;5(5):100691. doi: 10.1016/j.xinn.2024.100691. eCollection 2024 Sep 9.
7
Interpretable foundation models as decryptors peering into the Earth system.可解释的基础模型作为洞察地球系统的解密器。
Innovation (Camb). 2024 Aug 5;5(5):100682. doi: 10.1016/j.xinn.2024.100682. eCollection 2024 Sep 9.
8
Implications of Convolutional Neural Network for Brain MRI Image Classification to Identify Alzheimer's Disease.卷积神经网络对脑磁共振成像图像进行分类以识别阿尔茨海默病的意义。
Parkinsons Dis. 2024 Aug 22;2024:6111483. doi: 10.1155/2024/6111483. eCollection 2024.
9
Integrated single-cell and bulk characterization of branched chain amino acid metabolism-related key gene BCAT1 and association with prognosis and immunogenicity of clear cell renal cell carcinoma.分支链氨基酸代谢相关关键基因 BCAT1 的单细胞和批量综合特征及其与透明细胞肾细胞癌预后和免疫原性的关联。
Aging (Albany NY). 2024 Feb 2;16(3):2715-2735. doi: 10.18632/aging.205506.