• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ChangEn2:多时态遥感生成式变化基础模型

Changen2: Multi-Temporal Remote Sensing Generative Change Foundation Model.

作者信息

Zheng Zhuo, Ermon Stefano, Kim Dongjun, Zhang Liangpei, Zhong Yanfei

出版信息

IEEE Trans Pattern Anal Mach Intell. 2025 Feb;47(2):725-741. doi: 10.1109/TPAMI.2024.3475824. Epub 2025 Jan 9.

DOI:10.1109/TPAMI.2024.3475824
PMID:39388323
Abstract

Our understanding of the temporal dynamics of the Earth's surface has been significantly advanced by deep vision models, which often require a massive amount of labeled multi-temporal images for training. However, collecting, preprocessing, and annotating multi-temporal remote sensing images at scale is non-trivial since it is expensive and knowledge-intensive. In this paper, we present scalable multi-temporal change data generators based on generative models, which are cheap and automatic, alleviating these data problems. Our main idea is to simulate a stochastic change process over time. We describe the stochastic change process as a probabilistic graphical model, namely the generative probabilistic change model (GPCM), which factorizes the complex simulation problem into two more tractable sub-problems, i.e., condition-level change event simulation and image-level semantic change synthesis. To solve these two problems, we present Changen2, a GPCM implemented with a resolution-scalable diffusion transformer which can generate time series of remote sensing images and corresponding semantic and change labels from labeled and even unlabeled single-temporal images. Changen2 is a "generative change foundation model" that can be trained at scale via self-supervision, and is capable of producing change supervisory signals from unlabeled single-temporal images. Unlike existing "foundation models", our generative change foundation model synthesizes change data to train task-specific foundation models for change detection. The resulting model possesses inherent zero-shot change detection capabilities and excellent transferability. Comprehensive experiments suggest Changen2 has superior spatiotemporal scalability in data generation, e.g., Changen2 model trained on 256 pixel single-temporal images can yield time series of any length and resolutions of 1,024 pixels. Changen2 pre-trained models exhibit superior zero-shot performance (narrowing the performance gap to 3% on LEVIR-CD and approximately 10% on both S2Looking and SECOND, compared to fully supervised counterpart) and transferability across multiple types of change tasks, including ordinary and off-nadir building change, land-use/land-cover change, and disaster assessment.

摘要

深度视觉模型极大地推动了我们对地球表面时间动态的理解,这类模型通常需要大量带标签的多时间序列图像进行训练。然而,大规模收集、预处理和标注多时间序列遥感图像并非易事,因为这既昂贵又需要专业知识。在本文中,我们提出了基于生成模型的可扩展多时间序列变化数据生成器,该生成器成本低廉且自动化,缓解了这些数据问题。我们的主要思路是模拟随时间变化的随机过程。我们将随机变化过程描述为概率图模型,即生成概率变化模型(GPCM),它将复杂的模拟问题分解为两个更易于处理的子问题,即条件级变化事件模拟和图像级语义变化合成。为了解决这两个问题,我们提出了Changen2,这是一个用分辨率可扩展的扩散变换器实现的GPCM,它可以从带标签甚至无标签的单时间序列图像生成遥感图像的时间序列以及相应的语义和变化标签。Changen2是一个“生成式变化基础模型”,可以通过自监督进行大规模训练,并且能够从未标注的单时间序列图像中生成变化监督信号。与现有的“基础模型”不同,我们的生成式变化基础模型合成变化数据以训练用于变化检测的特定任务基础模型。所得模型具有固有的零样本变化检测能力和出色的可迁移性。综合实验表明,Changen2在数据生成方面具有卓越的时空可扩展性,例如,在256像素单时间序列图像上训练的Changen2模型可以生成任意长度的时间序列和1024像素分辨率的图像。Changen2预训练模型表现出卓越的零样本性能(与完全监督的对应模型相比,在LEVIR-CD上性能差距缩小到3%,在S2Looking和SECOND上均缩小到约10%)以及跨多种变化任务的可迁移性,包括普通和斜距建筑物变化、土地利用/土地覆盖变化以及灾害评估。

相似文献

1
Changen2: Multi-Temporal Remote Sensing Generative Change Foundation Model.ChangEn2:多时态遥感生成式变化基础模型
IEEE Trans Pattern Anal Mach Intell. 2025 Feb;47(2):725-741. doi: 10.1109/TPAMI.2024.3475824. Epub 2025 Jan 9.
2
MetaEarth: a Generative Foundation Model for Global-scale Remote Sensing Image Generation.MetaEarth:一种用于全球尺度遥感图像生成的生成式基础模型。
IEEE Trans Pattern Anal Mach Intell. 2024 Nov 26;PP. doi: 10.1109/TPAMI.2024.3507010.
3
SpectralGPT: Spectral Remote Sensing Foundation Model.光谱GPT:光谱遥感基础模型。
IEEE Trans Pattern Anal Mach Intell. 2024 Aug;46(8):5227-5244. doi: 10.1109/TPAMI.2024.3362475. Epub 2024 Jul 2.
4
A hybrid generative adversarial network for weakly-supervised cloud detection in multispectral images.一种用于多光谱图像中弱监督云检测的混合生成对抗网络。
Remote Sens Environ. 2022 Oct;280:113197. doi: 10.1016/j.rse.2022.113197.
5
A Change Detection Method for Remote Sensing Images Based on Coupled Dictionary and Deep Learning.基于耦合字典和深度学习的遥感图像变化检测方法。
Comput Intell Neurosci. 2022 Jan 17;2022:3404858. doi: 10.1155/2022/3404858. eCollection 2022.
6
Swin-HSTPS: Research on Target Detection Algorithms for Multi-Source High-Resolution Remote Sensing Images.Swin-HSTPS:多源高分遥感图像目标检测算法研究。
Sensors (Basel). 2021 Dec 4;21(23):8113. doi: 10.3390/s21238113.
7
Difference-complementary Learning and Label Reassignment for Multimodal Semi-Supervised Semantic Segmentation of Remote Sensing Images.用于遥感图像多模态半监督语义分割的差异互补学习与标签重新分配
IEEE Trans Image Process. 2025 Jan 10;PP. doi: 10.1109/TIP.2025.3526064.
8
Fully Convolutional Change Detection Framework With Generative Adversarial Network for Unsupervised, Weakly Supervised and Regional Supervised Change Detection.基于生成对抗网络的全卷积变化检测框架用于无监督、弱监督和区域监督变化检测。
IEEE Trans Pattern Anal Mach Intell. 2023 Aug;45(8):9774-9788. doi: 10.1109/TPAMI.2023.3237896. Epub 2023 Jun 30.
9
A survey on generative adversarial networks for imbalance problems in computer vision tasks.关于计算机视觉任务中不平衡问题的生成对抗网络调查。
J Big Data. 2021;8(1):27. doi: 10.1186/s40537-021-00414-0. Epub 2021 Jan 29.
10
Changes to Captions: An Attentive Network for Remote Sensing Change Captioning.字幕变更:一种用于遥感变化字幕的注意力网络。
IEEE Trans Image Process. 2023;32:6047-6060. doi: 10.1109/TIP.2023.3328224. Epub 2023 Nov 8.

引用本文的文献

1
CViT Weakly Supervised Network Fusing Dual-Branch Local-Global Features for Hyperspectral Image Classification.用于高光谱图像分类的融合双分支局部-全局特征的CViT弱监督网络
Entropy (Basel). 2025 Aug 15;27(8):869. doi: 10.3390/e27080869.
2
LKAFFNet: A Novel Large-Kernel Attention Feature Fusion Network for Land Cover Segmentation.LKAFFNet:一种用于土地覆盖分割的新型大内核注意力特征融合网络。
Sensors (Basel). 2024 Dec 25;25(1):54. doi: 10.3390/s25010054.