Xu Junwei, Wu Fangfang, Li Xin, Dong Weisheng, Huang Tao, Shi Guangming
IEEE Trans Image Process. 2023;32:4416-4431. doi: 10.1109/TIP.2023.3299189. Epub 2023 Aug 4.
In recent years, researchers have become more interested in hyperspectral image fusion (HIF) as a potential alternative to expensive high-resolution hyperspectral imaging systems, which aims to recover a high-resolution hyperspectral image (HR-HSI) from two images obtained from low-resolution hyperspectral (LR-HSI) and high-spatial-resolution multispectral (HR-MSI). It is generally assumed that degeneration in both the spatial and spectral domains is known in traditional model-based methods or that there existed paired HR-LR training data in deep learning-based methods. However, such an assumption is often invalid in practice. Furthermore, most existing works, either introducing hand-crafted priors or treating HIF as a black-box problem, cannot take full advantage of the physical model. To address those issues, we propose a deep blind HIF method by unfolding model-based maximum a posterior (MAP) estimation into a network implementation in this paper. Our method works with a Laplace distribution (LD) prior that does not need paired training data. Moreover, we have developed an observation module to directly learn degeneration in the spatial domain from LR-HSI data, addressing the challenge of spatially-varying degradation. We also propose to learn the uncertainty (mean and variance) of LD models using a novel Swin-Transformer-based denoiser and to estimate the variance of degraded images from residual errors (rather than treating them as global scalars). All parameters of the MAP estimation algorithm and the observation module can be jointly optimized through end-to-end training. Extensive experiments on both synthetic and real datasets show that the proposed method outperforms existing competing methods in terms of both objective evaluation indexes and visual qualities.
近年来,作为昂贵的高分辨率高光谱成像系统的一种潜在替代方案,研究人员对高光谱图像融合(HIF)越来越感兴趣,该技术旨在从低分辨率高光谱图像(LR-HSI)和高空间分辨率多光谱图像(HR-MSI)获取的两幅图像中恢复高分辨率高光谱图像(HR-HSI)。通常认为,在基于传统模型的方法中,空间和光谱域的退化是已知的,或者在基于深度学习的方法中存在配对的高分辨率-低分辨率训练数据。然而,这种假设在实践中往往是无效的。此外,大多数现有工作,要么引入手工制作的先验,要么将HIF视为一个黑箱问题,无法充分利用物理模型。为了解决这些问题,我们在本文中提出了一种深度盲HIF方法,即将基于模型的最大后验(MAP)估计展开为网络实现。我们的方法使用拉普拉斯分布(LD)先验,该先验不需要配对训练数据。此外,我们开发了一个观测模块,直接从LR-HSI数据中学习空间域的退化,解决了空间变化退化的挑战。我们还建议使用一种基于新型Swin-Transformer的去噪器来学习LD模型的不确定性(均值和方差),并从残差误差中估计退化图像的方差(而不是将它们视为全局标量)。MAP估计算法和观测模块的所有参数都可以通过端到端训练进行联合优化。在合成数据集和真实数据集上的大量实验表明,所提出的方法在客观评价指标和视觉质量方面均优于现有的竞争方法。