Wang Zhengjue, Chen Bo, Zhang Hao, Liu Hongwei
IEEE Trans Neural Netw Learn Syst. 2022 Feb;33(2):721-735. doi: 10.1109/TNNLS.2020.3028772. Epub 2022 Feb 3.
Due to hardware limitations, it is challenging for sensors to acquire images of high resolution in both spatial and spectral domains, which arouses a trend that utilizing a low-resolution hyperspectral image (LR-HSI) and a high-resolution multispectral image (HR-MSI) to fuse an HR-HSI in an unsupervised manner. Considering the fact that most existing methods are restricted by using linear spectral unmixing, we propose a nonlinear variational probabilistic generative model (NVPGM) for the unsupervised fusion task based on nonlinear unmixing. We model the joint full likelihood of the observed pixels in an LR-HSI and an HR-MSI, both of which are assumed to be generated from the corresponding latent representations, i.e., the abundance vectors. The sufficient statistics of the generative conditional distributions are nonlinear functions with respect to the latent variable, realized by neural networks, which results in a nonlinear spectral mixture model. For scalability and efficiency, we construct two recognition models to infer the latent representations, which are parameterized by neural networks as well. Simultaneously inferring the latent representations and optimizing the parameters are achieved using stochastic gradient variational inference, after which the target HR-HSI is retrieved via feedforward mapping. Though without supervised information about the HR-HSI, NVPGM still can be trained based on extra LR-HSI and HR-MSI data sets in advance unsupervisedly and processes the images at the test phase in real time. Three commonly used data sets are used to evaluate the effectiveness and efficiency of NVPGM, illustrating the outperformance of NVPGM in the unsupervised LR-HSI and HR-MSI fusion task.
由于硬件限制,传感器在空间和光谱域获取高分辨率图像具有挑战性,这引发了一种利用低分辨率高光谱图像(LR-HSI)和高分辨率多光谱图像(HR-MSI)以无监督方式融合高分辨率高光谱图像(HR-HSI)的趋势。考虑到大多数现有方法受限于使用线性光谱解混,我们基于非线性解混提出了一种用于无监督融合任务的非线性变分概率生成模型(NVPGM)。我们对LR-HSI和HR-MSI中观测像素的联合全似然进行建模,假设这两者均由相应的潜在表示(即丰度向量)生成。生成条件分布的充分统计量是关于潜在变量的非线性函数,通过神经网络实现,这导致了一个非线性光谱混合模型。为了实现可扩展性和效率,我们构建了两个识别模型来推断潜在表示,它们也由神经网络进行参数化。使用随机梯度变分推理同时推断潜在表示并优化参数,之后通过前馈映射检索目标HR-HSI。尽管没有关于HR-HSI的监督信息,但NVPGM仍可预先基于额外的LR-HSI和HR-MSI数据集进行无监督训练,并在测试阶段实时处理图像。使用三个常用数据集来评估NVPGM的有效性和效率,表明NVPGM在无监督LR-HSI和HR-MSI融合任务中的优越性。