Henan Key Laboratory of Big Data Analysis and Processing, School of Computer and Information Engineering, Henan University, Kaifeng 475000, China.
Henan Engineering Laboratory of Spatial Information Processing, Henan University, Kaifeng 475000, China.
Comput Intell Neurosci. 2023 Mar 1;2023:4725986. doi: 10.1155/2023/4725986. eCollection 2023.
Due to the imaging mechanism of hyperspectral images, the spatial resolution of the resulting images is low. An effective method to solve this problem is to fuse the low-resolution hyperspectral image (LR-HSI) with the high-resolution multispectral image (HR-MSI) to generate the high-resolution hyperspectral image (HR-HSI). Currently, the state-of-the-art fusion approach is based on convolutional neural networks (CNN), and few have attempted to use Transformer, which shows impressive performance on advanced vision tasks. In this paper, a simple and efficient hybrid architecture network based on Transformer is proposed to solve the hyperspectral image fusion super-resolution problem. We use the clever combination of convolution and Transformer as the backbone network to fully extract spatial-spectral information by taking advantage of the local and global concerns of both. In order to pay more attention to the information features such as high-frequency information conducive to HR-HSI reconstruction and explore the correlation between spectra, the convolutional attention mechanism is used to further refine the extracted features in spatial and spectral dimensions, respectively. In addition, considering that the resolution of HSI is usually large, we use the feature split module (FSM) to replace the self-attention computation method of the native Transformer to reduce the computational complexity and storage scale of the model and greatly improve the efficiency of model training. Many experiments show that the proposed network architecture achieves the best qualitative and quantitative performance compared with the latest HSI super-resolution methods.
由于高光谱图像的成像机制,得到的图像的空间分辨率较低。解决这个问题的一种有效方法是将低分辨率高光谱图像(LR-HSI)与高分辨率多光谱图像(HR-MSI)融合,以生成高分辨率高光谱图像(HR-HSI)。目前,最先进的融合方法基于卷积神经网络(CNN),很少有人尝试使用 Transformer,它在高级视觉任务中表现出令人印象深刻的性能。在本文中,提出了一种简单而有效的基于 Transformer 的混合架构网络,以解决高光谱图像融合超分辨率问题。我们使用卷积和 Transformer 的巧妙组合作为骨干网络,充分利用两者的局部和全局关注来提取空间-光谱信息。为了更多地关注有利于 HR-HSI 重建的高频信息等信息特征,并探索光谱之间的相关性,使用卷积注意力机制分别在空间和光谱维度上进一步细化提取的特征。此外,考虑到 HSI 的分辨率通常较大,我们使用特征分割模块(FSM)来替换本地 Transformer 的自注意力计算方法,以降低模型的计算复杂度和存储规模,并大大提高模型训练的效率。大量实验表明,与最新的 HSI 超分辨率方法相比,所提出的网络架构在定性和定量性能方面都达到了最佳。