Wang Hu, Quan Sixiang, Liu Jun, Xiao Hai, Peng Yingying, Wang Zhihui, Li Huali
School of Informatics, Hunan University of Chinese Medicine, Changsha, 410208, China.
AI TCM Lab Hunan, Changsha, 410208, China.
Sci Rep. 2025 Aug 11;15(1):29288. doi: 10.1038/s41598-025-14844-w.
In recent years, due to the unique spatial-spectral characteristics of hyperspectral images, they have played a crucial role in many fields. The effective extraction of features using deep neural networks, followed by the design of efficient and high-precision network algorithm structures, has gradually become a research hotspot. Hyperspectral images are difficult to obtain and have limited samples. Although hyperspectral image classification methods based on convolutional neural networks (CNN) have noticeably improved performance, there are still certain shortcomings in the extraction of detailed and local features. Therefore, how to fully utilize spatial and spectral information in situations with limited samples has become a challenging problem. To address this issue, inspired by the PID controller, this paper proposes a Progressive Multi-Scale Multi-Attention Fusion (PMMF) network structure that simultaneously extracts features from the Proportional (P), Integral (I), and Derivative (D) branches. The complementary responsibilities of the three branches address the issue of feature loss in details and improve the network's learning efficiency across feature maps of different scales. By cleverly extracting features from different branches multiple times, the fusion of multi-scale features is achieved, avoiding the limitations of single-scale feature representation. The proposed multi-attention fusion module applies the most suitable attention mechanism according to the representation form of each branch, fully extracting features from each branch, enriching the information contained in the feature maps, and greatly enhancing the classification accuracy of hyperspectral images.
近年来,由于高光谱图像独特的空间光谱特性,它们在许多领域发挥了关键作用。利用深度神经网络有效提取特征,进而设计高效高精度的网络算法结构,逐渐成为研究热点。高光谱图像难以获取且样本有限。尽管基于卷积神经网络(CNN)的高光谱图像分类方法性能有显著提升,但在详细和局部特征提取方面仍存在一定不足。因此,如何在样本有限的情况下充分利用空间和光谱信息已成为一个具有挑战性的问题。为解决这一问题,受PID控制器启发,本文提出一种渐进式多尺度多注意力融合(PMMF)网络结构,该结构同时从比例(P)、积分(I)和微分(D)分支提取特征。三个分支的互补作用解决了细节特征丢失问题,提高了网络在不同尺度特征图上的学习效率。通过多次巧妙地从不同分支提取特征,实现了多尺度特征融合,避免了单尺度特征表示的局限性。所提出的多注意力融合模块根据每个分支的表示形式应用最合适的注意力机制,充分从每个分支提取特征,丰富特征图中包含的信息,大大提高了高光谱图像的分类精度。