Wang Bin, Chen Gongchao, Wen Juan, Li Linfang, Jin Songlin, Li Yan, Zhou Ling, Zhang Weidong
School of Life Sciences, Henan Institute of Science and Technology, Xinxiang, China.
School of Information Engineering, Henan Institute of Science and Technology, Xinxiang, China.
Front Plant Sci. 2025 Jan 16;15:1458978. doi: 10.3389/fpls.2024.1458978. eCollection 2024.
Hyperspectral images are rich in spectral and spatial information, providing a detailed and comprehensive description of objects, which makes hyperspectral image analysis technology essential in intelligent agriculture. With various corn seed varieties exhibiting significant internal structural differences, accurate classification is crucial for planting, monitoring, and consumption. However, due to the large volume and complex features of hyperspectral corn image data, existing methods often fall short in feature extraction and utilization, leading to low classification accuracy. To address these issues, this paper proposes a spectral-spatial attention transformer network (SSATNet) for hyperspectral corn image classification. Specifically, SSATNet utilizes 3D and 2D convolutions to effectively extract local spatial, spectral, and textural features from the data while incorporating spectral and spatial morphological structures to understand the internal structure of the data better. Additionally, a transformer encoder with cross-attention extracts and refines feature information from a global perspective. Finally, a classifier generates the prediction results. Compared to existing state-of-the-art classification methods, our model performs better on the hyperspectral corn image dataset, demonstrating its effectiveness.
高光谱图像富含光谱和空间信息,能对物体进行详细而全面的描述,这使得高光谱图像分析技术在智能农业中至关重要。由于各种玉米种子品种在内部结构上存在显著差异,准确分类对于种植、监测和消费至关重要。然而,由于高光谱玉米图像数据量庞大且特征复杂,现有方法在特征提取和利用方面往往存在不足,导致分类准确率较低。为了解决这些问题,本文提出了一种用于高光谱玉米图像分类的光谱-空间注意力Transformer网络(SSATNet)。具体而言,SSATNet利用3D和2D卷积有效地从数据中提取局部空间、光谱和纹理特征,同时纳入光谱和空间形态结构以更好地理解数据的内部结构。此外,带有交叉注意力的Transformer编码器从全局角度提取和细化特征信息。最后,分类器生成预测结果。与现有的先进分类方法相比,我们的模型在高光谱玉米图像数据集上表现更好,证明了其有效性。