Wu Yunlong, Tang Lingdi, Yuan Shouqi
Research Center of Fluid Machinery Engineering and Technology, Jiangsu University, Zhenjiang, 212013, China.
Data and Informatization Department, Jiangsu University, Zhenjiang, 212013, China.
Sci Rep. 2025 Apr 28;15(1):14751. doi: 10.1038/s41598-025-98449-3.
In complex farmland environments, wheat canopy coverage is insufficient at the tillering stage, posing a considerable challenge to the accurate extraction of its canopy using UAV(unmanned air vehicle) remote sensing images. In this paper, an end-to-end semantic segmentation method based on visible light (RGB) and thermal infrared (TIR) images, Tiff-SegFormer, which fused spectral features and temperature features effectively, is proposed for accurate pixel-level classification of winter wheat at the tillering stage images taken by UAV to segment the wheat canopy and background. Tiff-SegFormer utilizes hierarchical feature representation and efficient self-attention in the encoder stage to extract features of detail contours of RGB images and temperature changes of TIFF images, respectively. In the decoder stage, the features are concatenated and then the channel and spatial attention mechanisms are superimposed, aiming to further improve the segmentation accuracy and efficiency of winter wheat at the tillering stage in UAV remote sensing images. The results show that Tiff-SegFormer can achieve accurate segmentation of wheat canopy and background from UAV images of winter wheat at the tillering stage (mIoU = 84.28%, mPA = 88.97%, accuracy = 94.55%). In order to verify the efficiency of the proposed method, Tiff-SegFormer is compared with four widely used semantic segmentation methods, all of which show better performance. The four methods are UNet, DeepLabv3+, HRNet, SegFormer and four-channel (RGB + TIFF) Segformer. The generalization test shows that the proposed Tiff-SegFormer also achieves better performance than other comparison methods (mIoU = 84.94%, mPA = 91.46%, accuracy = 94.71%). Tiff-SegFormer provides a robust and efficient tool for segmenting winter wheat canopy from UAV remote sensing images of winter wheat at the tillering stage, and has great potential in applications (model implementation and results can be found at https://github.com/wylSUGAR/Tiff-SegFormer ).
在复杂的农田环境中,小麦在分蘖期的冠层覆盖率不足,这给利用无人机遥感图像准确提取其冠层带来了相当大的挑战。本文提出了一种基于可见光(RGB)和热红外(TIR)图像的端到端语义分割方法Tiff-SegFormer,该方法有效地融合了光谱特征和温度特征,用于对无人机拍摄的冬小麦分蘖期图像进行精确的像素级分类,以分割小麦冠层和背景。Tiff-SegFormer在编码器阶段利用分层特征表示和高效的自注意力机制,分别提取RGB图像的细节轮廓特征和TIFF图像的温度变化特征。在解码器阶段,将这些特征进行拼接,然后叠加通道和空间注意力机制,旨在进一步提高无人机遥感图像中冬小麦分蘖期的分割精度和效率。结果表明,Tiff-SegFormer能够从冬小麦分蘖期的无人机图像中准确分割出小麦冠层和背景(平均交并比mIoU = 84.28%,平均像素精度mPA = 88.97%,准确率accuracy = 94.55%)。为了验证所提方法的效率,将Tiff-SegFormer与四种广泛使用的语义分割方法进行了比较,结果显示Tiff-SegFormer均表现出更好的性能。这四种方法分别是UNet、DeepLabv3+、HRNet、SegFormer以及四通道(RGB + TIFF)Segformer。泛化测试表明,所提的Tiff-SegFormer也比其他比较方法具有更好的性能(mIoU = 84.94%,mPA = 91.46%,accuracy = 94.71%)。Tiff-SegFormer为从冬小麦分蘖期的无人机遥感图像中分割冬小麦冠层提供了一个强大而高效的工具,并且在应用中具有巨大潜力(模型实现和结果可在https://github.com/wylSUGAR/Tiff-SegFormer上获取)。