Tang Zhaohui, Xuan Chuanzhong, Zhang Tao, Gao Xinyu, Liu Suhui, Zhang Mengqin
College of Mechanical and Electrical Engineering, Inner Mongolia Agricultural University, No. 306 Zhaowuda Road, Saihan District, Hohhot, 010018, Inner Mongolia, China.
Inner Mongolia Engineering Research Center for Intelligent Facilities in Prataculture and Livestock Breeding, No. 306 Zhaowuda Road, Saihan District, Hohhot, 010018, Inner Mongolia, China.
Sci Rep. 2025 Aug 21;15(1):30678. doi: 10.1038/s41598-025-15566-9.
The biodiversity function of the desert steppe ecosystem faces many challenges under the pressure of climate change and human activities. Accurate and efficient assessment of plant diversity is critical for guiding desert steppe restoration efforts. However, desert steppe vegetation has sparse leaves and sparse distribution. It is difficult to accurately distinguish micro-vegetation types based on a single spectrum, vegetation index or texture feature, and the resolution of satellite remote sensing cannot meet the needs of high-precision diversity assessment. To this end, this study proposed a novel method for assessing plant diversity index in degraded desert grassland based on multimodal UAV hyperspectral data and Encoder-CNN. Through experiments on different modal feature combinations, spatial spectra, vegetation indices and texture features were targeted and fused. Channel Attention Fusion (CAF) was introduced into Encoder to achieve cross-layer "soft" residual fusion, the Encoder and CNN models were fused to construct a global-local co-expression structure, and finally the quantitative calculation of the plant diversity index at the pixel level was realized. The results show that the vegetation types determined by the fusion of multimodal data and deep learning are consistent with the existing species, dominant species and sub-dominant species of the actual community, and the calculated diversity index results are also consistent with the actual situation. The use of multimodal data combining spatial spectral features with index features, combined with the Encode-CNN model, can provide the most accurate information on community composition. The overall accuracy of sparse vegetation classification can reach 90.01%, and the average accuracy can reach 85.23%, which is better than single mode or traditional 3DCNN, VIT models. This study demonstrates the application potential of UAV hyperspectral multimodal technology and deep learning in the assessment of desert steppe plant diversity, providing important technical support for ecological protection and conservation.
在气候变化和人类活动的压力下,荒漠草原生态系统的生物多样性功能面临诸多挑战。准确、高效地评估植物多样性对于指导荒漠草原恢复工作至关重要。然而,荒漠草原植被叶片稀疏且分布零散。基于单一光谱、植被指数或纹理特征难以准确区分微观植被类型,且卫星遥感分辨率无法满足高精度多样性评估的需求。为此,本研究提出了一种基于多模态无人机高光谱数据和编码器卷积神经网络(Encoder-CNN)的退化荒漠草原植物多样性指数评估新方法。通过对不同模态特征组合进行实验,针对空间光谱、植被指数和纹理特征进行融合。将通道注意力融合(CAF)引入编码器以实现跨层“软”残差融合,融合编码器和卷积神经网络模型构建全局-局部共表达结构,最终实现像素级植物多样性指数的定量计算。结果表明,多模态数据与深度学习融合确定的植被类型与实际群落的现有物种、优势物种和亚优势物种一致,计算得到的多样性指数结果也与实际情况相符。利用结合空间光谱特征与指数特征的多模态数据,结合Encode-CNN模型,能够提供关于群落组成的最准确信息。稀疏植被分类的总体准确率可达90.01%,平均准确率可达85.23%,优于单模态或传统的3DCNN、VIT模型。本研究证明了无人机高光谱多模态技术和深度学习在荒漠草原植物多样性评估中的应用潜力,为生态保护和保育提供了重要技术支持。