Miao Sheng, Wang Chuanlong, Kong Guangze, Yuan Xiuhe, Shen Xiang, Liu Chao
School of Information and Control Engineering, Qingdao University of Technology, Qingdao, 266520, China.
School of Environmental and Municipal Engineering, Qingdao University of Technology, Qingdao, 266520, China.
Sci Rep. 2024 Dec 28;14(1):31061. doi: 10.1038/s41598-024-82248-3.
This paper presents a deep learning model based on an active learning strategy. The model achieves accurate identification of vegetation types in the study area by utilizing multispectral data obtained from preprocessing of unmanned aerial vehicle (UAV) remote sensing equipment. This approach offers advantages such as high data accuracy, mobility, and easy data collection. In active learning, the minimum confidence scoring method and a sampling technique based on a data pool are employed to reduce labeling costs. The deep learning model incorporates a semantic segmentation gated full fusion module that integrates a dual attention mechanism. This module enhances the capture of detailed texture information, optimally allocates spectral weights, and improves the model's ability to distinguish between similar categories. At a labeling cost of 20%, the average accuracy of the model is 93.2%. Compared with other models, the proposed model achieved the highest classification accuracy in the case of limited training samples. At full annotation cost, the average accuracy is 95.32%, with only a difference of about 2%, but saving 80% of annotation cost. Therefore, active learning strategies can filter out high-value samples that are beneficial for model training, greatly reducing the annotation cost of samples Finally, the recognition results of surface vegetation cover types in the study area are presented, and the model's accuracy is verified through field investigation.
本文提出了一种基于主动学习策略的深度学习模型。该模型通过利用无人机(UAV)遥感设备预处理获得的多光谱数据,实现了对研究区域植被类型的准确识别。这种方法具有数据准确性高、机动性强和数据采集容易等优点。在主动学习中,采用最小置信度评分方法和基于数据池的采样技术来降低标注成本。深度学习模型包含一个语义分割门控全融合模块,该模块集成了双重注意力机制。该模块增强了对详细纹理信息的捕捉,优化了光谱权重分配,并提高了模型区分相似类别的能力。在标注成本为20%时,模型的平均准确率为93.2%。与其他模型相比,在训练样本有限的情况下,所提出的模型实现了最高的分类准确率。在全标注成本下,平均准确率为95.32%,仅相差约2%,但节省了80%的标注成本。因此,主动学习策略可以筛选出有利于模型训练的高价值样本,大大降低样本的标注成本。最后,给出了研究区域地表植被覆盖类型的识别结果,并通过实地调查验证了模型的准确性。