Egusquiza Itziar, Picon Artzai, Irusta Unai, Bereciartua-Perez Arantza, Eggers Till, Klukas Christian, Aramendi Elisabete, Navarra-Mestre Ramon
TECNALIA, Basque Research and Technology Alliance (BRTA), Parque Tecnológico de Bizkaia, Derio, Spain.
University of the Basque Country, Bilbao, Spain.
Front Plant Sci. 2022 Mar 7;13:813237. doi: 10.3389/fpls.2022.813237. eCollection 2022.
Plant fungal diseases are one of the most important causes of crop yield losses. Therefore, plant disease identification algorithms have been seen as a useful tool to detect them at early stages to mitigate their effects. Although deep-learning based algorithms can achieve high detection accuracies, they require large and manually annotated image datasets that is not always accessible, specially for rare and new diseases. This study focuses on the development of a plant disease detection algorithm and strategy requiring few plant images (Few-shot learning algorithm). We extend previous work by using a novel challenging dataset containing more than 100,000 images. This dataset includes images of leaves, panicles and stems of five different crops (barley, corn, rape seed, rice, and wheat) for a total of 17 different diseases, where each disease is shown at different disease stages. In this study, we propose a deep metric learning based method to extract latent space representations from plant diseases with just few images by means of a Siamese network and triplet loss function. This enhances previous methods that require a support dataset containing a high number of annotated images to perform metric learning and few-shot classification. The proposed method was compared over a traditional network that was trained with the cross-entropy loss function. Exhaustive experiments have been performed for validating and measuring the benefits of metric learning techniques over classical methods. Results show that the features extracted by the metric learning based approach present better discriminative and clustering properties. Davis-Bouldin index and Silhouette score values have shown that triplet loss network improves the clustering properties with respect to the categorical-cross entropy loss. Overall, triplet loss approach improves the DB index value by 22.7% and Silhouette score value by 166.7% compared to the categorical cross-entropy loss model. Moreover, the F-score parameter obtained from the Siamese network with the triplet loss performs better than classical approaches when there are few images for training, obtaining a 6% improvement in the F-score mean value. Siamese networks with triplet loss have improved the ability to learn different plant diseases using few images of each class. These networks based on metric learning techniques improve clustering and classification results over traditional categorical cross-entropy loss networks for plant disease identification.
植物真菌病害是造成作物产量损失的最重要原因之一。因此,植物病害识别算法被视为在早期阶段检测病害以减轻其影响的有用工具。尽管基于深度学习的算法可以实现较高的检测准确率,但它们需要大量且经过人工标注的图像数据集,而这些数据集并非总是能够获取到,特别是对于罕见和新出现的病害。本研究专注于开发一种只需少量植物图像的植物病害检测算法和策略(少样本学习算法)。我们通过使用一个包含超过10万张图像的全新且具有挑战性的数据集扩展了先前的工作。该数据集包括五种不同作物(大麦、玉米、油菜籽、水稻和小麦)的叶片、穗和茎的图像,共计17种不同病害,每种病害在不同病害阶段展示。在本研究中,我们提出一种基于深度度量学习的方法,通过连体网络和三元组损失函数,仅用少量图像从植物病害中提取潜在空间表示。这改进了先前的方法,那些方法需要一个包含大量标注图像的支持数据集来执行度量学习和少样本分类。将所提出的方法与使用交叉熵损失函数训练的传统网络进行了比较。已经进行了详尽的实验来验证和衡量度量学习技术相对于经典方法的优势。结果表明,基于度量学习的方法提取的特征具有更好的判别性和聚类特性。戴维斯 - 布尔丁指数和轮廓系数值表明,相对于分类交叉熵损失,三元组损失网络改善了聚类特性。总体而言,与分类交叉熵损失模型相比,三元组损失方法使戴维斯 - 布尔丁指数值提高了22.7%,轮廓系数值提高了166.7%。此外,当用于训练的图像较少时,表示使用三元组损失的连体网络获得的F分数参数比经典方法表现更好,F分数平均值提高了6%。带有三元组损失的连体网络提高了使用每个类别的少量图像学习不同植物病害的能力。这些基于度量学习技术的网络在植物病害识别方面比传统的分类交叉熵损失网络改进了聚类和分类结果。