Department of Biology and Ecology, Faculty of Science, University of Kragujevac, Radoja Domanovića 12, 34000 Kragujevac, Serbia.
Faculty of Electronic Engineering, University of Niš, Aleksandra Medvedeva 14, 18000 Niš, Serbia.
Sci Total Environ. 2024 Jul 20;935:172877. doi: 10.1016/j.scitotenv.2024.172877. Epub 2024 May 11.
Deep learning techniques have recently found application in biodiversity research. Mayflies (Ephemeroptera), stoneflies (Plecoptera) and caddisflies (Trichoptera), often abbreviated as EPT, are frequently used for freshwater biomonitoring due to their large numbers and sensitivity to environmental changes. However, the morphological identification of EPT species is a challenging but fundamental task. Morphological identification of these freshwater insects is therefore not only extremely time-consuming and costly, but also often leads to misjudgments or generates datasets with low taxonomic resolution. Here, we investigated the application of deep learning to increase the efficiency and taxonomic resolution of biomonitoring programs. Our database contains 90 EPT taxa (genus or species level), with the number of images per category ranging from 21 to 300 (16,650 in total). Upon completion of training, a CNN (Convolutional Neural Network) model was created, capable of automatically classifying these taxa into their appropriate taxonomic categories with an accuracy of 98.7 %. Our model achieved a perfect classification rate of 100 % for 68 of the taxa in our dataset. We achieved noteworthy classification accuracy with morphologically closely related taxa within the training data (e.g., species of the genus Baetis, Hydropsyche, Perla). Gradient-weighted Class Activation Mapping (Grad-CAM) visualized the morphological features responsible for the classification of the treated species in the CNN models. Within Ephemeroptera, the head was the most important feature, while the thorax and abdomen were equally important for the classification of Plecoptera taxa. For the order Trichoptera, the head and thorax were almost equally important. Our database is recognized as the most extensive aquatic insect database, notably distinguished by its wealth of included categories (taxa). Our approach can help solve long-standing challenges in biodiversity research and address pressing issues in monitoring programs by saving time in sample identification.
深度学习技术最近在生物多样性研究中得到了应用。蜉蝣目(Ephemeroptera)、石蝇目(Plecoptera)和毛翅目(Trichoptera),通常缩写为 EPT,由于数量众多且对环境变化敏感,常用于淡水生物监测。然而,EPT 物种的形态鉴定是一项具有挑战性但基本的任务。因此,这些淡水昆虫的形态鉴定不仅极其耗时且昂贵,而且还常常导致误判或生成分类分辨率较低的数据集。在这里,我们研究了深度学习在提高生物监测计划效率和分类分辨率方面的应用。我们的数据库包含 90 个 EPT 类群(属或种级),每个类别中的图像数量从 21 到 300 不等(总计 16650 张)。在完成训练后,创建了一个 CNN(卷积神经网络)模型,能够自动将这些类群分类到适当的分类类别中,准确率为 98.7%。我们的模型在数据集中的 68 个类群中达到了 100%的完美分类率。我们在训练数据中形态上密切相关的类群中实现了值得注意的分类准确性(例如,Baetis、Hydropsyche、Perla 属的物种)。梯度加权类激活映射(Grad-CAM)可视化了负责在 CNN 模型中对处理物种进行分类的形态特征。在蜉蝣目中,头部是最重要的特征,而胸部和腹部对 Plecoptera 类群的分类同样重要。对于 Trichoptera 目,头部和胸部几乎同样重要。我们的数据库被认为是最广泛的水生昆虫数据库,其显著特点是包含的类别(类群)丰富。我们的方法可以通过节省样本鉴定时间来帮助解决生物多样性研究中的长期挑战,并解决监测计划中的紧迫问题。