Georgetown University, Washington, DC, USA.
University of Wisconsin, Madison, WI, USA.
Sci Rep. 2024 Nov 7;14(1):27039. doi: 10.1038/s41598-024-78509-w.
Rapid technological advances and growing participation from amateur naturalists have made countless images of insects in their natural habitats available on global web portals. Despite advances in automated species identification, traits like developmental stage or health remain underexplored or manually annotated, with limited focus on automating these features. As a proof-of-concept, we developed a computer vision model utilizing the YOLOv5 algorithm to accurately detect monarch butterfly caterpillars in photographs and classify them into their five developmental stages (instars). The training data were obtained from the iNaturalist portal, and the photographs were first classified and annotated by experts to allow supervised training of models. Our best trained model demonstrates excellent performance on object detection, achieving a mean average precision score of 95% across all five instars. In terms of classification, the YOLOv5l version yielded the best performance, reaching 87% instar classification accuracy for all classes in the test set. Our approach and model show promise in developing detection and classification models for developmental stages for insects, a resource that can be used for large-scale mechanistic studies. These photos hold valuable untapped information, and we've released our annotated collection as an open dataset to support replication and expansion of our methods.
快速的技术进步和业余自然主义者的日益参与,使得无数昆虫在其自然栖息地的图像在全球网络门户上可用。尽管在自动化物种识别方面取得了进展,但发育阶段或健康等特征仍未得到充分探索或手动注释,很少关注这些特征的自动化。作为概念验证,我们开发了一个计算机视觉模型,利用 YOLOv5 算法准确地检测照片中的帝王蝶幼虫,并将它们分类为五个发育阶段(龄期)。训练数据来自 iNaturalist 门户,照片首先由专家进行分类和注释,以允许对模型进行监督训练。我们最好的训练模型在物体检测方面表现出色,在所有五个龄期中的平均精度得分为 95%。在分类方面,YOLOv5l 版本的表现最佳,在测试集中对所有类别的龄期分类准确率达到 87%。我们的方法和模型在开发昆虫发育阶段的检测和分类模型方面具有很大的潜力,这是一个可以用于大规模机制研究的资源。这些照片蕴含着有价值的未开发信息,我们已经发布了经过注释的数据集,以支持我们方法的复制和扩展。