Marais G Christopher, Stratton Isabelle C, Johnson Andrew J, Hulcr Jiri
School of Forest, Fisheries, and Geomatic Sciences, University of Florida, Gainesville, Florida, United States of America.
PLoS One. 2025 Jun 5;20(6):e0310716. doi: 10.1371/journal.pone.0310716. eCollection 2025.
This study presents an initial model for bark beetle identification, serving as a foundational step toward developing a fully functional and practical identification tool. Bark beetles are known for extensive damage to forests globally, as well as for uniform and homoplastic morphology which poses identification challenges. Utilizing a MaxViT-based deep learning backbone which utilizes local and global attention to classify bark beetles down to the genus level from images containing multiple beetles. The methodology involves a process of image collection, preparation, and model training, leveraging pre-classified beetle species to ensure accuracy and reliability. The model's F1 score estimates of 0.99 and 1.0 indicates a strong ability to accurately classify genera in the collected data, including those previously unknown to the model. This makes it a valuable first step towards building a tool for applications in forest management and ecological research. While the current model distinguishes among 12 genera, further refinement and additional data will be necessary to achieve reliable species-level identification, which is particularly important for detecting new invasive species. Despite the controlled conditions of image collection and potential challenges in real-world application, this study provides the first model capable of identifying the bark beetle genera, and by far the largest training set of images for any comparable insect group. We also designed a function that reports if a species appears to be unknown. Further research is suggested to enhance the model's generalization capabilities and scalability, emphasizing the integration of advanced machine learning techniques for improved species classification and the detection of invasive or undescribed species.
本研究提出了一种用于树皮甲虫识别的初始模型,这是朝着开发一个功能完备且实用的识别工具迈出的基础一步。树皮甲虫因对全球森林造成广泛破坏而闻名,同时其形态单一且趋同,这给识别工作带来了挑战。利用基于MaxViT的深度学习主干,该主干利用局部和全局注意力从包含多个甲虫的图像中将树皮甲虫分类到属级水平。该方法包括图像收集、准备和模型训练过程,利用预先分类的甲虫物种来确保准确性和可靠性。该模型的F1分数估计值为0.99和1.0,表明其在收集的数据中准确分类属的能力很强,包括那些模型之前未知的属。这使其成为朝着构建一个用于森林管理和生态研究应用工具迈出的有价值的第一步。虽然当前模型能够区分12个属,但要实现可靠的物种水平识别还需要进一步优化和更多数据,这对于检测新的入侵物种尤为重要。尽管图像收集条件受到控制且实际应用中可能存在挑战,但本研究提供了第一个能够识别树皮甲虫属的模型,也是迄今为止任何可比昆虫群体中最大的图像训练集。我们还设计了一个功能,用于报告某个物种是否看起来未知。建议进一步开展研究以提高模型的泛化能力和可扩展性,强调整合先进的机器学习技术以改进物种分类以及检测入侵或未描述的物种。