Suppr超能文献

通过自监督视觉变换器增强蚊媒物种的分类

Enhance fashion classification of mosquito vector species via self-supervised vision transformer.

作者信息

Kittichai Veerayuth, Kaewthamasorn Morakot, Chaiphongpachara Tanawat, Laojun Sedthapong, Saiwichai Tawee, Naing Kaung Myat, Tongloy Teerawat, Boonsang Siridech, Chuwongin Santhad

机构信息

Faculty of Medicine, King Mongkut's Institute of Technology Ladkrabang, Bangkok, Thailand.

Veterinary Parasitology Research Unit, Faculty of Veterinary Science, Chulalongkorn University, Bangkok, Thailand.

出版信息

Sci Rep. 2024 Dec 28;14(1):31517. doi: 10.1038/s41598-024-83358-8.

Abstract

Vector-borne diseases pose a major worldwide health concern, impacting more than 1 billion people globally. Among various blood-feeding arthropods, mosquitoes stand out as the primary carriers of diseases significant in both medical and veterinary fields. Hence, comprehending their distinct role fulfilled by different mosquito types is crucial for efficiently addressing and enhancing control measures against mosquito-transmitted diseases. The conventional method for identifying mosquito species is laborious and requires significant effort to learn. Classification is subsequently carried out by skilled laboratory personnel, rendering the process inherently time-intensive and restricting the task to entomology specialists. Therefore, integrating artificial intelligence with standard taxonomy, such as molecular techniques, is essential for accurate mosquito species identification. Advancement in novel tools with artificial intelligence has challenged the task of developing an automated system for sample collection and identification. This study aims to introduce a self-supervised Vision Transformer supporting an automatic model for classifying mosquitoes found across various regions of Thailand. The objective is to utilize self-distillation with unlabeled data (DINOv2) to develop models on a mobile phone-captured dataset containing 16 species of female mosquitoes, including those known for transmitting malaria and dengue. The DINOv2 model surpassed the ViT baseline model in precision and recall for all mosquito species. When compared on a species-specific level, utilizing the DINOv2 model resulted in reductions in false negatives and false positives, along with enhancements in precision and recall values, in contrast to the baseline model, across all mosquito species. Notably, at least 10 classes exhibited outstanding performance, achieving above precision and recall rates exceeding 90%. Remarkably, when applying cropping techniques to the dataset instead of utilizing the original photographs, there was a significant improvement in performance across all DINOv2 models studied. This is demonstrated by an increase in recall to 87.86%, precision to 91.71%, F1 score to 88.71%, and accuracy to 98.45%, respectively. Malaria mosquito species can be easily distinguished from another genus like Aedes, Mansonia, Armigeres, and Culex, respectively. While classifying malaria vector species presented challenges for the DINOv2 model, utilizing the cropped images enhanced precision by up to 96% for identifying one of the top three malaria vectors in Thailand, Anopheles minimus. A proficiently trained DINOv2 model, coupled with effective data management, can contribute to the development of a mobile phone application. Furthermore, this method shows promise in supporting field professionals who are not entomology experts in effectively addressing pathogens responsible for diseases transmitted by female mosquitoes.

摘要

媒介传播疾病是全球主要的健康问题,影响着全球超过10亿人。在各种吸血节肢动物中,蚊子是医学和兽医领域重要疾病的主要传播者。因此,了解不同类型蚊子所发挥的独特作用对于有效应对和加强针对蚊媒疾病的控制措施至关重要。传统的蚊子种类鉴定方法费力且需要花费大量精力去学习。随后由熟练的实验室人员进行分类,这使得该过程本质上耗时较长,并且将这项任务局限于昆虫学专家。因此,将人工智能与标准分类学(如分子技术)相结合对于准确鉴定蚊子种类至关重要。人工智能新型工具的发展对开发样本采集和鉴定自动化系统的任务提出了挑战。本研究旨在引入一种自监督视觉Transformer,以支持对泰国不同地区发现的蚊子进行分类的自动模型。目标是利用无标签数据的自蒸馏(DINOv2)在包含16种雌蚊的手机拍摄数据集上开发模型,这些雌蚊包括以传播疟疾和登革热而闻名的种类。DINOv2模型在所有蚊子种类的精度和召回率方面超过了ViT基线模型。在特定种类水平上进行比较时,与基线模型相比,使用DINOv2模型减少了假阴性和假阳性,同时提高了精度和召回率值。值得注意的是,至少有10个类别表现出色,精度和召回率超过90%。显著的是,当对数据集应用裁剪技术而不是使用原始照片时,在所研究的所有DINOv2模型中性能都有显著提高。这分别表现为召回率提高到87.86%、精度提高到91.71%、F1分数提高到88.71%以及准确率提高到98.45%。疟蚊种类可以很容易地分别与伊蚊、曼蚊、阿蚊和库蚊等其他属区分开来。虽然对疟疾病媒种类进行分类对DINOv2模型来说存在挑战,但利用裁剪后的图像在识别泰国三大疟疾病媒之一微小按蚊时精度提高了高达96%。一个训练有素的DINOv2模型,再加上有效的数据管理,有助于开发手机应用程序。此外,这种方法有望支持非昆虫学专家的现场专业人员有效应对由雌蚊传播疾病的病原体。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52f7/11682170/83cd67a1ab48/41598_2024_83358_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验