用于高效白细胞分类的融入领域知识的预训练深度学习模型。

Domain knowledge-infused pre-trained deep learning models for efficient white blood cell classification.

作者信息

Jeneessha P, Balasubramanian Vinoth Kumar

机构信息

Department of Information Technology, PSG College of Technology, Coimbatore, 641004, India.

出版信息

Sci Rep. 2025 May 4;15(1):15608. doi: 10.1038/s41598-025-00563-9.

DOI:10.1038/s41598-025-00563-9

PMID:40320432

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12050290/

Abstract

White blood cell (WBC) classification is a crucial step in assessing a patient's health and validating medical treatment in the medical domain. Hence, efficient computer vision solutions to the classification of WBC will be an effective aid to medical practitioners. Computer-aided diagnosis (CAD) reduces manual intervention, avoids errors, speeds up medical analysis, and provides accurate medical reports. Though a lot of research has been taken up to develop deep learning models for efficient classification of WBCs, there is still scope for improvement to support the data insufficiency issue in medical data sets. Data augmentation and normalization techniques increase the quantity of data but don't enhance the quality of the data. Hence, deep learning models though performing well can still be made efficient and effective when quality data is fused along with the available image dataset. This paper aims to utilize domain knowledge and image data to improve the classification performance of pre-trained models namely Inception V3, DenseNet 121, ResNet 50, MobileNet V2, and VGG 16. The models performance, with and without domain knowledge infused, is analyzed on the BCCD and LISC datasets. On the BCCD dataset, the average accuracies increased from 82.7%, 98.8%, 98.38%, 98.56%, and 98.5%-99.38%, 99.05%, 99.05%, 98.67%, and 98.75% for Inception V3, DenseNet 121, ResNet 50, MobileNet V2, and VGG 16, respectively. Similarly, on the LISC dataset, the accuracies improved from 86.76%, 92.2%, 91.76%, 92.8%, and 94.4%-92.05%, 95.88%, 95.58%, 95.2%, and 95.2%, respectively.

摘要

白细胞（WBC）分类是医学领域评估患者健康状况和验证治疗效果的关键步骤。因此，针对白细胞分类的高效计算机视觉解决方案将对医学从业者提供有效的帮助。计算机辅助诊断（CAD）减少了人工干预，避免了错误，加快了医学分析速度，并提供准确的医学报告。尽管已经开展了大量研究来开发用于白细胞高效分类的深度学习模型，但在支持医学数据集中的数据不足问题方面仍有改进空间。数据增强和归一化技术增加了数据量，但并未提高数据质量。因此，尽管深度学习模型表现良好，但当将高质量数据与可用图像数据集融合时，仍可使其更高效和有效。本文旨在利用领域知识和图像数据来提高预训练模型（即Inception V3、DenseNet 121、ResNet 50、MobileNet V2和VGG 16）的分类性能。在BCCD和LISC数据集上分析了注入和未注入领域知识时模型的性能。在BCCD数据集上，Inception V3、DenseNet 121、ResNet 50、MobileNet V2和VGG 16的平均准确率分别从82.7%、98.8%、98.38%、98.56%和98.5%提高到99.38%、99.05%、99.05%、98.67%和98.75%。同样，在LISC数据集上，准确率分别从86.76%、92.2%、91.76%、92.8%和94.4%提高到92.05%、95.88%、95.58%、95.2%和95.2%。