Suppr超能文献

一种基于可解释视觉Transformer模型的白细胞分类与定位

An Explainable Vision Transformer Model Based White Blood Cells Classification and Localization.

作者信息

Katar Oguzhan, Yildirim Ozal

机构信息

Department of Software Engineering, Firat University, Elazig 23119, Turkey.

Department of Artificial Intelligence and Data Engineering, Firat University, Elazig 23119, Turkey.

出版信息

Diagnostics (Basel). 2023 Jul 24;13(14):2459. doi: 10.3390/diagnostics13142459.

Abstract

White blood cells (WBCs) are crucial components of the immune system that play a vital role in defending the body against infections and diseases. The identification of WBCs subtypes is useful in the detection of various diseases, such as infections, leukemia, and other hematological malignancies. The manual screening of blood films is time-consuming and subjective, leading to inconsistencies and errors. Convolutional neural networks (CNN)-based models can automate such classification processes, but are incapable of capturing long-range dependencies and global context. This paper proposes an explainable Vision Transformer (ViT) model for automatic WBCs detection from blood films. The proposed model uses a self-attention mechanism to extract features from input images. Our proposed model was trained and validated on a public dataset of 16,633 samples containing five different types of WBCs. As a result of experiments on the classification of five different types of WBCs, our model achieved an accuracy of 99.40%. Moreover, the model's examination of misclassified test samples revealed a correlation between incorrect predictions and the presence or absence of granules in the cell samples. To validate this observation, we divided the dataset into two classes, Granulocytes and Agranulocytes, and conducted a secondary training process. The resulting ViT model, trained for binary classification, achieved impressive performance metrics during the test phase, including an accuracy of 99.70%, recall of 99.54%, precision of 99.32%, and F-1 score of 99.43%. To ensure the reliability of the ViT model's, we employed the Score-CAM algorithm to visualize the pixel areas on which the model focuses during its predictions. Our proposed method is suitable for clinical use due to its explainable structure as well as its superior performance compared to similar studies in the literature. The classification and localization of WBCs with this model can facilitate the detection and reporting process for the pathologist.

摘要

白细胞(WBCs)是免疫系统的关键组成部分,在保护身体免受感染和疾病侵害方面发挥着至关重要的作用。白细胞亚型的识别有助于检测各种疾病,如感染、白血病和其他血液系统恶性肿瘤。人工筛查血涂片既耗时又主观,会导致不一致和错误。基于卷积神经网络(CNN)的模型可以使此类分类过程自动化,但无法捕捉长距离依赖性和全局上下文。本文提出了一种可解释的视觉Transformer(ViT)模型,用于从血涂片中自动检测白细胞。所提出的模型使用自注意力机制从输入图像中提取特征。我们提出的模型在一个包含16633个样本的公共数据集上进行了训练和验证,该数据集包含五种不同类型的白细胞。通过对五种不同类型白细胞进行分类实验,我们的模型准确率达到了99.40%。此外,该模型对误分类测试样本的检查揭示了错误预测与细胞样本中颗粒的存在与否之间的相关性。为了验证这一观察结果,我们将数据集分为粒细胞和无粒细胞两类,并进行了二次训练过程。经过二元分类训练的最终ViT模型在测试阶段取得了令人印象深刻的性能指标,包括准确率99.70%、召回率99.54%·、精确率99.32%和F1分数99.43%。为了确保ViT模型的可靠性,我们采用Score-CAM算法来可视化模型在预测过程中关注的像素区域。我们提出的方法具有可解释的结构,并且与文献中的类似研究相比具有卓越的性能,因此适用于临床应用。使用该模型对白细胞进行分类和定位可以促进病理学家的检测和报告过程。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a1c/10378025/28ac0345880d/diagnostics-13-02459-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验