Ahmed Ibrahim Abdulrab, Senan Ebrahim Mohammed, Shatnawi Hamzeh Salameh Ahmad, Alkhraisha Ziad Mohammad, Al-Azzam Mamoun Mohammad Ali
Computer Department, Applied College, Najran University, Najran 66462, Saudi Arabia.
Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, Alrazi University, Sana'a, Yemen.
Diagnostics (Basel). 2023 Mar 8;13(6):1026. doi: 10.3390/diagnostics13061026.
Acute lymphoblastic leukemia (ALL) is one of the deadliest forms of leukemia due to the bone marrow producing many white blood cells (WBC). ALL is one of the most common types of cancer in children and adults. Doctors determine the treatment of leukemia according to its stages and its spread in the body. Doctors rely on analyzing blood samples under a microscope. Pathologists face challenges, such as the similarity between infected and normal WBC in the early stages. Manual diagnosis is prone to errors, differences of opinion, and the lack of experienced pathologists compared to the number of patients. Thus, computer-assisted systems play an essential role in assisting pathologists in the early detection of ALL. In this study, systems with high efficiency and high accuracy were developed to analyze the images of C-NMC 2019 and ALL-IDB2 datasets. In all proposed systems, blood micrographs were improved and then fed to the active contour method to extract WBC-only regions for further analysis by three CNN models (DenseNet121, ResNet50, and MobileNet). The first strategy for analyzing ALL images of the two datasets is the hybrid technique of CNN-RF and CNN-XGBoost. DenseNet121, ResNet50, and MobileNet models extract deep feature maps. CNN models produce high features with redundant and non-significant features. So, CNN deep feature maps were fed to the Principal Component Analysis (PCA) method to select highly representative features and sent to RF and XGBoost classifiers for classification due to the high similarity between infected and normal WBC in early stages. Thus, the strategy for analyzing ALL images using serially fused features of CNN models. The deep feature maps of DenseNet121-ResNet50, ResNet50-MobileNet, DenseNet121-MobileNet, and DenseNet121-ResNet50-MobileNet were merged and then classified by RF classifiers and XGBoost. The RF classifier with fused features for DenseNet121-ResNet50-MobileNet reached an AUC of 99.1%, accuracy of 98.8%, sensitivity of 98.45%, precision of 98.7%, and specificity of 98.85% for the C-NMC 2019 dataset. With the ALL-IDB2 dataset, hybrid systems achieved 100% results for AUC, accuracy, sensitivity, precision, and specificity.
急性淋巴细胞白血病(ALL)是最致命的白血病形式之一,因为骨髓会产生大量白细胞(WBC)。ALL是儿童和成人中最常见的癌症类型之一。医生根据白血病的阶段及其在体内的扩散情况来确定治疗方案。医生依靠在显微镜下分析血样。病理学家面临诸多挑战,比如早期感染的白细胞与正常白细胞之间存在相似性。与患者数量相比,人工诊断容易出错、存在意见分歧且缺乏经验丰富的病理学家。因此,计算机辅助系统在协助病理学家早期检测ALL方面发挥着至关重要的作用。在本研究中,开发了高效且高精度的系统来分析C-NMC 2019和ALL-IDB2数据集的图像。在所有提出的系统中,对血液显微照片进行了改进,然后将其输入主动轮廓法以提取仅包含白细胞的区域,供三个卷积神经网络模型(DenseNet121、ResNet50和MobileNet)进行进一步分析。分析这两个数据集的ALL图像的第一种策略是CNN-RF和CNN-XGBoost的混合技术。DenseNet121、ResNet50和MobileNet模型提取深度特征图。卷积神经网络模型会产生具有冗余和非显著特征的高维特征。所以,由于早期感染的白细胞与正常白细胞高度相似,将卷积神经网络深度特征图输入主成分分析(PCA)方法以选择具有高度代表性的特征,并将其发送到随机森林(RF)和极端梯度提升(XGBoost)分类器进行分类。因此,这是使用卷积神经网络模型的串行融合特征来分析ALL图像的策略。将DenseNet121-ResNet50、ResNet50-MobileNet、DenseNet121-MobileNet和DenseNet121-ResNet50-MobileNet的深度特征图进行合并,然后由随机森林分类器和XGBoost进行分类。对于C-NMC 2019数据集,DenseNet121-ResNet50-MobileNet融合特征的随机森林分类器的曲线下面积(AUC)达到99.1%,准确率为98.8%,灵敏度为98.45%,精确率为98.7%,特异性为98.85%。对于ALL-IDB2数据集,混合系统在AUC、准确率、灵敏度、精确率和特异性方面均取得了100%的结果。