Shah Pir Masoom, Ullah Hamid, Ullah Rahim, Shah Dilawar, Wang Yulin, Islam Saif Ul, Gani Abdullah, Rodrigues Joel J P C
School of Computer Science Wuhan University Wuhan China.
Department of Computer Science Bacha Khan University Charsadda Pakistan.
Expert Syst. 2022 Mar;39(3):e12823. doi: 10.1111/exsy.12823. Epub 2021 Oct 19.
Currently, many deep learning models are being used to classify COVID-19 and normal cases from chest X-rays. However, the available data (X-rays) for COVID-19 is limited to train a robust deep-learning model. Researchers have used data augmentation techniques to tackle this issue by increasing the numbers of samples through flipping, translation, and rotation. However, by adopting this strategy, the model compromises for the learning of high-dimensional features for a given problem. Hence, there are high chances of overfitting. In this paper, we used deep-convolutional generative adversarial networks algorithm to address this issue, which generates synthetic images for all the classes (Normal, Pneumonia, and COVID-19). To validate whether the generated images are accurate, we used the k-mean clustering technique with three clusters (Normal, Pneumonia, and COVID-19). We only selected the X-ray images classified in the correct clusters for training. In this way, we formed a synthetic dataset with three classes. The generated dataset was then fed to The EfficientNetB4 for training. The experiments achieved promising results of 95% in terms of area under the curve (AUC). To validate that our network has learned discriminated features associated with lung in the X-rays, we used the Grad-CAM technique to visualize the underlying pattern, which leads the network to its final decision.
目前,许多深度学习模型被用于从胸部X光片中对新冠肺炎病例和正常病例进行分类。然而,用于新冠肺炎的可用数据(X光片)有限,难以训练出一个强大的深度学习模型。研究人员采用数据增强技术来解决这个问题,通过翻转、平移和旋转来增加样本数量。然而,采用这种策略时,模型在学习给定问题的高维特征方面会有所妥协。因此,存在过拟合的高风险。在本文中,我们使用深度卷积生成对抗网络算法来解决这个问题,该算法为所有类别(正常、肺炎和新冠肺炎)生成合成图像。为了验证生成的图像是否准确,我们使用了具有三个聚类(正常、肺炎和新冠肺炎)的k均值聚类技术。我们只选择正确聚类中分类的X光图像进行训练。通过这种方式,我们形成了一个包含三个类别的合成数据集。然后将生成的数据集输入到EfficientNetB4中进行训练。实验在曲线下面积(AUC)方面取得了95%的良好结果。为了验证我们的网络是否学习到了与X光片中肺部相关的判别特征,我们使用Grad-CAM技术来可视化潜在模式,该模式引导网络做出最终决策。