Li Chun, Yang Yunyun, Liang Hui, Wu Boying
School of Science, Harbin Institute of Technology, Shenzhen, 518055, China.
Department of Mathematics, Harbin Institute of Technology, Harbin, 150006, China.
Knowl Based Syst. 2021 Apr 22;218:106849. doi: 10.1016/j.knosys.2021.106849. Epub 2021 Feb 6.
The coronavirus disease, called COVID-19, which is spreading fast worldwide since the end of 2019, and has become a global challenging pandemic. Until 27th May 2020, it caused more than 5.6 million individuals infected throughout the world and resulted in greater than 348,145 deaths. CT images-based classification technique has been tried to use the identification of COVID-19 with CT imaging by hospitals, which aims to minimize the possibility of virus transmission and alleviate the burden of clinicians and radiologists. Early diagnosis of COVID-19, which not only prevents the disease from spreading further but allows more reasonable allocation of limited medical resources. Therefore, CT images play an essential role in identifying cases of COVID-19 that are in great need of intensive clinical care. Unfortunately, the current public health emergency, which has caused great difficulties in collecting a large set of precise data for training neural networks. To tackle this challenge, our first thought is transfer learning, which is a technique that aims to transfer the knowledge from one or more source tasks to a target task when the latter has fewer training data. Since the training data is relatively limited, so a transfer learning-based DensNet-121 approach for the identification of COVID-19 is established. The proposed method is inspired by the precious work of predecessors such as CheXNet for identifying common Pneumonia, which was trained using the large Chest X-ray14 dataset, and the dataset contains 112,120 frontal chest X-rays of 14 different chest diseases (including Pneumonia) that are individually labeled and achieved good performance. Therefore, CheXNet as the pre-trained network was used for the target task (COVID-19 classification) by fine-tuning the network weights on the small-sized dataset in the target task. Finally, we evaluated our proposed method on the COVID-19-CT dataset. Experimentally, our method achieves state-of-the-art performance for the accuracy (ACC) and F1-score. The quantitative indicators show that the proposed method only uses a GPU can reach the best performance, up to 0.87 and 0.86, respectively, compared with some widely used and recent deep learning methods, which are helpful for COVID-19 diagnosis and patient triage. The codes used in this manuscript are publicly available on GitHub at (https://github.com/lichun0503/CT-Classification).
冠状病毒病,即COVID-19,自2019年底以来在全球迅速传播,已成为一场具有全球挑战性的大流行病。截至2020年5月27日,它已导致全球超过560万人感染,死亡人数超过348145人。基于CT图像的分类技术已被医院尝试用于通过CT成像识别COVID-19,其目的是尽量减少病毒传播的可能性,并减轻临床医生和放射科医生的负担。COVID-19的早期诊断,不仅可以防止疾病进一步传播,还能使有限的医疗资源得到更合理的分配。因此,CT图像在识别急需重症临床护理的COVID-19病例中起着至关重要的作用。不幸的是,当前的公共卫生紧急情况给收集大量精确数据用于训练神经网络带来了巨大困难。为应对这一挑战,我们首先想到的是迁移学习,这是一种旨在在目标任务训练数据较少时,将知识从一个或多个源任务转移到目标任务的技术。由于训练数据相对有限,因此建立了一种基于迁移学习的DensNet-121方法来识别COVID-19。所提出的方法受到了前人宝贵工作的启发,比如用于识别常见肺炎的CheXNet,它使用大型胸部X线14数据集进行训练,该数据集包含14种不同胸部疾病(包括肺炎)的112120张胸部正位X线片,这些片子都有单独标注且取得了良好性能。因此,将CheXNet作为预训练网络,通过在目标任务的小型数据集上微调网络权重来用于目标任务(COVID-19分类)。最后,我们在COVID-19-CT数据集上评估了我们提出的方法。实验表明,我们的方法在准确率(ACC)和F1分数方面达到了当前最优性能。定量指标显示,所提出的方法仅使用一块GPU就能达到最佳性能,分别高达0.87和0.86,与一些广泛使用的近期深度学习方法相比,这对COVID-19诊断和患者分流有帮助。本手稿中使用的代码可在GitHub上公开获取(https://github.com/lichun0503/CT-Classification)。