Zheng Wenbo, Yan Lan, Gou Chao, Zhang Zhi-Cheng, Zhang Jun J, Hu Ming, Wang Fei-Yue
School of Software Engineering Xi'an Jiaotong University Xi'an China.
The State Key Laboratory for Management and Control of Complex Systems, Institute of Automation Chinese Academy of Sciences Beijing China.
Int J Intell Syst. 2021 Aug;36(8):4033-4064. doi: 10.1002/int.22449. Epub 2021 May 13.
The goal of diagnosing the coronavirus disease 2019 (COVID-19) from suspected pneumonia cases, that is, recognizing COVID-19 from chest X-ray or computed tomography (CT) images, is to improve diagnostic accuracy, leading to faster intervention. The most important and challenging problem here is to design an effective and robust diagnosis model. To this end, there are three challenges to overcome: (1) The lack of training samples limits the success of existing deep-learning-based methods. (2) Many public COVID-19 data sets contain only a few images without fine-grained labels. (3) Due to the explosive growth of suspected cases, it is and to diagnose not only COVID-19 cases but also the cases of other types of pneumonia that are similar to the symptoms of COVID-19. To address these issues, we propose a novel framework called to address the problem of differentiating COVID-19 from pneumonia cases. During training, our model cannot use any true labels and aims to gain the ability of learning to learn by itself. In particular, we first present a deep diagnosis model based on a relation network to capture and memorize the relation among different images. Second, to enhance the performance of our model, we design a self-knowledge distillation mechanism that distills knowledge within our model itself. Our network is divided into several parts, and the knowledge in the deeper parts is squeezed into the shallow ones. The final results are derived from our model by learning to compare the features of images. Experimental results demonstrate that our approach achieves significantly higher performance than other state-of-the-art methods. Moreover, we construct a new COVID-19 pneumonia data set based on text mining, consisting of 2696 COVID-19 images (347 X-ray + 2349 CT), 10,155 images (9661 X-ray + 494 CT) about other types of pneumonia, and the fine-grained labels of all. Our data set considers not only a bacterial infection or viral infection which causes pneumonia but also a viral infection derived from the influenza virus or coronavirus.
从疑似肺炎病例中诊断新型冠状病毒肺炎(COVID-19),即从胸部X光或计算机断层扫描(CT)图像中识别COVID-19,目的是提高诊断准确性,从而实现更快的干预。这里最重要且最具挑战性的问题是设计一个有效且稳健的诊断模型。为此,有三个挑战需要克服:(1)训练样本的缺乏限制了现有基于深度学习方法的成功。(2)许多公开的COVID-19数据集仅包含少量图像且没有细粒度标签。(3)由于疑似病例的爆发式增长,不仅要诊断COVID-19病例,还要诊断与COVID-19症状相似的其他类型肺炎病例,这既困难又具有挑战性。为了解决这些问题,我们提出了一个名为 的新颖框架,以解决区分COVID-19和肺炎病例的问题。在训练过程中,我们的模型不能使用任何真实标签,旨在获得自我学习学习的能力。具体而言,我们首先提出一种基于关系网络的深度诊断模型,以捕捉和记忆不同图像之间的关系。其次,为了提高我们模型的性能,我们设计了一种自知识蒸馏机制,在我们的模型自身内部进行知识蒸馏。我们的网络分为几个部分,较深部分的知识被压缩到较浅部分。最终结果是通过学习比较图像特征从我们的模型中得出的。实验结果表明,我们的方法比其他现有最先进方法具有显著更高的性能。此外,我们基于文本挖掘构建了一个新的COVID-19肺炎数据集,该数据集由2696张COVID-19图像(347张X光 + 2349张CT)、10155张关于其他类型肺炎的图像(9661张X光 + 494张CT)以及所有图像的细粒度标签组成。我们的数据集不仅考虑了导致肺炎的细菌感染或病毒感染,还考虑了源自流感病毒或冠状病毒的病毒感染。