de Vente Coen, Boulogne Luuk H, Venkadesh Kiran Vaidhya, Sital Cheryl, Lessmann Nikolas, Jacobs Colin, Sanchez Clara I, van Ginneken Bram
Radboud University Medical Center, Donders Institute for Brain, Cognition and BehaviourDepartment of Medical Imaging6525GANijmegenThe Netherlands.
Informatics Institute, Faculty of ScienceUniversity of Amsterdam 1012 WX Amsterdam The Netherlands.
IEEE Trans Artif Intell. 2021 Oct 8;3(2):129-138. doi: 10.1109/TAI.2021.3115093. eCollection 2022 Apr.
Amidst the ongoing pandemic, the assessment of computed tomography (CT) images for COVID-19 presence can exceed the workload capacity of radiologists. Several studies addressed this issue by automating COVID-19 classification and grading from CT images with convolutional neural networks (CNNs). Many of these studies reported initial results of algorithms that were assembled from commonly used components. However, the choice of the components of these algorithms was often pragmatic rather than systematic and systems were not compared to each other across papers in a fair manner. We systematically investigated the effectiveness of using 3-D CNNs instead of 2-D CNNs for seven commonly used architectures, including DenseNet, Inception, and ResNet variants. For the architecture that performed best, we furthermore investigated the effect of initializing the network with pretrained weights, providing automatically computed lesion maps as additional network input, and predicting a continuous instead of a categorical output. A 3-D DenseNet-201 with these components achieved an area under the receiver operating characteristic curve of 0.930 on our test set of 105 CT scans and an AUC of 0.919 on a publicly available set of 742 CT scans, a substantial improvement in comparison with a previously published 2-D CNN. This article provides insights into the performance benefits of various components for COVID-19 classification and grading systems. We have created a challenge on grand-challenge.org to allow for a fair comparison between the results of this and future research.
在持续的疫情大流行期间,对计算机断层扫描(CT)图像进行新型冠状病毒肺炎(COVID-19)检测的工作量可能会超出放射科医生的负荷能力。多项研究通过使用卷积神经网络(CNN)对CT图像进行COVID-19分类和分级自动化来解决这一问题。这些研究中有许多报告了由常用组件组装而成的算法的初步结果。然而,这些算法组件的选择往往是出于实际考虑而非系统选择,而且不同论文中的系统之间没有以公平的方式相互比较。我们系统地研究了对于七种常用架构(包括DenseNet、Inception和ResNet变体)使用三维卷积神经网络(3-D CNN)而非二维卷积神经网络(2-D CNN)的有效性。对于表现最佳的架构,我们还进一步研究了用预训练权重初始化网络、将自动计算的病变图作为额外网络输入以及预测连续输出而非分类输出的效果。具有这些组件的3-D DenseNet-201在我们包含105例CT扫描的测试集上的受试者操作特征曲线下面积为0.930,在一组公开的742例CT扫描上的AUC为0.919,与之前发表的二维卷积神经网络相比有显著改进。本文深入探讨了各种组件对COVID-19分类和分级系统的性能优势。我们在grand-challenge.org上发起了一项挑战,以便对本研究结果与未来研究结果进行公平比较。