Rajaraman Sivaramakrishnan, Siegelman Jen, Alderson Philip O, Folio Lucas S, Folio Les R, Antani Sameer K
Lister Hill National Center for Biomedical Communications, National Library of Medicine, Bethesda, MD 20894 USA.
Takeda Pharmaceuticals, Cambridge, MA 02139 USA.
IEEE Access. 2020;8:115041-115050. doi: 10.1109/access.2020.3003810. Epub 2020 Jun 19.
We demonstrate use of iteratively pruned deep learning model ensembles for detecting pulmonary manifestation of COVID-19 with chest X-rays. This disease is caused by the novel Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) virus, also known as the novel Coronavirus (2019-nCoV). A custom convolutional neural network and a selection of ImageNet pretrained models are trained and evaluated at patient-level on publicly available CXR collections to learn modality-specific feature representations. The learned knowledge is transferred and fine-tuned to improve performance and generalization in the related task of classifying CXRs as normal, showing bacterial pneumonia, or COVID-19-viral abnormalities. The best performing models are iteratively pruned to reduce complexity and improve memory efficiency. The predictions of the best-performing pruned models are combined through different ensemble strategies to improve classification performance. Empirical evaluations demonstrate that the weighted average of the best-performing pruned models significantly improves performance resulting in an accuracy of 99.01% and area under the curve of 0.9972 in detecting COVID-19 findings on CXRs. The combined use of modality-specific knowledge transfer, iterative model pruning, and ensemble learning resulted in improved predictions. We expect that this model can be quickly adopted for COVID-19 screening using chest radiographs.
我们展示了如何使用迭代剪枝的深度学习模型集成,通过胸部X光检测新型冠状病毒肺炎(COVID-19)的肺部表现。这种疾病由新型严重急性呼吸综合征冠状病毒2(SARS-CoV-2)引起,也被称为新型冠状病毒(2019-nCoV)。在公开可用的胸部X光片集合上,针对患者级别训练并评估了一个定制的卷积神经网络和一系列在ImageNet上预训练的模型,以学习特定模态的特征表示。将所学知识进行迁移和微调,以提高在将胸部X光片分类为正常、显示细菌性肺炎或COVID-19病毒异常的相关任务中的性能和泛化能力。对表现最佳的模型进行迭代剪枝,以降低复杂度并提高内存效率。通过不同的集成策略组合表现最佳的剪枝模型的预测结果,以提高分类性能。实证评估表明,表现最佳的剪枝模型的加权平均值显著提高了性能,在检测胸部X光片中的COVID-19表现时,准确率达到99.01%,曲线下面积为0.9972。特定模态的知识迁移、迭代模型剪枝和集成学习的联合使用提高了预测能力。我们期望该模型能够迅速应用于使用胸部X光片进行的COVID-19筛查。