Department of Eye and Vision Science, University of Liverpool, Liverpool, United Kingdom.
Advanced Research Computing, University of Liverpool, Liverpool, United Kingdom.
Med Image Anal. 2023 Feb;84:102722. doi: 10.1016/j.media.2022.102722. Epub 2022 Dec 15.
Coronavirus disease (COVID-19) has caused a worldwide pandemic, putting millions of people's health and lives in jeopardy. Detecting infected patients early on chest computed tomography (CT) is critical in combating COVID-19. Harnessing uncertainty-aware consensus-assisted multiple instance learning (UC-MIL), we propose to diagnose COVID-19 using a new bilateral adaptive graph-based (BA-GCN) model that can use both 2D and 3D discriminative information in 3D CT volumes with arbitrary number of slices. Given the importance of lung segmentation for this task, we have created the largest manual annotation dataset so far with 7,768 slices from COVID-19 patients, and have used it to train a 2D segmentation model to segment the lungs from individual slices and mask the lungs as the regions of interest for the subsequent analyses. We then used the UC-MIL model to estimate the uncertainty of each prediction and the consensus between multiple predictions on each CT slice to automatically select a fixed number of CT slices with reliable predictions for the subsequent model reasoning. Finally, we adaptively constructed a BA-GCN with vertices from different granularity levels (2D and 3D) to aggregate multi-level features for the final diagnosis with the benefits of the graph convolution network's superiority to tackle cross-granularity relationships. Experimental results on three largest COVID-19 CT datasets demonstrated that our model can produce reliable and accurate COVID-19 predictions using CT volumes with any number of slices, which outperforms existing approaches in terms of learning and generalisation ability. To promote reproducible research, we have made the datasets, including the manual annotations and cleaned CT dataset, as well as the implementation code, available at https://doi.org/10.5281/zenodo.6361963.
冠状病毒病(COVID-19)已在全球范围内造成大流行,危及数百万人的健康和生命。在胸部计算机断层扫描(CT)上尽早发现感染患者对于抗击 COVID-19 至关重要。利用不确定性感知共识辅助多实例学习(UC-MIL),我们提出了一种新的双边自适应图基(BA-GCN)模型,该模型可以使用 3D CT 容积中任意数量切片的 2D 和 3D 鉴别信息。鉴于该任务中肺分割的重要性,我们创建了迄今为止最大的手动标注数据集,其中包含来自 COVID-19 患者的 7768 个切片,并使用该数据集训练了 2D 分割模型,以便从各个切片中分割出肺部,并将肺部掩膜为后续分析的感兴趣区域。然后,我们使用 UC-MIL 模型来估计每个预测的不确定性以及多个预测之间的共识,以便自动选择具有可靠预测的固定数量的 CT 切片,用于后续模型推理。最后,我们自适应地构建了一个具有不同粒度级别(2D 和 3D)顶点的 BA-GCN,以聚合多层次特征,用于最终诊断,充分利用图卷积网络在处理跨粒度关系方面的优势。在三个最大的 COVID-19 CT 数据集上的实验结果表明,我们的模型可以使用任意数量切片的 CT 容积生成可靠且准确的 COVID-19 预测,在学习和泛化能力方面优于现有方法。为了促进可重复研究,我们在 https://doi.org/10.5281/zenodo.6361963 上提供了数据集,包括手动标注和清理后的 CT 数据集以及实现代码。