Horry Michael James, Chakraborty Subrata, Pradhan Biswajeet, Fallahpoor Maryam, Chegeni Hossein, Paul Manoranjan
Center for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and Information Technology, University of Technology Sydney, Australia.
Center of Excellence for Climate Change Research, King Abdulaziz University, Jeddah 21589, Saudi Arabia.
Math Biosci Eng. 2021 Oct 27;18(6):9264-9293. doi: 10.3934/mbe.2021456.
The COVID-19 pandemic has inspired unprecedented data collection and computer vision modelling efforts worldwide, focused on the diagnosis of COVID-19 from medical images. However, these models have found limited, if any, clinical application due in part to unproven generalization to data sets beyond their source training corpus. This study investigates the generalizability of deep learning models using publicly available COVID-19 Computed Tomography data through cross dataset validation. The predictive ability of these models for COVID-19 severity is assessed using an independent dataset that is stratified for COVID-19 lung involvement. Each inter-dataset study is performed using histogram equalization, and contrast limited adaptive histogram equalization with and without a learning Gabor filter. We show that under certain conditions, deep learning models can generalize well to an external dataset with F1 scores up to 86%. The best performing model shows predictive accuracy of between 75% and 96% for lung involvement scoring against an external expertly stratified dataset. From these results we identify key factors promoting deep learning generalization, being primarily the uniform acquisition of training images, and secondly diversity in CT slice position.
新冠疫情激发了全球范围内前所未有的数据收集和计算机视觉建模工作,重点是从医学图像中诊断新冠病毒。然而,这些模型的临床应用有限,部分原因是它们对源训练语料库以外的数据集的泛化能力未经证实。本研究通过跨数据集验证,利用公开可用的新冠计算机断层扫描数据,研究深度学习模型的泛化能力。使用一个针对新冠肺部受累情况进行分层的独立数据集,评估这些模型对新冠严重程度的预测能力。每个数据集间的研究都使用直方图均衡化,以及带有和不带有学习型伽柏滤波器的对比度受限自适应直方图均衡化。我们表明,在某些条件下,深度学习模型可以很好地泛化到外部数据集,F1分数高达86%。表现最佳的模型在针对外部专家分层数据集的肺部受累评分中,预测准确率在75%至96%之间。从这些结果中,我们确定了促进深度学习泛化的关键因素,主要是训练图像的统一采集,其次是CT切片位置的多样性。