University of Maryland, 1000 Hilltop Circle, 21250, Baltimore, MD, USA.
Institute for Data Science and Computing, University of Miami, 33124, Coral Gables, FL, USA.
J Digit Imaging. 2023 Aug;36(4):1376-1389. doi: 10.1007/s10278-023-00811-2. Epub 2023 Apr 17.
We present a novel algorithm that is able to generate deep synthetic COVID-19 pneumonia CT scan slices using a very small sample of positive training images in tandem with a larger number of normal images. This generative algorithm produces images of sufficient accuracy to enable a DNN classifier to achieve high classification accuracy using as few as 10 positive training slices (from 10 positive cases), which to the best of our knowledge is one order of magnitude fewer than the next closest published work at the time of writing. Deep learning with extremely small positive training volumes is a very difficult problem and has been an important topic during the COVID-19 pandemic, because for quite some time it was difficult to obtain large volumes of COVID-19-positive images for training. Algorithms that can learn to screen for diseases using few examples are an important area of research. Furthermore, algorithms to produce deep synthetic images with smaller data volumes have the added benefit of reducing the barriers of data sharing between healthcare institutions. We present the cycle-consistent segmentation-generative adversarial network (CCS-GAN). CCS-GAN combines style transfer with pulmonary segmentation and relevant transfer learning from negative images in order to create a larger volume of synthetic positive images for the purposes of improving diagnostic classification performance. The performance of a VGG-19 classifier plus CCS-GAN was trained using a small sample of positive image slices ranging from at most 50 down to as few as 10 COVID-19-positive CT scan images. CCS-GAN achieves high accuracy with few positive images and thereby greatly reduces the barrier of acquiring large training volumes in order to train a diagnostic classifier for COVID-19.
我们提出了一种新颖的算法,该算法能够使用非常小的阳性训练图像样本和大量的正常图像生成深度合成的 COVID-19 肺炎 CT 扫描切片。该生成算法生成的图像具有足够的准确性,使得 DNN 分类器仅使用 10 个阳性训练切片(来自 10 个阳性病例)即可实现高分类准确性,据我们所知,这比当时已发表的最接近的工作少了一个数量级。在 COVID-19 大流行期间,使用极小的阳性训练量进行深度学习是一个非常困难的问题,因为在相当长的一段时间内,很难获得大量 COVID-19 阳性图像进行训练。能够使用少量示例学习筛选疾病的算法是一个重要的研究领域。此外,使用较小数据量生成深度合成图像的算法还有助于减少医疗机构之间数据共享的障碍。我们提出了循环一致的分割生成对抗网络(CCS-GAN)。CCS-GAN 将风格转换与肺部分割以及来自阴性图像的相关迁移学习相结合,以便为提高诊断分类性能创建更大体积的合成阳性图像。使用 VGG-19 分类器和 CCS-GAN 对阳性图像切片数量最多为 50 个、最少为 10 个 COVID-19 阳性 CT 扫描图像的小样本进行了训练。CCS-GAN 在使用少量阳性图像的情况下实现了高精度,从而大大降低了获取大量训练数据的障碍,以便为 COVID-19 训练诊断分类器。