Kooi Thijs, van Ginneken Bram, Karssemeijer Nico, den Heeten Ard
Department of Radiology and Nuclear Medicine, RadboudUMC, Geert Grooteplein Zuid 10, Nijmegen, 6535, The Netherlands.
Department of Radiology, Academic Medical Center Amsterdam, P.O. Box 22660, DD Amsterdam, 1100, The Netherlands.
Med Phys. 2017 Mar;44(3):1017-1027. doi: 10.1002/mp.12110.
It is estimated that 7% of women in the western world will develop palpable breast cysts in their lifetime. Even though cysts have been correlated with risk of developing breast cancer, many of them are benign and do not require follow-up. We develop a method to discriminate benign solitary cysts from malignant masses in digital mammography. We think a system like this can have merit in the clinic as a decision aid or complementary to specialized modalities.
We employ a deep convolutional neural network (CNN) to classify cyst and mass patches. Deep CNNs have been shown to be powerful classifiers, but need a large amount of training data for which medical problems are often difficult to come by. The key contribution of this paper is that we show good performance can be obtained on a small dataset by pretraining the network on a large dataset of a related task. We subsequently investigate the following: (a) when a mammographic exam is performed, two different views of the same breast are recorded. We investigate the merit of combining the output of the classifier from these two views. (b) We evaluate the importance of the resolution of the patches fed to the network. (c) A method dubbed tissue augmentation is subsequently employed, where we extract normal tissue from normal patches and superimpose this onto the actual samples aiming for a classifier invariant to occluding tissue. (d) We combine the representation extracted using the deep CNN with our previously developed features.
We show that using the proposed deep learning method, an area under the ROC curve (AUC) value of 0.80 can be obtained on a set of benign solitary cysts and malignant mass findings recalled in screening. We find that it works significantly better than our previously developed approach by comparing the AUC of the ROC using bootstrapping. By combining views, the results can be further improved, though this difference was not found to be significant. We find no significant difference between using a resolution of 100 versus 200 micron. The proposed tissue augmentations give a small improvement in performance, but this improvement was also not found to be significant. The final system obtained an AUC of 0.80 with 95% confidence interval [0.78, 0.83], calculated using bootstrapping. The system works best for lesions larger than 27 mm where it obtains an AUC value of 0.87.
We have presented a computer-aided diagnosis (CADx) method to discriminate cysts from solid lesion in mammography using features from a deep CNN trained on a large set of mass candidates, obtaining an AUC of 0.80 on a set of diagnostic exams recalled from screening. We believe the system shows great potential and comes close to the performance of recently developed spectral mammography. We think the system can be further improved when more data and computational power becomes available.
据估计,西方世界7%的女性一生中会出现可触及的乳腺囊肿。尽管囊肿与患乳腺癌的风险相关,但其中许多是良性的,无需随访。我们开发了一种在数字化乳腺X线摄影中区分良性孤立囊肿与恶性肿块的方法。我们认为这样的系统在临床上作为决策辅助工具或作为专门检查方法的补充可能具有价值。
我们采用深度卷积神经网络(CNN)对囊肿和肿块图像块进行分类。深度CNN已被证明是强大的分类器,但需要大量的训练数据,而医学问题往往难以获得这些数据。本文的关键贡献在于,我们表明通过在相关任务的大数据集上对网络进行预训练,可以在小数据集上获得良好的性能。我们随后研究了以下内容:(a)进行乳腺X线检查时,会记录同一乳房的两个不同视图。我们研究了将这两个视图的分类器输出相结合的优点。(b)我们评估输入到网络的图像块分辨率的重要性。(c)随后采用一种称为组织增强的方法,我们从正常图像块中提取正常组织并将其叠加到实际样本上,以使分类器对遮挡组织具有不变性。(d)我们将使用深度CNN提取的特征与我们之前开发的特征相结合。
我们表明,使用所提出的深度学习方法,在一组筛查中召回的良性孤立囊肿和恶性肿块发现上,ROC曲线下面积(AUC)值可达0.80。通过使用自助法比较ROC的AUC,我们发现它的效果明显优于我们之前开发的方法。通过结合视图,结果可以进一步改善,尽管这种差异不显著。我们发现使用100微米与200微米分辨率之间没有显著差异。所提出的组织增强方法在性能上有小幅提升,但这种提升也不显著。使用自助法计算,最终系统的AUC为0.80,95%置信区间为[0.78, 0.83]。该系统对大于27毫米的病变效果最佳,此时AUC值为0.87。
我们提出了一种计算机辅助诊断(CADx)方法,利用在大量肿块候选数据集上训练的深度CNN的特征,在乳腺X线摄影中区分囊肿与实性病变,在一组筛查召回的诊断检查中AUC达到0.80。我们相信该系统显示出巨大潜力,接近最近开发的光谱乳腺摄影的性能。我们认为当有更多数据和计算能力时,该系统可以进一步改进。