Department of Radiological Sciences, David Geffen School of Medicine at UCLA, Ronald Reagan-UCLA Medical Center, 924 Westwood Boulevard, Suite 615, Los Angeles, CA, 90024, USA.
Department of Computer Science, School of Engineering at UCLA, NRB 635 Charles E. Young Dr. South Suite 116, BOX 951769, Los Angeles, CA, 90095-1769, USA.
Abdom Radiol (NY). 2019 Jun;44(6):2009-2020. doi: 10.1007/s00261-019-01929-0.
Currently, all solid enhancing renal masses without microscopic fat are considered malignant until proven otherwise and there is substantial overlap in the imaging findings of benign and malignant renal masses, particularly between clear cell RCC (ccRCC) and benign oncocytoma (ONC). Radiomics has attracted increased attention for its utility in pre-operative work-up on routine clinical images. Radiomics based approaches have converted medical images into mineable data and identified prognostic imaging signatures that machine learning algorithms can use to construct predictive models by learning the decision boundaries of the underlying data distribution. The TensorFlow™ framework from Google is a state-of-the-art open-source software library that can be used for training deep learning neural networks for performing machine learning tasks. The purpose of this study was to investigate the diagnostic value and feasibility of a deep learning-based renal lesion classifier using open-source Google TensorFlow™ Inception in differentiating ccRCC from ONC on routine four-phase MDCT in patients with pathologically confirmed renal masses.
With institutional review board approval for this 1996 Health Insurance Portability and Accountability Act compliant retrospective study and a waiver of informed consent, we queried our institution's pathology, clinical, and radiology databases for histologically proven cases of ccRCC and ONC obtained between January 2000 and January 2016 scanned with a an intravenous contrast-enhanced four-phase renal mass protocol (unenhanced (UN), corticomedullary (CM), nephrographic (NP), and excretory (EX) phases). To extract features to be used for the machine learning model, the entire renal mass was contoured in the axial plane in each of the four phases, resulting in a 3D volume of interest (VOI) representative of the entire renal mass. We investigated thirteen different approaches to convert the acquired VOI data into a set of images that adequately represented each tumor which was used to train the final layer of the neural network model. Training was performed over 4000 iterations. In each iteration, 90% of the data were designated as training data and the remaining 10% served as validation data and a leave-one-out cross-validation scheme was implemented. Accuracy, sensitivity, specificity, positive (PPV) and negative predictive (NPV) values, and CIs were calculated for the classification of the thirteen processing modes.
We analyzed 179 consecutive patients with 179 lesions (128 ccRCC and 51 ONC). The ccRCC cohort had a mean size of 3.8 cm (range 0.8-14.6 cm) and the ONC cohort had a mean lesion size of 3.9 cm (range 1.0-13.1 cm). The highest specificity and PPV (52.9% and 80.3%, respectively) were achieved in the EX phase when we analyzed the single mid-slice of the tumor in the axial, coronal and sagittal plane, and when we increased the number of mid-slices of the tumor to three, with an accuracy of 75.4%, which also increased the sensitivity to 88.3% and the PPV to 79.6%. Using the entire tumor volume also showed that classification performance was best in the EX phase with an accuracy of 74.4%, a sensitivity of 85.8% and a PPV of 80.1%. When the entire tumor volume, plus mid-slices from all phases and all planes presented as tiled images, were submitted to the final layer of the neural network we achieved a PPV of 82.5%.
The best classification result was obtained in the EX phase among the thirteen classification methods tested. Our proof of concept study is the first step towards understanding the utility of machine learning in the differentiation of ccRCC from ONC on routine CT images. We hope this could lead to future investigation into the development of a multivariate machine learning model which may augment our ability to accurately predict renal lesion histology on imaging.
目前,所有无微观脂肪的实性强化肾肿块均被认为是恶性的,除非有其他证明,良性和恶性肾肿块的影像学表现有很大的重叠,特别是透明细胞肾细胞癌(ccRCC)和良性嗜酸细胞瘤(ONC)之间。放射组学因其在常规临床图像术前评估中的应用而受到越来越多的关注。基于放射组学的方法将医学图像转化为可挖掘的数据,并确定了预后影像学特征,机器学习算法可以通过学习数据分布的决策边界,使用这些特征来构建预测模型。来自谷歌的 TensorFlow™ 框架是一个最先进的开源软件库,可用于训练深度学习神经网络,以执行机器学习任务。本研究的目的是调查使用开源谷歌 TensorFlow™Inception 的基于深度学习的肾病变分类器在区分 ccRCC 和 ONC 方面的诊断价值和可行性,该分类器在经静脉增强四期 MDCT 对病理证实的肾肿块患者中进行。
本研究获得机构审查委员会批准,符合 1996 年《健康保险携带和责任法案》的要求,并豁免了知情同意书,我们在机构的病理、临床和放射学数据库中查询了 2000 年 1 月至 2016 年 1 月期间获得的经静脉增强四期肾肿块方案(未增强(UN)、皮质髓质(CM)、肾图(NP)和排泄(EX)期)的组织学证实的 ccRCC 和 ONC 病例。为了提取用于机器学习模型的特征,在每个相位的轴向平面上对整个肾肿块进行轮廓勾画,得到代表整个肾肿块的 3D 感兴趣区(VOI)。我们研究了十三种不同的方法将获得的 VOI 数据转换为一组充分代表每个肿瘤的图像,这些图像用于训练神经网络模型的最后一层。训练在 4000 次迭代中进行。在每次迭代中,90%的数据被指定为训练数据,其余 10%的数据用于验证数据,并实施了一次留一交叉验证方案。计算了十三种处理模式的分类的准确性、敏感性、特异性、阳性(PPV)和阴性预测值(NPV)和置信区间。
我们分析了 179 例连续患者的 179 个病变(128 个 ccRCC 和 51 个 ONC)。ccRCC 组的平均大小为 3.8cm(范围 0.8-14.6cm),ONC 组的平均病变大小为 3.9cm(范围 1.0-13.1cm)。在 EX 期分析肿瘤的单个中slice,在轴向、冠状和矢状平面上,当增加肿瘤的中slice数量到三个时,特异性和 PPV 最高(分别为 52.9%和 80.3%),准确率为 75.4%,这也将敏感性提高到 88.3%,PPV 提高到 79.6%。使用整个肿瘤体积也表明,在 EX 期的分类性能最好,准确率为 74.4%,敏感性为 85.8%,PPV 为 80.1%。当整个肿瘤体积、所有相位和所有平面的中slice以及平铺图像提交给神经网络的最后一层时,我们获得了 82.5%的 PPV。
在测试的十三种分类方法中,EX 期获得了最好的分类结果。我们的概念验证研究是理解机器学习在常规 CT 图像上区分 ccRCC 和 ONC 中的应用的第一步。我们希望这能促使我们未来对开发多变量机器学习模型进行研究,这可能会提高我们在影像学上准确预测肾病变组织学的能力。