Department of Biomedical Engineering, Case Western Reserve University, Cleveland, OH, USA.
OHSU Knight Cancer Institute, Oregon Health & Science University, Oregon, USA.
Eur J Cancer. 2021 May;148:146-158. doi: 10.1016/j.ejca.2021.02.008. Epub 2021 Mar 17.
To identify stable and discriminating radiomic features on non-contrast CT scans to develop more generalisable radiomic classifiers for distinguishing granulomas from adenocarcinomas.
In total, 412 patients with adenocarcinomas and granulomas from three institutions were retrospectively included. Segmentations of the lung nodules were performed manually by an expert radiologist in a 2D axial view. Radiomic features were extracted from intra- and perinodular regions. A total of 145 patients were used as part of the training set (S), whereas 205 patients were used as part of test set I (S) and 62 patients were used as part of independent test set II (S). To mitigate the variation of CT acquisition parameters, we defined 'stable' radiomic features as those for which the feature expression remains relatively unchanged between different sites, as assessed using a Wilcoxon rank-sum test. These stable features were used to develop more generalisable radiomic classifiers that were more resilient to variations in lung CT scans. Features were ranked based on two criteria, firstly based on discriminability (i.e. maximising AUC) alone and subsequently based on maximising both feature stability and discriminability. Different machine-learning classifiers (Linear discriminant analysis, Quadratic discriminant analysis, Support vector machines and random forest) were trained with features selected using the two different criteria and then compared on the two independent test sets for distinguishing granulomas from adenocarcinomas, in terms of area under the receiver operating characteristic curve.
In the test sets, classifiers constructed using the criteria involving maximising feature stability and discriminability simultaneously achieved higher AUC compared with the discriminating alone criteria (S [n = 205]: maximum AUCs of 0.85versus . 0.80; p-value = 0.047 and S [n = 62]: maximum AUCs of 0.87 versus. 0.79; p-value = 0.021). These differences held for features extracted from scans with <3 mm slice thickness (AUC = 0.88 versus. 0.80; p-value = 0.039, n = 100) and for the ≥3 mm cases (AUC = 0.81 versus. 0.76; p-value = 0.034, n = 105). In both experiments, shape and peritumoural texture features had a higher stability compared with intratumoural texture features.
Our study suggests that explicitly accounting for both stability and discriminability results in more generalisable radiomic classifiers to distinguish adenocarcinomas from granulomas on non-contrast CT scans. Our results also showed that peritumoural texture and shape features were less affected by the scanner parameters compared with intratumoural texture features; however, they were also less discriminating compared with intratumoural features.
在非对比 CT 扫描上识别稳定且具有区分能力的放射组学特征,以开发更具普适性的放射组学分类器,用于区分肉芽肿与腺癌。
回顾性纳入了来自 3 家机构的 412 例腺癌和肉芽肿患者。由一位经验丰富的放射科医生在 2 轴轴向视图中手动进行肺结节的分割。从结节内和结节周围区域提取放射组学特征。共有 145 例患者被用作训练集(S)的一部分,205 例患者被用作测试集 I(S)的一部分,62 例患者被用作独立测试集 II(S)的一部分。为了减轻 CT 采集参数的变化,我们将“稳定”的放射组学特征定义为在不同部位之间特征表达相对不变的特征,这通过 Wilcoxon 秩和检验进行评估。这些稳定的特征被用于开发更具普适性的放射组学分类器,这些分类器对肺部 CT 扫描的变化更具弹性。根据两个标准对特征进行排序,首先基于可区分性(即最大化 AUC)进行排序,然后基于同时最大化特征稳定性和可区分性进行排序。使用这两种不同标准选择的特征训练不同的机器学习分类器(线性判别分析、二次判别分析、支持向量机和随机森林),然后在两个独立的测试集上根据接收器工作特征曲线下的面积来比较这些分类器在区分肉芽肿与腺癌方面的性能。
在测试集中,使用同时最大化特征稳定性和可区分性的标准构建的分类器与仅基于可区分性的标准相比,在区分肉芽肿与腺癌方面具有更高的 AUC(S [n=205]:最大 AUC 分别为 0.85 和 0.80;p 值=0.047;S [n=62]:最大 AUC 分别为 0.87 和 0.79;p 值=0.021)。这些差异适用于提取自厚度<3mm 切片扫描的特征(AUC=0.88 与 0.80;p 值=0.039,n=100)和厚度≥3mm 扫描的特征(AUC=0.81 与 0.76;p 值=0.034,n=105)。在这两种情况下,与肿瘤内纹理特征相比,肿瘤周围纹理和形状特征的稳定性更高。
本研究表明,明确考虑稳定性和可区分性可以产生更具普适性的放射组学分类器,用于在非对比 CT 扫描上区分腺癌与肉芽肿。我们的结果还表明,与肿瘤内纹理特征相比,肿瘤周围纹理和形状特征受扫描仪参数的影响较小,但与肿瘤内特征相比,它们的区分能力也较低。