Artificial Intelligence Imaging Hub, Royal Marsden NHS Foundation Trust, London, UK.
Division of Radiotherapy and Imaging, Institute of Cancer Research, London, UK.
Cancer Imaging. 2023 Aug 14;23(1):76. doi: 10.1186/s40644-023-00594-3.
The aim of this work is to evaluate the performance of radiomics predictions for a range of molecular, genomic and clinical targets in patients with clear cell renal cell carcinoma (ccRCC) and demonstrate the impact of novel feature selection strategies and sub-segmentations on model interpretability.
Contrast-enhanced CT scans from the first 101 patients recruited to the TRACERx Renal Cancer study (NCT03226886) were used to derive radiomics classification models to predict 20 molecular, histopathology and clinical target variables. Manual 3D segmentation was used in conjunction with automatic sub-segmentation to generate radiomics features from the core, rim, high and low enhancing sub-regions, and the whole tumour. Comparisons were made between two classification model pipelines: a Conventional pipeline reflecting common radiomics practice, and a Proposed pipeline including two novel feature selection steps designed to improve model interpretability. For both pipelines nested cross-validation was used to estimate prediction performance and tune model hyper-parameters, and permutation testing was used to evaluate the statistical significance of the estimated performance measures. Further model robustness assessments were conducted by evaluating model variability across the cross-validation folds.
Classification performance was significant (p < 0.05, H:AUROC = 0.5) for 11 of 20 targets using either pipeline and for these targets the AUROCs were within ± 0.05 for the two pipelines, except for one target where the Proposed pipeline performance increased by > 0.1. Five of these targets (necrosis on histology, presence of renal vein invasion, overall histological stage, linear evolutionary subtype and loss of 9p21.3 somatic alteration marker) had AUROC > 0.8. Models derived using the Proposed pipeline contained fewer feature groups than the Conventional pipeline, leading to more straightforward model interpretations without loss of performance. Sub-segmentations lead to improved performance and/or improved interpretability when predicting the presence of sarcomatoid differentiation and tumour stage.
Use of the Proposed pipeline, which includes the novel feature selection methods, leads to more interpretable models without compromising prediction performance.
NCT03226886 (TRACERx Renal).
本研究旨在评估在透明细胞肾细胞癌(ccRCC)患者中,对一系列分子、基因组和临床靶标进行放射组学预测的性能,并展示新的特征选择策略和亚分割对模型可解释性的影响。
使用来自 TRACERx 肾癌研究(NCT03226886)中招募的前 101 例患者的增强 CT 扫描来构建放射组学分类模型,以预测 20 个分子、组织病理学和临床靶标变量。手动 3D 分割与自动亚分割相结合,从核心、边缘、高增强和低增强亚区以及整个肿瘤中生成放射组学特征。比较了两种分类模型管道:传统管道反映了常见的放射组学实践,以及包括两个旨在提高模型可解释性的新特征选择步骤的建议管道。对于两种管道,都使用嵌套交叉验证来估计预测性能并调整模型超参数,并使用置换检验来评估估计性能度量的统计学意义。通过评估跨交叉验证折叠的模型可变性来进一步进行模型稳健性评估。
使用两种管道中的任何一种,对 20 个目标中的 11 个目标的分类性能均具有统计学意义(p<0.05,H:AUROC=0.5),并且对于这些目标,两个管道的 AUROCs 相差在 0.05 以内,除了一个目标,其中建议管道的性能提高了>0.1。这些目标中有 5 个(组织学上的坏死、肾静脉侵犯的存在、整体组织学阶段、线性进化亚型和丢失 9p21.3 体细胞改变标记物)的 AUROC>0.8。使用建议管道构建的模型比传统管道包含更少的特征组,导致模型解释更加简单,而不会降低性能。在预测肉瘤样分化和肿瘤分期时,亚分割可提高性能和/或可解释性。
使用包括新特征选择方法的建议管道可构建更具解释性的模型,而不会降低预测性能。
NCT03226886(TRACERx 肾)。