Zhang Jia, Huang Wenxiang, Li Yang, Zhang Xuan, Chen Yong, Chen Shaohao, Ming Qiu, Jiang Qing, Xv Yingjie
Department of Geriatrics, The First Affiliated Hospital of Chongqing Medical University, Chongqing 400016, China (J.Z., W.H.).
Department of Urology, Chongqing University Three Gorges Hospital, Chongqing 404000, China (Y.L.).
Acad Radiol. 2025 Jul;32(7):3788-3800. doi: 10.1016/j.acra.2025.03.042. Epub 2025 Apr 10.
To develop and validate a computed tomography (CT) radiomics-based interpretable machine learning (ML) model for predicting 5-year recurrence-free survival (RFS) in non-metastatic clear cell renal cell carcinoma (ccRCC).
559 patients with non-metastatic ccRCCs were retrospectively enrolled from eight independent institutes between March 2013 and January 2019, and were assigned to the primary set (n=271), external test set 1 (n=216), and external test set 2 (n=72). 1316 Radiomics features were extracted via "Pyradiomics." The least absolute shrinkage and selection operator algorithm was used for feature selection and Rad-Score construction. Patients were stratified into low and high 5-year recurrence risk groups based on Rad-Score, followed by Kaplan-Meier analyses. Five ML models integrating Rad-Score and clinicopathological risk factors were compared. Models' performances were evaluated via the discrimination, calibration, and decision curve analysis. The most robust ML model was interpreted using the SHapley Additive exPlanation (SHAP) method.
13 radiomic features were filtered to produce the Rad-Score, which predicted 5-year RFS with area under the receiver operating characteristic curve (AUCs) of 0.734-0.836. Kaplan-Meier analysis showed significant survival differences based on Rad-Score (all Log-Rank p values <0.05). The random forest model outperformed other models, obtaining AUCs of 0.826 [95% confidential interval (CI): 0.766-0.879] and 0.799 (95% CI: 0.670-0.899) in the external test set 1 and 2, respectively. The SHAP analysis suggested positive associations between contributing factors and 5-year RFS status in non-metastatic ccRCC.
CT radiomics-based interpretable ML model can effectively predict 5-year RFS in non-metastatic ccRCC patients, distinguishing between low and high 5-year recurrence risks.
开发并验证一种基于计算机断层扫描(CT)影像组学的可解释机器学习(ML)模型,用于预测非转移性透明细胞肾细胞癌(ccRCC)的5年无复发生存率(RFS)。
2013年3月至2019年1月期间,从8个独立机构回顾性纳入559例非转移性ccRCC患者,并将其分配到主要数据集(n = 271)、外部测试集1(n = 216)和外部测试集2(n = 72)。通过“Pyradiomics”提取1316个影像组学特征。使用最小绝对收缩和选择算子算法进行特征选择和Rad-Score构建。根据Rad-Score将患者分为5年复发风险低和高的组,然后进行Kaplan-Meier分析。比较了5个整合Rad-Score和临床病理危险因素的ML模型。通过鉴别、校准和决策曲线分析评估模型性能。使用SHapley加性解释(SHAP)方法解释最稳健的ML模型。
筛选出13个影像组学特征以生成Rad-Score,其预测5年RFS的受试者工作特征曲线下面积(AUC)为0.734 - 0.836。Kaplan-Meier分析显示基于Rad-Score的生存差异显著(所有对数秩p值<0.05)。随机森林模型优于其他模型,在外部测试集1和2中的AUC分别为0.826 [95%置信区间(CI):0.766 - 0.879]和0.799(95% CI:0.670 - 0.899)。SHAP分析表明非转移性ccRCC中影响因素与5年RFS状态之间存在正相关。
基于CT影像组学的可解释ML模型可以有效预测非转移性ccRCC患者的5年RFS,区分5年复发风险的高低。