Sun Yihan, Lin Da, Deng Xiangyang, Zhang Yinlong
Department of Thyroid Breast Surgery, The Second Affiliated Hospital and Yuying Children's Hospital of Wenzhou Medical University, Wenzhou, China.
Department of General Surgery, Hangzhou TCM Hospital Affiliated to Zhejiang Chinese Medical University, Hangzhou, China.
Discov Oncol. 2025 Jan 21;16(1):70. doi: 10.1007/s12672-024-01703-9.
Despite the identification of various prognostic factors for anaplastic thyroid carcinoma (ATC) patients over the years, a precise prognostic tool for these patients is still lacking. This study aimed to develop and validate a prognostic model for predicting survival outcomes for ATC patients using random survival forests (RSF), a machine learning algorithm.
A total of 1222 ATC patients were extracted from the Surveillance, Epidemiology, and End Results (SEER) database and randomly divided into a training set of 855 patients and a validation set of 367 patients. We developed an RSF model and a traditional Cox model using the training cohort and further compared their performance based on calibration and discrimination. integrated brier score (iBS) was used to estimate the calibration ability. The Brier score, C-index value, the receiver operating characteristic (ROC) curve with the area under the curve (AUC) and Decision Curves Analysis (DCA) were evaluated. Furthermore, we assessed the feature importance within the RSF model and validated its performance using the validation group.
An RSF model and a traditional Cox model were successfully developed in training set. The Brier score for the RSF model was 0.055, which is lower than the Cox model's score of 0.063, indicating better performance since a lower Brier score signifies superior model accuracy. The RSF model exceeded the Cox model in performance based on the C-index and AUC. Additionally, the DCA curve indicated that the RSF model provided substantial clinical benefit. And we further ranked the time-dependent features according to their permutation importance and observed that surgery, radiotherapy, and chemotherapy were the most influential predictors initially. Moreover, according to the RSF model predictions, the ATC patients were successfully stratified into 2 prognostic groups displaying significant difference in survival.
This prognostic study first revealed that RSF offers more precise overall survival predictions and superior prognostic stratification compared to the Cox regression model for ATC patients.
尽管多年来已确定了间变性甲状腺癌(ATC)患者的各种预后因素,但仍缺乏针对这些患者的精确预后工具。本研究旨在开发并验证一种使用随机生存森林(RSF)这一机器学习算法来预测ATC患者生存结局的预后模型。
从监测、流行病学和最终结果(SEER)数据库中提取了总共1222例ATC患者,并将其随机分为855例患者的训练集和367例患者的验证集。我们使用训练队列开发了一个RSF模型和一个传统的Cox模型,并基于校准和区分进一步比较了它们的性能。综合Brier评分(iBS)用于估计校准能力。评估了Brier评分、C指数值、曲线下面积(AUC)的受试者工作特征(ROC)曲线以及决策曲线分析(DCA)。此外,我们评估了RSF模型中的特征重要性,并使用验证组验证了其性能。
在训练集中成功开发了一个RSF模型和一个传统的Cox模型。RSF模型的Brier评分为0.055,低于Cox模型的0.063分,表明性能更好,因为较低的Brier评分表示模型准确性更高。基于C指数和AUC,RSF模型在性能上超过了Cox模型。此外,DCA曲线表明RSF模型提供了实质性的临床益处。我们还根据其排列重要性对时间依赖性特征进行了排名,发现手术、放疗和化疗最初是最有影响力的预测因素。此外,根据RSF模型预测,ATC患者成功分层为2个预后组,其生存率显示出显著差异。
这项预后研究首次表明,与Cox回归模型相比,RSF为ATC患者提供了更精确的总生存预测和更好的预后分层。