Suppr超能文献

利用梯度提升机建立预测结直肠癌患者生存模型的研究

Development and validation of a model to predict survival in colorectal cancer using a gradient-boosted machine.

机构信息

Radiation Oncology, Stanford Medicine, Stanford, California, USA

Radiation Oncology, Stanford Medicine, Stanford, California, USA.

出版信息

Gut. 2021 May;70(5):884-889. doi: 10.1136/gutjnl-2020-321799. Epub 2020 Sep 4.

Abstract

OBJECTIVE

The success of treatment planning relies critically on our ability to predict the potential benefit of a therapy. In colorectal cancer (CRC), several nomograms are available to predict different outcomes based on the use of tumour specific features. Our objective is to provide an accurate and explainable prediction of the risk to die within 10 years after CRC diagnosis, by incorporating the tumour features and the patient medical and demographic information.

DESIGN

In the prostate, lung, colorectal and ovarian cancer screening (PLCO) Trial, participants (n=154 900) were randomised to screening with flexible sigmoidoscopy, with a repeat screening at 3 or 5 years, or to usual care. We selected patients who were diagnosed with CRC during the follow-up to train a gradient-boosted model to predict the risk to die within 10 years after CRC diagnosis. Using Shapley values, we determined the 20 most relevant features and provided explanation to prediction.

RESULTS

During the follow-up, 2359 patients were diagnosed with CRC. Median follow-up was 16.8 years (14.4-18.9) for mortality. In total, 686 patients (29%) died from CRC during the follow-up. The dataset was randomly split into a training (n=1887) and a testing (n=472) dataset. The area under the receiver operating characteristic was 0.84 (±0.04) and accuracy was 0.83 (±0.04) with a 0.5 classification threshold. The model is available online for research use.

CONCLUSIONS

We trained and validated a model with prospective data from a large multicentre cohort of patients. The model has high predictive performances at the individual scale. It could be used to discuss treatment strategies.

摘要

目的

治疗计划的成功与否关键取决于我们预测治疗潜在获益的能力。在结直肠癌(CRC)中,有几种列线图可根据肿瘤特异性特征预测不同的结局。我们的目标是通过纳入肿瘤特征以及患者的医疗和人口统计学信息,对 CRC 诊断后 10 年内死亡的风险进行准确且可解释的预测。

设计

在前列腺、肺、结直肠和卵巢癌筛查(PLCO)试验中,参与者(n=154900)被随机分配接受软性乙状结肠镜筛查,可选择 3 年或 5 年重复筛查,或接受常规护理。我们选择了在随访期间被诊断为 CRC 的患者,以训练一个梯度提升模型来预测 CRC 诊断后 10 年内死亡的风险。使用 Shapley 值,我们确定了 20 个最相关的特征,并提供了预测的解释。

结果

在随访期间,有 2359 名患者被诊断为 CRC。中位随访时间为 16.8 年(14.4-18.9),随访期间总共有 686 名患者(29%)死于 CRC。数据集随机分为训练集(n=1887)和测试集(n=472)。接收器操作特征曲线下面积为 0.84(±0.04),准确度为 0.83(±0.04),分类阈值为 0.5。该模型可在线供研究使用。

结论

我们使用来自大型多中心患者队列的前瞻性数据训练和验证了一个模型。该模型在个体层面上具有较高的预测性能。它可以用于讨论治疗策略。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验