不同机器学习模型对非转移性结直肠癌长期总生存率预测的比较
Comparison of Different Machine Learning Models for Predicting Long-Term Overall Survival in Non-metastatic Colorectal Cancers.
作者信息
Kos Fahriye Tugba, Cecen Kaynak Songul, Aktürk Esen Selin, Arslan Hilal, Uncu Dogan
机构信息
Department of Medical Oncology, Ankara Bilkent City Hospital, Ankara, TUR.
Department of Software Engineering, Faculty of Engineering and Natural Sciences, Ankara Yıldırım Beyazıt University, Ankara, TUR.
出版信息
Cureus. 2024 Dec 14;16(12):e75713. doi: 10.7759/cureus.75713. eCollection 2024 Dec.
INTRODUCTION
In recent years, machine learning (ML) methods have gained significant popularity among medical researchers interested in cancer. We aimed to test different (ML) models to predict both overall survival and survival at specific time points in patients with non-metastatic colorectal cancer (CRC).
METHODS
The clinicopathological and treatment data of non-metastatic CRC patients with more than 10 years of follow-up at a single center were retrospectively reviewed. 1, 2, 3, 5, and 10-year survival rates for all patients and stages I-III were statistically calculated using the Kaplan-Meier method. Five distinct machine-learning algorithms were employed to develop predictive models for patient survival at five designated time points.
RESULTS
A total of 498 patients were included in the study. The decision tree model had the highest area under the curve (AUC) for 1-year survival prediction (0.89). The ensemble model had the highest AUC for predicting 2-year, 3-year, and 5-year survival predictions (0.86, 0.92, and 0.89, respectively), while the support vector machine model had the highest AUC (0.84) for predicting 10-year survival. When considering the stages separately and assessing survival for the designated time intervals, the accuracy of all five models was found to be similar, ranging around 70% or higher.
CONCLUSION
ML models can predict short- and long-term survival in patients with CRC, both for the overall patient population and when stratified by stage.
引言
近年来,机器学习(ML)方法在对癌症感兴趣的医学研究人员中颇受欢迎。我们旨在测试不同的(ML)模型,以预测非转移性结直肠癌(CRC)患者的总生存期和特定时间点的生存期。
方法
回顾性分析了在单一中心进行了超过10年随访的非转移性CRC患者的临床病理和治疗数据。使用Kaplan-Meier方法统计计算了所有患者以及I-III期患者的1年、2年、3年、5年和10年生存率。采用五种不同的机器学习算法来开发五个指定时间点患者生存的预测模型。
结果
共有498例患者纳入研究。决策树模型在1年生存预测中的曲线下面积(AUC)最高(0.89)。集成模型在预测2年、3年和5年生存率时的AUC最高(分别为0.86、0.92和0.89),而支持向量机模型在预测10年生存率时的AUC最高(0.84)。当分别考虑各阶段并评估指定时间间隔的生存率时,发现所有五个模型的准确性相似,约为70%或更高。
结论
ML模型可以预测CRC患者的短期和长期生存,无论是总体患者群体还是按阶段分层时。