使用机器学习算法预测口咽鳞状细胞癌的生存率：一项基于监测、流行病学和最终结果数据库的研究。

Prediction of survival in oropharyngeal squamous cell carcinoma using machine learning algorithms: A study based on the surveillance, epidemiology, and end results database.

作者信息

Kim Su Il, Kang Jeong Wook, Eun Young-Gyu, Lee Young Chan

机构信息

Department of Otolaryngology-Head and Neck Surgery, Kyung Hee University School of Medicine, Seoul, South Korea.

出版信息

Front Oncol. 2022 Aug 22;12:974678. doi: 10.3389/fonc.2022.974678. eCollection 2022.

DOI:10.3389/fonc.2022.974678

PMID:36072804

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9441569/

Abstract

BACKGROUND

We determined appropriate survival prediction machine learning models for patients with oropharyngeal squamous cell carcinoma (OPSCC) using the "Surveillance, Epidemiology, and End Results" (SEER) database.

METHODS

In total, 4039 patients diagnosed with OPSCC between 2004 and 2016 were enrolled in this study. In particular, 13 variables were selected and analyzed: age, sex, tumor grade, tumor size, neck dissection, radiation therapy, cancer directed surgery, chemotherapy, T stage, N stage, M stage, clinical stage, and human papillomavirus (HPV) status. The T-, N-, and clinical staging were reconstructed based on the American Joint Committee on Cancer (AJCC) Staging Manual, 8th Edition. The patients were randomly assigned to a development or test dataset at a 7:3 ratio. The extremely randomized survival tree (EST), conditional survival forest (CSF), and DeepSurv models were used to predict the overall and disease-specific survival in patients with OPSCC. A 10-fold cross-validation on a development dataset was used to build the training and internal validation data for all models. We evaluated the predictive performance of each model using test datasets.

RESULTS

A higher c-index value and lower integrated Brier score (IBS), root mean square error (RMSE), and mean absolute error (MAE) indicate a better performance from a machine learning model. The C-index was the highest for the DeepSurv model (0.77). The IBS was also the lowest in the DeepSurv model (0.08). However, the RMSE and RAE were the lowest for the CSF model.

CONCLUSIONS

We demonstrated various machine-learning-based survival prediction models. The CSF model showed a better performance in predicting the survival of patients with OPSCC in terms of the RMSE and RAE. In this context, machine learning models based on personalized survival predictions can be used to stratify various complex risk factors. This could help in designing personalized treatments and predicting prognoses for patients.

摘要

背景

我们使用“监测、流行病学和最终结果”（SEER）数据库为口咽鳞状细胞癌（OPSCC）患者确定了合适的生存预测机器学习模型。

方法

本研究共纳入2004年至2016年间诊断为OPSCC的4039例患者。特别选取并分析了13个变量：年龄、性别、肿瘤分级、肿瘤大小、颈部清扫术、放射治疗、癌症定向手术、化疗、T分期、N分期、M分期、临床分期和人乳头瘤病毒（HPV）状态。T分期、N分期和临床分期根据美国癌症联合委员会（AJCC）第8版分期手册进行重建。患者以7:3的比例随机分配到开发或测试数据集。使用极端随机生存树（EST）、条件生存森林（CSF）和DeepSurv模型预测OPSCC患者的总生存期和疾病特异性生存期。在开发数据集上进行10倍交叉验证，为所有模型构建训练和内部验证数据。我们使用测试数据集评估每个模型的预测性能。

结果

更高的c指数值和更低的综合Brier评分（IBS）、均方根误差（RMSE）和平均绝对误差（MAE）表明机器学习模型的性能更好。DeepSurv模型的C指数最高（0.77）。DeepSurv模型的IBS也最低（0.08）。然而，CSF模型的RMSE和RAE最低。

结论

我们展示了各种基于机器学习的生存预测模型。CSF模型在预测OPSCC患者生存方面，在RMSE和RAE方面表现更好。在这种情况下，基于个性化生存预测的机器学习模型可用于对各种复杂风险因素进行分层。这有助于为患者设计个性化治疗方案并预测预后。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/febb/9441569/b24d42b26726/fonc-12-974678-g001.jpg

相似文献

Prediction of survival in oropharyngeal squamous cell carcinoma using machine learning algorithms: A study based on the surveillance, epidemiology, and end results database.使用机器学习算法预测口咽鳞状细胞癌的生存率：一项基于监测、流行病学和最终结果数据库的研究。

Front Oncol. 2022 Aug 22;12:974678. doi: 10.3389/fonc.2022.974678. eCollection 2022.

Development and validation of machine learning models for predicting prognosis and guiding individualized postoperative chemotherapy: A real-world study of distal cholangiocarcinoma.用于预测预后和指导个体化术后化疗的机器学习模型的开发与验证：一项远端胆管癌的真实世界研究

Front Oncol. 2023 Mar 15;13:1106029. doi: 10.3389/fonc.2023.1106029. eCollection 2023.

Development and validation of a deep learning-based survival prediction model for pediatric glioma patients: A retrospective study using the SEER database and Chinese data.基于深度学习的儿童脑胶质瘤患者生存预测模型的建立与验证：SEER 数据库与中国数据的回顾性研究

Comput Biol Med. 2024 Nov;182:109185. doi: 10.1016/j.compbiomed.2024.109185. Epub 2024 Sep 27.

Deep learning models for predicting the survival of patients with chondrosarcoma based on a surveillance, epidemiology, and end results analysis.基于监测、流行病学和最终结果分析的预测软骨肉瘤患者生存率的深度学习模型。

Front Oncol. 2022 Aug 22;12:967758. doi: 10.3389/fonc.2022.967758. eCollection 2022.

Potential Added Value of PET/CT Radiomics for Survival Prognostication beyond AJCC 8th Edition Staging in Oropharyngeal Squamous Cell Carcinoma.PET/CT影像组学在口咽鳞状细胞癌中超越美国癌症联合委员会第8版分期进行生存预后评估的潜在附加价值

Cancers (Basel). 2020 Jul 3;12(7):1778. doi: 10.3390/cancers12071778.

Development and validation of survival prediction model for gastric adenocarcinoma patients using deep learning: A SEER-based study.基于深度学习的胃腺癌患者生存预测模型的开发与验证：一项基于监测、流行病学和最终结果（SEER）数据库的研究

Front Oncol. 2023 Mar 7;13:1131859. doi: 10.3389/fonc.2023.1131859. eCollection 2023.

An interpretable machine learning prognostic system for risk stratification in oropharyngeal cancer.一种可解释的机器学习预后系统，用于口咽癌的风险分层。

Int J Med Inform. 2022 Dec;168:104896. doi: 10.1016/j.ijmedinf.2022.104896. Epub 2022 Oct 13.

Predicting the survival of patients with pancreatic neuroendocrine neoplasms using deep learning: A study based on Surveillance, Epidemiology, and End Results database.使用深度学习预测胰腺神经内分泌肿瘤患者的生存情况：基于监测、流行病学和最终结果数据库的研究。

Cancer Med. 2023 Jun;12(11):12413-12424. doi: 10.1002/cam4.5949. Epub 2023 May 11.

Deep learning model for predicting the survival of patients with primary gastrointestinal lymphoma based on the SEER database and a multicentre external validation cohort.基于监测、流行病学和最终结果（SEER）数据库及多中心外部验证队列的预测原发性胃肠道淋巴瘤患者生存情况的深度学习模型

J Cancer Res Clin Oncol. 2023 Oct;149(13):12177-12189. doi: 10.1007/s00432-023-05123-0. Epub 2023 Jul 10.

Changing prognostic significance of tumor stage and nodal stage in patients with squamous cell carcinoma of the oropharynx in the human papillomavirus era.人乳头瘤病毒时代口咽鳞癌患者肿瘤分期和淋巴结分期预后意义的变化。

Cancer. 2015 Aug 1;121(15):2594-602. doi: 10.1002/cncr.29402. Epub 2015 Apr 14.

引用本文的文献

Prognosing post-treatment outcomes of head and neck cancer using structured data and machine learning: A systematic review.使用结构化数据和机器学习预测头颈部癌症治疗后的结局：系统评价。

PLoS One. 2024 Jul 24;19(7):e0307531. doi: 10.1371/journal.pone.0307531. eCollection 2024.

Creation of a machine learning-based prognostic prediction model for various subtypes of laryngeal cancer.基于机器学习的多种喉癌亚型预后预测模型的建立。

Sci Rep. 2024 Mar 18;14(1):6484. doi: 10.1038/s41598-024-56687-x.

A deep learning algorithm with good prediction efficacy for cancer-specific survival in osteosarcoma: A retrospective study.一种对骨肉瘤患者癌症特异性生存具有良好预测效能的深度学习算法：一项回顾性研究。

PLoS One. 2023 Sep 28;18(9):e0286841. doi: 10.1371/journal.pone.0286841. eCollection 2023.

Prediction of lung papillary adenocarcinoma-specific survival using ensemble machine learning models.使用集成机器学习模型预测肺乳头状腺癌特异性生存。

Sci Rep. 2023 Sep 8;13(1):14827. doi: 10.1038/s41598-023-40779-1.

Predicting overall survival in chordoma patients using machine learning models: a web-app application.使用机器学习模型预测 chordoma 患者的总生存期：一个网络应用程序。

J Orthop Surg Res. 2023 Sep 2;18(1):652. doi: 10.1186/s13018-023-04105-9.

De-escalated radiation for human papillomavirus virus-related oropharyngeal cancer: evolving paradigms and future strategies.人乳头瘤病毒相关口咽癌的降级放疗：不断演变的模式与未来策略

Front Oncol. 2023 Jul 27;13:1175578. doi: 10.3389/fonc.2023.1175578. eCollection 2023.

Front Oncol. 2023 Mar 7;13:1131859. doi: 10.3389/fonc.2023.1131859. eCollection 2023.

本文引用的文献

Patient Selection for Surgery vs Radiotherapy for Early Stage Oropharyngeal Cancer.早期口咽癌手术与放疗的患者选择。

Cancer Control. 2021 Jan-Dec;28:10732748211050770. doi: 10.1177/10732748211050770.

A Novel Imputation Approach for Sharing Protected Public Health Data.一种用于共享受保护公共卫生数据的新型插补方法。

Am J Public Health. 2021 Oct;111(10):1830-1838. doi: 10.2105/AJPH.2021.306432. Epub 2021 Sep 16.

Benefit of postoperative radiotherapy in patients with oropharyngeal squamous cell carcinoma in human papillomavirus (HPV) era: A Surveillance, Epidemiology, and End Results (SEER) database analysis.HPV 时代口咽鳞状细胞癌患者术后放疗的获益：监测、流行病学和最终结果（SEER）数据库分析。

Surgery. 2021 Aug;170(2):541-549. doi: 10.1016/j.surg.2021.01.034. Epub 2021 Mar 2.

Deep learning-based survival prediction for multiple cancer types using histopathology images.基于深度学习的多癌症类型生存预测：使用组织病理学图像。

PLoS One. 2020 Jun 17;15(6):e0233678. doi: 10.1371/journal.pone.0233678. eCollection 2020.

Deep learning-based survival prediction of oral cancer patients.基于深度学习的口腔癌患者生存预测。

Sci Rep. 2019 May 6;9(1):6994. doi: 10.1038/s41598-019-43372-7.

Changes in the 8th Edition of the American Joint Committee on Cancer (AJCC) Staging of Head and Neck Cancer: Rationale and Implications.第 8 版美国癌症联合委员会（AJCC）头颈部癌症分期的变化：原理与意义。

Curr Oncol Rep. 2019 Apr 17;21(6):52. doi: 10.1007/s11912-019-0799-x.

Recent Advances of Deep Learning in Bioinformatics and Computational Biology.深度学习在生物信息学和计算生物学中的最新进展

Front Genet. 2019 Mar 26;10:214. doi: 10.3389/fgene.2019.00214. eCollection 2019.

Dynamic-DeepHit: A Deep Learning Approach for Dynamic Survival Analysis With Competing Risks Based on Longitudinal Data.动态深度命中：一种基于纵向数据的具有竞争风险的动态生存分析的深度学习方法。

IEEE Trans Biomed Eng. 2020 Jan;67(1):122-133. doi: 10.1109/TBME.2019.2909027. Epub 2019 Apr 3.

Primary surgery versus primary radiation-based treatment for locally advanced oropharyngeal cancer.局部晚期口咽癌的原发手术与基于原发放疗的治疗对比

Laryngoscope. 2018 Jun;128(6):1353-1364. doi: 10.1002/lary.26903. Epub 2017 Oct 8.

Treatment selection in oropharyngeal cancer: a surveillance, epidemiology, and end results (SEER) patterns of care analysis.口咽癌的治疗选择：一项监测、流行病学及最终结果（SEER）护理模式分析

Cancer Causes Control. 2017 Oct;28(10):1085-1093. doi: 10.1007/s10552-017-0938-3. Epub 2017 Aug 16.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用机器学习算法预测口咽鳞状细胞癌的生存率：一项基于监测、流行病学和最终结果数据库的研究。

Prediction of survival in oropharyngeal squamous cell carcinoma using machine learning algorithms: A study based on the surveillance, epidemiology, and end results database.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献