• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用机器学习方法预测食管癌患者的5年生存状况。

Application of machine learning approaches to predict the 5-year survival status of patients with esophageal cancer.

作者信息

Gong Xian, Zheng Bin, Xu Guobing, Chen Hao, Chen Chun

机构信息

Department of Thoracic Surgery, Fujian Medical University Union Hospital, Fuzhou, China.

Key Laboratory of Cardio-Thoracic Surgery (Fujian Medical University), Fujian Province University, Fuzhou, China.

出版信息

J Thorac Dis. 2021 Nov;13(11):6240-6251. doi: 10.21037/jtd-21-1107.

DOI:10.21037/jtd-21-1107
PMID:34992804
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8662490/
Abstract

BACKGROUND

Accurate prognostic estimation for esophageal cancer (EC) patients plays an important role in the process of clinical decision-making. The objective of this study was to develop an effective model to predict the 5-year survival status of EC patients using machine learning (ML) algorithms.

METHODS

We retrieved the information of patients diagnosed with EC between 2010 and 2015 from the Surveillance, Epidemiology, and End Results (SEER) Program, including 24 features. A total of 8 ML models were applied to the selected dataset to classify the EC patients in terms of 5-year survival status, including 3 newly developed gradient boosting models (GBM), XGBoost, CatBoost, and LightGBM, 2 commonly used tree-based models, gradient boosting decision trees (GBDT) and random forest (RF), and 3 other ML models, artificial neural networks (ANN), naive Bayes (NB), and support vector machines (SVM). A 5-fold cross-validation was used in model performance measurement.

RESULTS

After excluding records with missing data, the final study population comprised 10,588 patients. Feature selection was conducted based on the χ test, however, the experiment results showed that the complete dataset provided better prediction of outcomes than the dataset with removal of non-significant features. Among the 8 models, XGBoost had the best performance [area under the receiver operating characteristic (ROC) curve (AUC): 0.852 for XGBoost, 0.849 for CatBoost, 0.850 for LightGBM, 0.846 for GBDT, 0.838 for RF, 0.844 for ANN, 0.833 for NB, and 0.789 for SVM]. The accuracy and logistic loss of XGBoost were 0.875 and 0.301, respectively, which were also the best performances. In the XGBoost model, the SHapley Additive exPlanations (SHAP) value was calculated and the result indicated that the four features: reason no cancer-directed surgery, Surg Prim Site, age, and stage group had the greatest impact on predicting the outcomes.

CONCLUSIONS

The XGBoost model and the complete dataset can be used to construct an accurate prognostic model for patients diagnosed with EC which may be applicable in clinical practice in the future.

摘要

背景

准确预测食管癌(EC)患者的预后在临床决策过程中起着重要作用。本研究的目的是使用机器学习(ML)算法开发一种有效的模型来预测EC患者的5年生存状况。

方法

我们从监测、流行病学和最终结果(SEER)计划中检索了2010年至2015年期间被诊断为EC的患者信息,包括24个特征。总共8种ML模型应用于选定的数据集,以根据5年生存状况对EC患者进行分类,包括3种新开发的梯度提升模型(GBM)、XGBoost、CatBoost和LightGBM,2种常用的基于树的模型,梯度提升决策树(GBDT)和随机森林(RF),以及3种其他ML模型,人工神经网络(ANN)、朴素贝叶斯(NB)和支持向量机(SVM)。在模型性能测量中使用5折交叉验证。

结果

在排除缺失数据的记录后,最终研究人群包括10588名患者。基于χ检验进行特征选择,然而,实验结果表明,完整数据集比去除非显著特征的数据集能更好地预测结果。在这8种模型中,XGBoost表现最佳[受试者操作特征(ROC)曲线下面积(AUC):XGBoost为0.852,CatBoost为0.849,LightGBM为0.850,GBDT为0.846,RF为0.838,ANN为0.844,NB为0.833,SVM为0.789]。XGBoost的准确率和逻辑损失分别为0.875和0.301,也是最佳表现。在XGBoost模型中,计算了SHapley加性解释(SHAP)值,结果表明以下四个特征:未进行癌症定向手术的原因、手术原发部位、年龄和分期组对预测结果的影响最大。

结论

XGBoost模型和完整数据集可用于为诊断为EC的患者构建准确的预后模型,未来可能适用于临床实践。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1148/8662490/9c9412ff94c5/jtd-13-11-6240-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1148/8662490/958d8294395a/jtd-13-11-6240-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1148/8662490/cc8d02c0b311/jtd-13-11-6240-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1148/8662490/e64bfb7e5807/jtd-13-11-6240-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1148/8662490/e02068400abd/jtd-13-11-6240-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1148/8662490/9c9412ff94c5/jtd-13-11-6240-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1148/8662490/958d8294395a/jtd-13-11-6240-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1148/8662490/cc8d02c0b311/jtd-13-11-6240-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1148/8662490/e64bfb7e5807/jtd-13-11-6240-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1148/8662490/e02068400abd/jtd-13-11-6240-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1148/8662490/9c9412ff94c5/jtd-13-11-6240-f5.jpg

相似文献

1
Application of machine learning approaches to predict the 5-year survival status of patients with esophageal cancer.应用机器学习方法预测食管癌患者的5年生存状况。
J Thorac Dis. 2021 Nov;13(11):6240-6251. doi: 10.21037/jtd-21-1107.
2
Application of machine learning techniques for predicting survival in ovarian cancer.机器学习技术在卵巢癌生存预测中的应用。
BMC Med Inform Decis Mak. 2022 Dec 30;22(1):345. doi: 10.1186/s12911-022-02087-y.
3
Application of an Interpretable Machine Learning Model to Predict Lymph Node Metastasis in Patients with Laryngeal Carcinoma.一种可解释的机器学习模型在预测喉癌患者淋巴结转移中的应用
J Oncol. 2022 Nov 12;2022:6356399. doi: 10.1155/2022/6356399. eCollection 2022.
4
Establishment and validation of a heart failure risk prediction model for elderly patients after coronary rotational atherectomy based on machine learning.基于机器学习的老年患者冠状动脉旋磨术后心力衰竭风险预测模型的建立与验证
PeerJ. 2024 Jan 31;12:e16867. doi: 10.7717/peerj.16867. eCollection 2024.
5
A Risk Prediction Model for Physical Restraints Among Older Chinese Adults in Long-term Care Facilities: Machine Learning Study.长期护理机构中老年人身体约束的风险预测模型:机器学习研究。
J Med Internet Res. 2023 Apr 6;25:e43815. doi: 10.2196/43815.
6
Surgical Methods and Social Factors Are Associated With Long-Term Survival in Follicular Thyroid Carcinoma: Construction and Validation of a Prognostic Model Based on Machine Learning Algorithms.手术方法和社会因素与滤泡性甲状腺癌的长期生存相关:基于机器学习算法的预后模型的构建与验证
Front Oncol. 2022 Jun 21;12:816427. doi: 10.3389/fonc.2022.816427. eCollection 2022.
7
Machine learning constructs a diagnostic prediction model for calculous pyonephrosis.机器学习构建了一个用于结石性肾盂肾炎的诊断预测模型。
Urolithiasis. 2024 Jun 19;52(1):96. doi: 10.1007/s00240-024-01587-y.
8
A 5-year survival status prognosis of nonmetastatic cervical cancer patients through machine learning algorithms.通过机器学习算法预测非转移性宫颈癌患者的 5 年生存状态。
Cancer Med. 2023 Mar;12(6):6867-6876. doi: 10.1002/cam4.5477. Epub 2022 Dec 8.
9
Machine learning-based models for the prediction of breast cancer recurrence risk.基于机器学习的乳腺癌复发风险预测模型。
BMC Med Inform Decis Mak. 2023 Nov 29;23(1):276. doi: 10.1186/s12911-023-02377-z.
10
A machine learning model based on ultrasound image features to assess the risk of sentinel lymph node metastasis in breast cancer patients: Applications of scikit-learn and SHAP.一种基于超声图像特征的机器学习模型,用于评估乳腺癌患者前哨淋巴结转移风险:scikit-learn和SHAP的应用
Front Oncol. 2022 Jul 25;12:944569. doi: 10.3389/fonc.2022.944569. eCollection 2022.

引用本文的文献

1
Machine learning-based multimodal radiomics and transcriptomics models for predicting radiotherapy sensitivity and prognosis in esophageal cancer.基于机器学习的多模态放射组学和转录组学模型用于预测食管癌的放疗敏感性和预后。
J Biol Chem. 2025 May 15;301(7):110242. doi: 10.1016/j.jbc.2025.110242.
2
A novel framework for esophageal cancer grading: combining CT imaging, radiomics, reproducibility, and deep learning insights.一种用于食管癌分级的新型框架:结合CT成像、影像组学、可重复性和深度学习见解。
BMC Gastroenterol. 2025 May 10;25(1):356. doi: 10.1186/s12876-025-03952-6.
3
Use machine learning to predict pulmonary metastasis of esophageal cancer: a population-based study.

本文引用的文献

1
Clinical decision support algorithm based on machine learning to assess the clinical response to anti-programmed death-1 therapy in patients with non-small-cell lung cancer.基于机器学习的临床决策支持算法,用于评估非小细胞肺癌患者对抗程序性死亡-1 治疗的临床反应。
Eur J Cancer. 2021 Aug;153:179-189. doi: 10.1016/j.ejca.2021.05.019. Epub 2021 Jun 26.
2
Machine Learning Model for Predicting Postoperative Survival of Patients with Colorectal Cancer.用于预测结直肠癌患者术后生存率的机器学习模型
Cancer Res Treat. 2022 Apr;54(2):517-524. doi: 10.4143/crt.2021.206. Epub 2021 Jun 15.
3
Computer Aided Breast Cancer Detection Using Ensembling of Texture and Statistical Image Features.
利用机器学习预测食管癌肺转移:一项基于人群的研究。
J Cancer Res Clin Oncol. 2024 Sep 16;150(9):420. doi: 10.1007/s00432-024-05937-6.
4
Prediction of hepatic metastasis in esophageal cancer based on machine learning.基于机器学习的食管癌肝转移预测。
Sci Rep. 2024 Jun 24;14(1):14507. doi: 10.1038/s41598-024-63213-6.
5
Screening ovarian cancer by using risk factors: machine learning assists.利用风险因素筛查卵巢癌:机器学习辅助。
Biomed Eng Online. 2024 Feb 12;23(1):18. doi: 10.1186/s12938-024-01219-x.
6
Design of risk prediction model for esophageal cancer based on machine learning approach.基于机器学习方法的食管癌风险预测模型设计
Heliyon. 2024 Jan 20;10(2):e24797. doi: 10.1016/j.heliyon.2024.e24797. eCollection 2024 Jan 30.
7
Prediction of five-year survival among esophageal cancer patients using machine learning.使用机器学习预测食管癌患者的五年生存率。
Heliyon. 2023 Nov 29;9(12):e22654. doi: 10.1016/j.heliyon.2023.e22654. eCollection 2023 Dec.
8
Biomarkers for Early Detection, Prognosis, and Therapeutics of Esophageal Cancers.用于食管癌早期检测、预后和治疗的生物标志物。
Int J Mol Sci. 2023 Feb 7;24(4):3316. doi: 10.3390/ijms24043316.
9
Machine learning for prediction of in-hospital mortality in lung cancer patients admitted to intensive care unit.机器学习在预测 ICU 收治的肺癌患者院内死亡率中的应用。
PLoS One. 2023 Jan 26;18(1):e0280606. doi: 10.1371/journal.pone.0280606. eCollection 2023.
10
Development and validation of a machine learning model for survival risk stratification after esophageal cancer surgery.食管癌手术后生存风险分层的机器学习模型的开发与验证
Front Oncol. 2022 Dec 9;12:1068198. doi: 10.3389/fonc.2022.1068198. eCollection 2022.
基于纹理和统计图像特征集成的计算机辅助乳腺癌检测
Sensors (Basel). 2021 May 23;21(11):3628. doi: 10.3390/s21113628.
4
Digital Biomarkers of Symptom Burden Self-Reported by Perioperative Patients Undergoing Pancreatic Surgery: Prospective Longitudinal Study.接受胰腺手术的围手术期患者自我报告的症状负担的数字生物标志物:前瞻性纵向研究。
JMIR Cancer. 2021 Apr 27;7(2):e27975. doi: 10.2196/27975.
5
Neoadjuvant chemoradiotherapy improves survival in locally advanced adenocarcinoma of esophagogastric junction compared with neoadjuvant chemotherapy: a propensity score matching analysis.新辅助放化疗对比新辅助化疗可改善局部进展期胃食管结合部腺癌患者的生存:一项倾向评分匹配分析。
BMC Surg. 2021 Mar 17;21(1):137. doi: 10.1186/s12893-021-01136-z.
6
A Novel XGBoost Method to Identify Cancer Tissue-of-Origin Based on Copy Number Variations.一种基于拷贝数变异识别癌症组织起源的新型XGBoost方法。
Front Genet. 2020 Nov 20;11:585029. doi: 10.3389/fgene.2020.585029. eCollection 2020.
7
Preoperative identification of microvascular invasion in hepatocellular carcinoma by XGBoost and deep learning.基于 XGBoost 和深度学习的肝细胞癌微血管侵犯术前预测。
J Cancer Res Clin Oncol. 2021 Mar;147(3):821-833. doi: 10.1007/s00432-020-03366-9. Epub 2020 Aug 27.
8
Prognostic evaluation of esophageal cancer patients with stages I-III.Ⅰ-Ⅲ期食管癌患者的预后评估。
Aging (Albany NY). 2020 Jul 23;12(14):14736-14753. doi: 10.18632/aging.103532.
9
Breast cancer diagnosis from histopathological images using textural features and CBIR.基于纹理特征和 CBIR 的乳腺组织病理图像乳腺癌诊断。
Artif Intell Med. 2020 May;105:101845. doi: 10.1016/j.artmed.2020.101845. Epub 2020 Apr 22.
10
Machine Learning Algorithms for Predicting the Recurrence of Stage IV Colorectal Cancer After Tumor Resection.机器学习算法预测肿瘤切除后 IV 期结直肠癌的复发。
Sci Rep. 2020 Feb 13;10(1):2519. doi: 10.1038/s41598-020-59115-y.