• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的预测胃癌患者就诊延迟风险的模型。

Machine learning based models for predicting presentation delay risk among gastric cancer patients.

作者信息

Zhou Huali, Gu Qiong, Bao Rong, Qiu Liping, Zhang Yuhan, Wang Fang, Liu Wenlian, Wu Lingling, Li Li, Ren Yihua, Qiu Lei, Wang Qian, Zhang Gaomin, Qiao Xiaoqing, Yuan Wenjie, Ren Juan, Luo Min, Huang Rong, Yang Qing

机构信息

School of Nursing, Chengdu Medical College, Chengdu, China.

Department of Gastric Surgery, Sichuan Clinical Research Center for Cancer, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, Affiliated Cancer Hospital of University of Electronic Science and Technology of China, Chengdu, China.

出版信息

Front Oncol. 2025 Jan 13;14:1503047. doi: 10.3389/fonc.2024.1503047. eCollection 2024.

DOI:10.3389/fonc.2024.1503047
PMID:39871938
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11769801/
Abstract

OBJECTIVE

Presentation delay of cancer patients prevents the patient from timely diagnosis and treatment leading to poor prognosis. Predicting the risk of presentation delay is crucial to improve the treatment outcomes. This study aimed to develop and validate prediction models of presentation delay risk in gastric cancer patients by using various machine learning models.

METHODS

875 cases of gastric cancer patients admitted to a tertiary oncology hospital from July 2023 to June 2024 were used as derivation cohort, 200 cases of gastric cancer patients admitted to other 4 tertiary hospital were used as external validation cohort. After collecting the data, statistical analysis was performed to identify discriminative variables for the prediction of presentation delay and 13 statistically significant variables are selected to develop machine learning models. The derivation cohort was randomly assigned to the training and internal validation set by the ratio of 7:3. Prediction models were developed based on six machine learning algorithms, which are logistic regression (LR), support vector machine (SVM), random forest (RF), gradient boosted trees (GBDT), extremely gradient boosting (XGBoost) and muti-layer perceptron (MLP). The discrimination and calibration of each model were assessed based on various metrics including accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), F1-Score and area under curve (AUC), calibration curves and Brier scores. The best model was selected based on comparing of various metrics. Based on the selected best model, the impact of features to the prediction result was analyzed with the permutation feature importance method.

RESULTS

The incidence of presentation delay for gastric cancer patients was 39.3%. The developed models achieved performance metrics as AUC (0.893-0.925), accuracy (0.817-0.847), sensitivity (0.857-0.905), specificity (0.783-0.854), PPV (0.728-0.798), NPV (0.897-0.927), F1 score (0.791-0.826) and Brier score (0.107-0.138) in internal validation set, which indicated good discrimination and calibration for the prediction of presentation delay in gastric cancer patients. Among all models, RF based model was selected as the best one as it achieved good discrimination and calibration performance on both of internal and external validation set. Feature ranking results indicated that both of subjective and objective factors have significant impact on the occurrence of presentation delay in gastric cancer patients.

CONCLUSION

This study demonstrated that the RF based model has favorable performance for the prediction of presentation delay in gastric cancer patients. It can help medical staffs to screen out high-risk gastric cancer patients for presentation delay, and to take appropriate and specific interventions to reduce the risk of presentation delay.

摘要

目的

癌症患者的就诊延迟会妨碍患者及时诊断和治疗,导致预后不良。预测就诊延迟风险对于改善治疗结果至关重要。本研究旨在通过使用各种机器学习模型来开发和验证胃癌患者就诊延迟风险的预测模型。

方法

将2023年7月至2024年6月在一家三级肿瘤医院收治的875例胃癌患者作为推导队列,将在其他4家三级医院收治的200例胃癌患者作为外部验证队列。收集数据后,进行统计分析以确定用于预测就诊延迟的判别变量,并选择13个具有统计学意义的变量来开发机器学习模型。推导队列按7:3的比例随机分配到训练集和内部验证集。基于六种机器学习算法开发预测模型,即逻辑回归(LR)、支持向量机(SVM)、随机森林(RF)、梯度提升树(GBDT)、极端梯度提升(XGBoost)和多层感知器(MLP)。基于各种指标评估每个模型的判别力和校准度,包括准确性、敏感性、特异性、阳性预测值(PPV)、阴性预测值(NPV)、F1分数和曲线下面积(AUC)、校准曲线和布里尔分数。通过比较各种指标选择最佳模型。基于所选的最佳模型,使用排列特征重要性方法分析特征对预测结果的影响。

结果

胃癌患者就诊延迟的发生率为39.3%。所开发的模型在内部验证集中的性能指标为AUC(0.893 - 0.925)、准确性(0.817 - 0.847)、敏感性(0.857 - 0.905)、特异性(0.783 - 0.854)、PPV(0.728 - 0.798)、NPV(0.897 - 0.927)、F1分数(0.791 - 0.826)和布里尔分数(0.107 - 0.138),这表明在预测胃癌患者就诊延迟方面具有良好的判别力和校准度。在所有模型中,基于RF的模型被选为最佳模型,因为它在内部和外部验证集上均具有良好的判别力和校准性能。特征排序结果表明,主观和客观因素均对胃癌患者就诊延迟的发生有显著影响。

结论

本研究表明,基于RF的模型在预测胃癌患者就诊延迟方面具有良好的性能。它可以帮助医务人员筛查出就诊延迟的高危胃癌患者,并采取适当的针对性干预措施以降低就诊延迟风险。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d182/11769801/eb648ff7cfdd/fonc-14-1503047-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d182/11769801/d87eaf09ad4e/fonc-14-1503047-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d182/11769801/b92372d3367e/fonc-14-1503047-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d182/11769801/6bce7a57b796/fonc-14-1503047-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d182/11769801/7e9610b37ad2/fonc-14-1503047-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d182/11769801/500c8fd4fd8d/fonc-14-1503047-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d182/11769801/eb648ff7cfdd/fonc-14-1503047-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d182/11769801/d87eaf09ad4e/fonc-14-1503047-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d182/11769801/b92372d3367e/fonc-14-1503047-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d182/11769801/6bce7a57b796/fonc-14-1503047-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d182/11769801/7e9610b37ad2/fonc-14-1503047-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d182/11769801/500c8fd4fd8d/fonc-14-1503047-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d182/11769801/eb648ff7cfdd/fonc-14-1503047-g006.jpg

相似文献

1
Machine learning based models for predicting presentation delay risk among gastric cancer patients.基于机器学习的预测胃癌患者就诊延迟风险的模型。
Front Oncol. 2025 Jan 13;14:1503047. doi: 10.3389/fonc.2024.1503047. eCollection 2024.
2
Development and validation of a prediction model for coronary heart disease risk in depressed patients aged 20 years and older using machine learning algorithms.使用机器学习算法开发并验证针对20岁及以上抑郁症患者冠心病风险的预测模型。
Front Cardiovasc Med. 2025 Jan 9;11:1504957. doi: 10.3389/fcvm.2024.1504957. eCollection 2024.
3
[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].基于监督机器学习算法构建脓毒症休克患者死亡风险预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.
4
Prediction of lumbar disc degeneration based on interpretable machine learning models: retrospective cohort study.基于可解释机器学习模型的腰椎间盘退变预测:回顾性队列研究
Spine J. 2025 Apr 9. doi: 10.1016/j.spinee.2025.04.004.
5
Development and validation of a web-based artificial intelligence prediction model to assess massive intraoperative blood loss for metastatic spinal disease using machine learning techniques.开发和验证一种基于网络的人工智能预测模型,使用机器学习技术评估转移性脊柱疾病的术中大量失血。
Spine J. 2024 Jan;24(1):146-160. doi: 10.1016/j.spinee.2023.09.001. Epub 2023 Sep 11.
6
Development and validation of inpatient mortality prediction models for patients with hyperglycemic crisis using machine learning approaches.使用机器学习方法开发和验证高血糖危象患者的住院死亡率预测模型
BMC Endocr Disord. 2025 Mar 27;25(1):86. doi: 10.1186/s12902-025-01873-9.
7
Establishment and validation of an interactive artificial intelligence platform to predict postoperative ambulatory status for patients with metastatic spinal disease: a multicenter analysis.建立和验证交互式人工智能平台,以预测转移性脊柱疾病患者的术后活动状态:一项多中心分析。
Int J Surg. 2024 May 1;110(5):2738-2756. doi: 10.1097/JS9.0000000000001169.
8
Application of machine learning model in predicting the likelihood of blood transfusion after hip fracture surgery.机器学习模型在预测髋部骨折手术后输血可能性中的应用。
Aging Clin Exp Res. 2023 Nov;35(11):2643-2656. doi: 10.1007/s40520-023-02550-4. Epub 2023 Sep 21.
9
[Construction of a predictive model for in-hospital mortality of sepsis patients in intensive care unit based on machine learning].基于机器学习构建重症监护病房脓毒症患者院内死亡率预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2023 Jul;35(7):696-701. doi: 10.3760/cma.j.cn121430-20221219-01104.
10
Prediction and feature selection of low birth weight using machine learning algorithms.利用机器学习算法预测和选择低出生体重。
J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.

本文引用的文献

1
Cancer incidence and mortality in China, 2022.2022年中国癌症发病率与死亡率
J Natl Cancer Cent. 2024 Feb 2;4(1):47-53. doi: 10.1016/j.jncc.2024.01.006. eCollection 2024 Mar.
2
Factors influencing presentation delay among cancer patients: a cross-sectional study in Malaysia.癌症患者就诊延误的影响因素:马来西亚的一项横断面研究。
BMC Public Health. 2024 May 8;24(1):1260. doi: 10.1186/s12889-024-18643-2.
3
Comparison of machine learning models to predict the risk of breast cancer-related lymphedema among breast cancer survivors: a cross-sectional study in China.
预测乳腺癌幸存者发生乳腺癌相关淋巴水肿风险的机器学习模型比较:一项中国的横断面研究
Front Oncol. 2024 Feb 12;14:1334082. doi: 10.3389/fonc.2024.1334082. eCollection 2024.
4
The application and use of artificial intelligence in cancer nursing: A systematic review.人工智能在癌症护理中的应用和使用:系统评价。
Eur J Oncol Nurs. 2024 Feb;68:102510. doi: 10.1016/j.ejon.2024.102510. Epub 2024 Jan 14.
5
Machine learning-based models for the prediction of breast cancer recurrence risk.基于机器学习的乳腺癌复发风险预测模型。
BMC Med Inform Decis Mak. 2023 Nov 29;23(1):276. doi: 10.1186/s12911-023-02377-z.
6
Delay in diagnosis to treatment and impact on survival of gastric adenocarcinoma in a low income setting without screening facility.在没有筛查设施的低收入环境中,胃腺癌的诊断延迟对治疗和生存的影响。
Sci Rep. 2023 Nov 23;13(1):20628. doi: 10.1038/s41598-023-47415-y.
7
Development of a Multilevel Model to Identify Patients at Risk for Delay in Starting Cancer Treatment.开发一个多层次模型,以识别癌症治疗开始延迟风险的患者。
JAMA Netw Open. 2023 Aug 1;6(8):e2328712. doi: 10.1001/jamanetworkopen.2023.28712.
8
Applications of different machine learning approaches in prediction of breast cancer diagnosis delay.不同机器学习方法在乳腺癌诊断延迟预测中的应用
Front Oncol. 2023 Feb 16;13:1103369. doi: 10.3389/fonc.2023.1103369. eCollection 2023.
9
Rural-Urban Disparities in Cancer Outcomes: Opportunities for Future Research.城乡癌症结局差异:未来研究的机遇。
J Natl Cancer Inst. 2022 Jul 11;114(7):940-952. doi: 10.1093/jnci/djac030.
10
Barriers and Facilitators Associated with Delays in the Diagnosis and Treatment of Gastric Cancer: a Systematic Review.与胃癌诊断和治疗延迟相关的障碍和促进因素:系统评价。
J Gastrointest Cancer. 2022 Sep;53(3):782-796. doi: 10.1007/s12029-021-00673-3. Epub 2021 Sep 9.