胃癌和食管癌患者VTE风险预测模型的开发与验证

Development and validation of a prediction model for VTE risk in gastric and esophageal cancer patients.

作者信息

Zheng Xingyue, Wu Liuyun, Li Lian, Wang Yin, Yin Qinan, Han Lizhu, Wu Xingwei, Bian Yuan

机构信息

Department of Pharmacy, Personalized Drug Therapy Key Laboratory of Sichuan Province, Sichuan Academy of Medical Sciences and Sichuan Provincial People's Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China.

出版信息

Front Pharmacol. 2025 Feb 28;16:1448879. doi: 10.3389/fphar.2025.1448879. eCollection 2025.

DOI:10.3389/fphar.2025.1448879

PMID:40093315

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11906997/

Abstract

OBJECTIVE

This study focuses on the risk of venous thromboembolism (VTE) in patients with gastric or esophageal cancer (GC/EC), investigating the risk factors for VTE in this population. Utilizing machine learning techniques, the research aims to develop an interpretable VTE risk prediction model. The goal is to identify patients with gastric or esophageal cancer who are at high risk of VTE at an early stage in clinical practice, thereby enabling precise anticoagulant prophylaxis and thrombus management.

METHODS

This study is a real-world investigation aimed at predicting VTE in patients with GC/EC. Data were collected from inpatients diagnosed with GC/EC at Sichuan Provincial People's Hospital between 1 January 2018, and 31 June 2023. Using nine supervised learning algorithms, 576 prediction models were developed based on 56 available variables. Subsequently, a simplified modeling approach was employed using the top 12 feature variables from the best-performing model. The primary metric for assessing the predictive performance of the models was the area under the ROC curve (AUC). Additionally, the training data used to construct the best model in this study were employed to externally validate several existing assessment models, including the Padua, Caprini, Khorana, and COMPASS-CAT scores.

RESULTS

A total of 3,742 cases of GC/EC patients were collected after excluding duplicate visit information. The study included 861 (23.0%) patients, of which 124 (14.4%) developed VTE. The top five models based on AUC for full-variable modeling are as follows: GBoost (0.9646), Logic Regression (0.9443), AdaBoost (0.9382), CatBoost (0.9354), XGBoost (0.8097). For simplified modeling, the models are: Simp-CatBoost (0.8811), Simp-GBoost (0.8771), Simp-Random Forest (0.8736), Simp-AdaBoost (0.8263), Simp-Logistic Regression (0.8090). After evaluating predictive performance and practicality, the Simp-GBoost model was determined as the best model for this study. External validation of the Padua score, Caprini score, Khorana score, and COMPASS-CAT score based on the training set of the Simp-GBoost model yielded AUCs of 0.4367, 0.2900, 0.5000, and 0.3633, respectively.

CONCLUSION

In this study, we analyzed the risk factors of VTE in GC/EC patients, and constructed a well-performing VTE risk prediction model capable of accurately identifying the extent of VTE risk in patients. Four VTE prediction scoring systems were introduced to externally validate the dataset of this study. The results demonstrated that the VTE risk prediction model established in this study held greater clinical utility for patients with GC/EC. The Simp-GB model can provide intelligent assistance in the early clinical assessment of VTE risk in these patients.

摘要

目的

本研究聚焦于胃癌或食管癌（GC/EC）患者的静脉血栓栓塞症（VTE）风险，调查该人群中VTE的风险因素。利用机器学习技术，本研究旨在开发一个可解释的VTE风险预测模型。目标是在临床实践中早期识别出具有高VTE风险的胃癌或食管癌患者，从而实现精准的抗凝预防和血栓管理。

方法

本研究是一项旨在预测GC/EC患者VTE的真实世界调查。数据收集自2018年1月1日至2023年6月31日在四川省人民医院诊断为GC/EC的住院患者。使用九种监督学习算法，基于56个可用变量开发了576个预测模型。随后，采用简化建模方法，使用表现最佳模型中的前12个特征变量。评估模型预测性能的主要指标是ROC曲线下面积（AUC）。此外，本研究中用于构建最佳模型的训练数据被用于外部验证几个现有的评估模型，包括帕多瓦评分、卡普里尼评分、霍拉纳评分和COMPASS-CAT评分。

结果

排除重复就诊信息后，共收集到3742例GC/EC患者。该研究纳入了861例（23.0%）患者，其中124例（14.4%）发生了VTE。基于全变量建模的AUC排名前五的模型如下：GBoost（0.9646）、逻辑回归（0.9443）、AdaBoost（0.9382）、CatBoost（0.9354）、XGBoost（0.8097）。对于简化建模，模型如下：Simp-CatBoost（0.8811）、Simp-GBoost（0.8771）、Simp-随机森林（0.8736）、Simp-AdaBoost（0.8263）、Simp-逻辑回归（0.8090）。在评估预测性能和实用性后，Simp-GBoost模型被确定为本研究的最佳模型。基于Simp-GBoost模型训练集对帕多瓦评分、卡普里尼评分、霍拉纳评分和COMPASS-CAT评分进行外部验证，得到的AUC分别为0.4367、0.2900、0.5000和0.3633。

结论

在本研究中，我们分析了GC/EC患者VTE的风险因素，并构建了一个性能良好的VTE风险预测模型，能够准确识别患者的VTE风险程度。引入了四个VTE预测评分系统对本研究数据集进行外部验证。结果表明，本研究建立的VTE风险预测模型对GC/EC患者具有更大的临床实用性。Simp-GB模型可为这些患者VTE风险的早期临床评估提供智能辅助。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87bf/11906997/447c70fb66a7/fphar-16-1448879-g001.jpg

相似文献

Development and validation of a prediction model for VTE risk in gastric and esophageal cancer patients.

Front Pharmacol. 2025 Feb 28;16:1448879. doi: 10.3389/fphar.2025.1448879. eCollection 2025.

Development and validation of machine learning models for postoperative venous thromboembolism prediction in colorectal cancer inpatients: a retrospective study.

J Gastrointest Oncol. 2023 Feb 28;14(1):220-232. doi: 10.21037/jgo-23-18. Epub 2023 Feb 15.

Derivation, validation and assessment of a novel nomogram-based risk assessment model for venous thromboembolism in hospitalized patients with lung cancer: A retrospective case control study.

Front Oncol. 2022 Oct 10;12:988287. doi: 10.3389/fonc.2022.988287. eCollection 2022.

Ten-Year Multicenter Retrospective Study Utilizing Machine Learning Algorithms to Identify Patients at High Risk of Venous Thromboembolism After Radical Gastrectomy.

Int J Gen Med. 2023 May 18;16:1909-1925. doi: 10.2147/IJGM.S408770. eCollection 2023.

Comparison between the Khorana prediction score and Caprini risk assessment models for assessing the risk of venous thromboembolism in hospitalized patients with cancer: a retrospective case control study.

Interact Cardiovasc Thorac Surg. 2020 Oct 1;31(4):454-460. doi: 10.1093/icvts/ivaa137.

[Risk prediction of venous thromboembolism in non-small cell lung cancer patients based on COMPASS-CAT risk assessment model].

Zhonghua Zhong Liu Za Zhi. 2020 Apr 23;42(4):340-345. doi: 10.3760/cma.j.cn112152-20191101-00707.

Ability of Caprini and Padua risk-assessment models to predict venous thromboembolism in a nationwide Veterans Affairs study.

J Vasc Surg Venous Lymphat Disord. 2024 Mar;12(2):101693. doi: 10.1016/j.jvsv.2023.101693. Epub 2023 Oct 12.

Ability of Caprini and Padua Risk-Assessment Models to Predict Venous Thromboembolism in a Nationwide Study.

medRxiv. 2023 Mar 21:2023.03.20.23287506. doi: 10.1101/2023.03.20.23287506.

Venous Thrombosis Risk after Cast Immobilization of the Lower Extremity: Derivation and Validation of a Clinical Prediction Score, L-TRiP(cast), in Three Population-Based Case-Control Studies.

PLoS Med. 2015 Nov 10;12(11):e1001899; discussion e1001899. doi: 10.1371/journal.pmed.1001899. eCollection 2015 Nov.

[Establishment and Validation of a Predictive Model for Gallstone Disease in the General Population: A Multicenter Study].

Sichuan Da Xue Xue Bao Yi Xue Ban. 2024 May 20;55(3):641-652. doi: 10.12182/20240560501.

本文引用的文献

Cancer incidence and mortality in China, 2022.

J Natl Cancer Cent. 2024 Feb 2;4(1):47-53. doi: 10.1016/j.jncc.2024.01.006. eCollection 2024 Mar.

Cardiovascular Toxicity Induced by Vascular Endothelial Growth Factor Inhibitors.

Life (Basel). 2023 Jan 29;13(2):366. doi: 10.3390/life13020366.

The Best Evidence for the Prevention and Management of Lower Extremity Deep Venous Thrombosis After Gynecological Malignant Tumor Surgery: A Systematic Review and Network Meta-Analysis.

Front Surg. 2022 Mar 22;9:841275. doi: 10.3389/fsurg.2022.841275. eCollection 2022.

Venous thromboembolism and radiation therapy: The final radiation-induced thrombosis study analysis.

Cancer Med. 2022 Apr;11(8):1753-1762. doi: 10.1002/cam4.4559. Epub 2022 Feb 24.

Cancer-associated venous thromboembolism.

Nat Rev Dis Primers. 2022 Feb 17;8(1):11. doi: 10.1038/s41572-022-00336-y.

Gastrointestinal cancers in China, the USA, and Europe.

Gastroenterol Rep (Oxf). 2021 Mar 29;9(2):91-104. doi: 10.1093/gastro/goab010. eCollection 2021 Apr.

Double trouble for cancer patients.

Eur Heart J. 2021 Jun 14;42(23):2308-2310. doi: 10.1093/eurheartj/ehab252.

Changing profiles of cancer burden worldwide and in China: a secondary analysis of the global cancer statistics 2020.

Chin Med J (Engl). 2021 Mar 17;134(7):783-791. doi: 10.1097/CM9.0000000000001474.

Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries.

CA Cancer J Clin. 2021 May;71(3):209-249. doi: 10.3322/caac.21660. Epub 2021 Feb 4.

Utilization and Complications of Central Venous Access Devices in Oncology Patients.

Curr Oncol. 2021 Jan 10;28(1):367-377. doi: 10.3390/curroncol28010039.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

胃癌和食管癌患者VTE风险预测模型的开发与验证

Development and validation of a prediction model for VTE risk in gastric and esophageal cancer patients.

作者信息

Zheng Xingyue, Wu Liuyun, Li Lian, Wang Yin, Yin Qinan, Han Lizhu, Wu Xingwei, Bian Yuan

机构信息