Xu Qianjie, Lei Haike, Li Xiaosheng, Li Fang, Shi Hao, Wang Guixue, Sun Anlong, Wang Ying, Peng Bin
Department of Health Statistics, School of Public Health, Chongqing Medical University, Chongqing, 400016, China.
Chongqing Cancer Multi-omics Big Data Application Engineering Research Center, Chongqing University Cancer Hospital, Chongqing, 400030, China.
Heliyon. 2023 Jan 3;9(1):e12681. doi: 10.1016/j.heliyon.2022.e12681. eCollection 2023 Jan.
Stomach cancer (GC) has one of the highest rates of thrombosis among cancers and can lead to considerable morbidity, mortality, and additional costs. However, to date, there is no suitable venous thromboembolism (VTE) prediction model for gastric cancer patients to predict risk. Therefore, there is an urgent need to establish a clinical prediction model for VTE in gastric cancer patients. We collected data on 3092 patients between January 1, 2018 and December 31, 2021. And after feature selection, 11 variables are reserved as predictors to build the model. Five machine learning (ML) algorithms are used to build different VTE predictive models. The accuracy, sensitivity, specificity, and AUC of these five models were compared with traditional logistic regression (LR) to recommend the best VTE prediction model. RF and XGB models have selected the essential characters in the model: Clinical stage, Blood Transfusion History, D-Dimer, AGE, and FDP. The model has an AUC of 0.825, an accuracy of 0.799, a sensitivity of 0.710, and a specificity of 0.802 in the validation set. The model has good performance and high application value in clinical practice, and can identify high-risk groups of gastric cancer patients and prevent venous thromboembolism.
胃癌(GC)是癌症中血栓形成率最高的疾病之一,可导致相当高的发病率、死亡率和额外费用。然而,迄今为止,尚无适用于胃癌患者的静脉血栓栓塞(VTE)预测模型来预测风险。因此,迫切需要建立一种针对胃癌患者VTE的临床预测模型。我们收集了2018年1月1日至2021年12月31日期间3092例患者的数据。经过特征选择后,保留11个变量作为预测因子来构建模型。使用五种机器学习(ML)算法构建不同的VTE预测模型。将这五个模型的准确性、敏感性、特异性和AUC与传统逻辑回归(LR)进行比较,以推荐最佳的VTE预测模型。随机森林(RF)和极端梯度提升(XGB)模型在模型中选择了关键特征:临床分期、输血史、D-二聚体、年龄和纤维蛋白降解产物(FDP)。该模型在验证集中的AUC为0.825,准确率为0.799,敏感性为0.710,特异性为0.802。该模型在临床实践中具有良好的性能和较高的应用价值,能够识别胃癌患者的高危人群并预防静脉血栓栓塞。