• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习算法构建结直肠癌患者术后深静脉血栓形成风险预测模型。

Construction of a risk prediction model for postoperative deep vein thrombosis in colorectal cancer patients based on machine learning algorithms.

作者信息

Liu Xin, Shu Xingming, Zhou Yejiang, Jiang Yifan

机构信息

Department of Clinical Medicine, Southwest Medical University, Luzhou, China.

Department of Gastrointestinal Surgery, The Affiliated Hospital of Southwest Medical University, Luzhou, Sichuan, China.

出版信息

Front Oncol. 2024 Nov 27;14:1499794. doi: 10.3389/fonc.2024.1499794. eCollection 2024.

DOI:10.3389/fonc.2024.1499794
PMID:39664197
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11631706/
Abstract

BACKGROUND

Colorectal cancer is a prevalent malignancy of the digestive system, with an increasing incidence. Lower extremity deep vein thrombosis (DVT) is a frequent postoperative complication, occurring in up to 40% of cases.

OBJECTIVE

This research aims to develop and validate a machine learning model (ML) to predict the risk of lower limb deep vein thrombosis in patients with colorectal cancer, facilitating preventive and therapeutic measures to enhance recovery and ensure safety.

METHODS

In this retrospective cohort study, we collected data from 429 colorectal cancer patients from January 2021 to January 2024. The medical records included age, blood test results, body mass index, underlying diseases, clinical staging, histological typing, surgical methods, and postoperative complications. We employed the Synthetic Minority Oversampling Technique to address imbalanced data and split the dataset into training and validation sets in a 7:3 ratio. Feature selection was performed using Random Forest (RF), XGBoost, and Least Absolute Shrinkage and Selection Operator algorithms (LASSO). We then trained six machine learning models: Logistic Regression (LR), Naive Bayes (NB), Gaussian Process (GP), Random Forest, XGBoost, and Multilayer Perceptron (MLP). The model's performance was evaluated using metrics such as area under the Receiver Operating Characteristic curve, accuracy, sensitivity, specificity, F1 score, and confusion matrix. Additionally, SHAP and LIME were used to enhance the interpretability of the results.

RESULTS

The study combined Random Forest, XGBoost algorithms, and LASSO regression with univariate regression analysis to identify significant predictive factors, including age, preoperative prealbumin, preoperative albumin, preoperative hemoglobin, operation time, PIKVA2, CEA, and preoperative neutrophil count. The XGBoost model outperformed other ML algorithms, achieving an AUC of 0.996, an accuracy of 0.9636, a specificity of 0.9778, and an F1 score of 0.9576. Moreover, the SHAP method identified age and preoperative prealbumin as the primary determinants influencing ML model predictions. Finally, the study employed LIME for more precise prediction and interpretation of individual predictions.

CONCLUSION

The machine learning algorithms effectively predicted postoperative lower limb deep vein thrombosis in colorectal cancer patients. The XGBoost model demonstrated strong potential for improving early detection and treatment in clinical settings.

摘要

背景

结直肠癌是消化系统常见的恶性肿瘤,发病率呈上升趋势。下肢深静脉血栓形成(DVT)是常见的术后并发症,发生率高达40%。

目的

本研究旨在开发并验证一种机器学习模型(ML),以预测结直肠癌患者下肢深静脉血栓形成的风险,促进采取预防和治疗措施,以加快康复并确保安全。

方法

在这项回顾性队列研究中,我们收集了2021年1月至2024年1月期间429例结直肠癌患者的数据。病历包括年龄、血液检查结果、体重指数、基础疾病、临床分期、组织学类型、手术方法和术后并发症。我们采用合成少数过采样技术来处理数据不平衡问题,并将数据集按7:3的比例分为训练集和验证集。使用随机森林(RF)、XGBoost和最小绝对收缩和选择算子算法(LASSO)进行特征选择。然后我们训练了六种机器学习模型:逻辑回归(LR)、朴素贝叶斯(NB)、高斯过程(GP)、随机森林、XGBoost和多层感知器(MLP)。使用受试者工作特征曲线下面积、准确率、敏感性、特异性、F1分数和混淆矩阵等指标评估模型的性能。此外,使用SHAP和LIME来增强结果的可解释性。

结果

该研究将随机森林、XGBoost算法和LASSO回归与单变量回归分析相结合,以确定显著的预测因素,包括年龄、术前前白蛋白、术前白蛋白、术前血红蛋白、手术时间、PIKVA2、癌胚抗原(CEA)和术前中性粒细胞计数。XGBoost模型优于其他ML算法,曲线下面积(AUC)为0.996,准确率为0.9636,特异性为0.9778,F1分数为0.9576。此外,SHAP方法确定年龄和术前前白蛋白是影响ML模型预测的主要决定因素。最后,该研究使用LIME对个体预测进行更精确的预测和解释。

结论

机器学习算法有效地预测了结直肠癌患者术后下肢深静脉血栓形成。XGBoost模型在临床环境中改善早期检测和治疗方面显示出强大的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29e8/11631706/8f5b9d4aaf5f/fonc-14-1499794-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29e8/11631706/db4dbd19ad70/fonc-14-1499794-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29e8/11631706/13f379e09113/fonc-14-1499794-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29e8/11631706/4fb80eb5b79c/fonc-14-1499794-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29e8/11631706/6e841b51825a/fonc-14-1499794-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29e8/11631706/8f5b9d4aaf5f/fonc-14-1499794-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29e8/11631706/db4dbd19ad70/fonc-14-1499794-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29e8/11631706/13f379e09113/fonc-14-1499794-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29e8/11631706/4fb80eb5b79c/fonc-14-1499794-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29e8/11631706/6e841b51825a/fonc-14-1499794-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29e8/11631706/8f5b9d4aaf5f/fonc-14-1499794-g005.jpg

相似文献

1
Construction of a risk prediction model for postoperative deep vein thrombosis in colorectal cancer patients based on machine learning algorithms.基于机器学习算法构建结直肠癌患者术后深静脉血栓形成风险预测模型。
Front Oncol. 2024 Nov 27;14:1499794. doi: 10.3389/fonc.2024.1499794. eCollection 2024.
2
Construction of a risk prediction model for lung infection after chemotherapy in lung cancer patients based on the machine learning algorithm.基于机器学习算法构建肺癌患者化疗后肺部感染风险预测模型。
Front Oncol. 2024 Aug 9;14:1403392. doi: 10.3389/fonc.2024.1403392. eCollection 2024.
3
Machine learning assisted radiomics in predicting postoperative occurrence of deep venous thrombosis in patients with gastric cancer.机器学习辅助的放射组学在预测胃癌患者术后深静脉血栓形成中的应用
BMC Cancer. 2025 Feb 7;25(1):220. doi: 10.1186/s12885-025-13630-1.
4
A risk prediction model based on machine learning algorithm for parastomal hernia after permanent colostomy.基于机器学习算法的永久性结肠造口术后粪袋周围疝风险预测模型。
BMC Med Inform Decis Mak. 2024 Aug 8;24(1):224. doi: 10.1186/s12911-024-02627-8.
5
Prediction and feature selection of low birth weight using machine learning algorithms.利用机器学习算法预测和选择低出生体重。
J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.
6
Prediction of lumbar disc degeneration based on interpretable machine learning models: retrospective cohort study.基于可解释机器学习模型的腰椎间盘退变预测:回顾性队列研究
Spine J. 2025 Apr 9. doi: 10.1016/j.spinee.2025.04.004.
7
Interpretable machine learning model to predict surgical difficulty in laparoscopic resection for rectal cancer.用于预测直肠癌腹腔镜切除手术难度的可解释机器学习模型。
Front Oncol. 2024 Feb 6;14:1337219. doi: 10.3389/fonc.2024.1337219. eCollection 2024.
8
Comparative study of two models predicting the risk of deep vein thrombosis progression in spinal trauma patients after operation.两种模型预测术后脊柱创伤患者深静脉血栓进展风险的比较研究。
Clin Neurol Neurosurg. 2024 Jan;236:108072. doi: 10.1016/j.clineuro.2023.108072. Epub 2023 Nov 29.
9
Predictive modeling of lower extreme deep vein thrombosis following radical gastrectomy for gastric cancer: based on multiple machine learning methods.基于多种机器学习方法的胃癌根治术后下肢深静脉血栓形成的预测模型。
Sci Rep. 2024 Jul 8;14(1):15711. doi: 10.1038/s41598-024-66754-y.
10
Prediction of STAS in lung adenocarcinoma with nodules ≤ 2 cm using machine learning: a multicenter retrospective study.使用机器学习预测直径≤2 cm的肺腺癌中的STAS:一项多中心回顾性研究
BMC Cancer. 2025 Mar 7;25(1):417. doi: 10.1186/s12885-025-13783-z.

引用本文的文献

1
Constructing a Prognostic Model for Subtypes of Colorectal Cancer Based on Machine Learning and Immune Infiltration-Related Genes.基于机器学习和免疫浸润相关基因构建结直肠癌亚型的预后模型
J Cell Mol Med. 2025 Feb;29(4):e70437. doi: 10.1111/jcmm.70437.

本文引用的文献

1
Construction of a risk prediction model for lung infection after chemotherapy in lung cancer patients based on the machine learning algorithm.基于机器学习算法构建肺癌患者化疗后肺部感染风险预测模型。
Front Oncol. 2024 Aug 9;14:1403392. doi: 10.3389/fonc.2024.1403392. eCollection 2024.
2
Treating cancer-associated venous thromboembolism: A practical approach.治疗癌症相关静脉血栓栓塞症:一种实用方法。
Eur J Cancer. 2024 Sep;209:114263. doi: 10.1016/j.ejca.2024.114263. Epub 2024 Aug 5.
3
Clinical Applications of Machine Learning.
机器学习的临床应用
Ann Surg Open. 2024 Apr 18;5(2):e423. doi: 10.1097/AS9.0000000000000423. eCollection 2024 Jun.
4
A Rare Case of Rectal Carcinoma With Pulmonary Artery Thrombosis.一例罕见的伴有肺动脉血栓形成的直肠癌病例。
Cureus. 2024 Mar 13;16(3):e56095. doi: 10.7759/cureus.56095. eCollection 2024 Mar.
5
Construction and validation of a clinical prediction model for deep vein thrombosis in patients with digestive system tumors based on a machine learning.基于机器学习的消化系统肿瘤患者深静脉血栓形成临床预测模型的构建与验证
Am J Cancer Res. 2024 Jan 15;14(1):155-168. doi: 10.62347/LNDL8700. eCollection 2024.
6
Case report of atypical undernutrition of hypoproteinemia type.低蛋白血症型非典型营养不良病例报告
Open Life Sci. 2023 Nov 29;18(1):20220766. doi: 10.1515/biol-2022-0766. eCollection 2023.
7
Development and Validation of a Prediction Model for Venous Thrombus Embolism (VTE) in Patients With Colorectal Cancer.开发和验证用于预测结直肠癌患者静脉血栓栓塞症(VTE)的模型。
Technol Cancer Res Treat. 2023 Jan-Dec;22:15330338231186790. doi: 10.1177/15330338231186790.
8
Establishment and Validation of a Postoperative VTE Prediction Model in Patients with Colorectal Cancer Undergoing Radical Resection: CRSPOT Nomogram.结直肠癌根治术后患者静脉血栓栓塞症预测模型的建立与验证:CRSPOT 列线图。
Clin Appl Thromb Hemost. 2023 Jan-Dec;29:10760296231216966. doi: 10.1177/10760296231216966.
9
Caprini risk assessment model combined with D-dimer to predict the occurrence of deep vein thrombosis and guide intervention after laparoscopic radical resection of colorectal cancer.卡普里尼风险评估模型联合 D-二聚体预测结直肠癌腹腔镜根治术后深静脉血栓的发生及指导干预。
World J Surg Oncol. 2023 Sep 21;21(1):299. doi: 10.1186/s12957-023-03183-7.
10
Application of SHAP for Explainable Machine Learning on Age-Based Subgrouping Mammography Questionnaire Data for Positive Mammography Prediction and Risk Factor Identification.SHAP在基于年龄的乳腺钼靶问卷调查数据亚组划分中的应用,用于乳腺钼靶阳性预测和风险因素识别的可解释机器学习。
Healthcare (Basel). 2023 Jul 11;11(14):2000. doi: 10.3390/healthcare11142000.