一种通过考虑模糊总体来建立的静脉血栓栓塞症新风险评估模型。

A new risk assessment model of venous thromboembolism by considering fuzzy population.

作者信息

Wang Xin, Yang Yu-Qing, Hong Xin-Yu, Liu Si-Hua, Li Jian-Chu, Chen Ting, Shi Ju-Hong

机构信息

Department of Ultrasound, Peking Union Medical College Hospital, Beijing, China.

Chinese Academy of Medical Sciences, Peking Union Medical College, Beijing, China.

出版信息

BMC Med Inform Decis Mak. 2024 Dec 30;24(1):413. doi: 10.1186/s12911-024-02834-3.

DOI:10.1186/s12911-024-02834-3

PMID:39736732

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11686901/

Abstract

BACKGROUND

Inpatients with high risk of venous thromboembolism (VTE) usually face serious threats to their health and economic conditions. Many studies using machine learning (ML) models to predict VTE risk overlook the impact of class-imbalance problem due to the low incidence rate of VTE, resulting in inferior and unstable model performance, which hinders their ability to replace the Padua model, a widely used linear weighted model in clinic. Our study aims to develop a new VTE risk assessment model suitable for Chinese medical inpatients.

METHODS

3284 inpatients in the medical department of Peking Union Medical College Hospital (PUMCH) from January 2014 to June 2016 were collected. The training and test set were divided based on the admission time and inpatients from May 2016 to June 2016 were included as the test dataset. We explained the class imbalance problem from a clinical perspective and defined a new term, "fuzzy population", to elaborate and model this phenomenon. By considering the "fuzzy population", a new ML VTE risk assessment model was built through population splitting. Sensitivity and specificity of our method was compared with five ML models (support vector machine (SVM), random forest (RF), gradient boosting decision tree (GBDT), logistic regression (LR), and XGBoost) and the Padua model.

RESULTS

The 'fuzzy population' phenomenon was explained and verified on the VTE dataset. The proposed model achieved higher specificity (64.94% vs. 63.30%) and the same sensitivity (90.24% vs. 90.24%) on test data than the Padua model. Other five ML models couldn't simultaneously surpass the Padua's sensitivity and specificity. Besides, our model was more robust than five ML models and its standard deviations of sensitivities and specificities were smaller. Adjusting the distribution of negative samples in the training set based on the 'fuzzy population' would exacerbate the instability of performance of five ML models, which limited the application of ML methods in clinic.

CONCLUSIONS

The proposed model achieved higher sensitivity and specificity than the Padua model, and better robustness than traditional ML models. This study built a population-split-based ML model of VTE by modeling the class-imbalance problem and it can be applied more broadly in risk assessment of other diseases.

摘要

背景

静脉血栓栓塞症（VTE）高危住院患者通常面临严重的健康和经济状况威胁。许多使用机器学习（ML）模型预测VTE风险的研究，由于VTE发病率低而忽视了类不平衡问题的影响，导致模型性能较差且不稳定，这阻碍了它们取代临床上广泛使用的线性加权模型Padua模型的能力。我们的研究旨在开发一种适用于中国内科住院患者的新型VTE风险评估模型。

方法

收集了2014年1月至2016年6月在北京协和医院（PUMCH）内科住院的3284例患者。根据入院时间划分训练集和测试集，并将2016年5月至6月的住院患者作为测试数据集。我们从临床角度解释了类不平衡问题，并定义了一个新术语“模糊人群”来阐述和模拟这一现象。通过考虑“模糊人群”，通过人群划分构建了一个新的ML VTE风险评估模型。将我们方法的敏感性和特异性与五个ML模型（支持向量机（SVM）、随机森林（RF）、梯度提升决策树（GBDT）、逻辑回归（LR）和XGBoost）以及Padua模型进行了比较。

结果

在VTE数据集上解释并验证了“模糊人群”现象。在测试数据上，所提出的模型比Padua模型具有更高的特异性（64.94%对63.30%）和相同的敏感性（90.24%对90.24%）。其他五个ML模型不能同时超过Padua模型的敏感性和特异性。此外，我们的模型比五个ML模型更稳健，其敏感性和特异性的标准差更小。基于“模糊人群”调整训练集中阴性样本的分布会加剧五个ML模型性能的不稳定，这限制了ML方法在临床上的应用。

结论

所提出的模型比Padua模型具有更高的敏感性和特异性，并且比传统ML模型具有更好的稳健性。本研究通过对类不平衡问题进行建模，构建了基于人群划分的VTE的ML模型，并且它可以更广泛地应用于其他疾病的风险评估。

相似文献

A new risk assessment model of venous thromboembolism by considering fuzzy population.

BMC Med Inform Decis Mak. 2024 Dec 30;24(1):413. doi: 10.1186/s12911-024-02834-3.

Comparing different venous thromboembolism risk assessment machine learning models in Chinese patients.

J Eval Clin Pract. 2020 Feb;26(1):26-34. doi: 10.1111/jep.13324. Epub 2019 Dec 15.

Ontology-based venous thromboembolism risk assessment model developing from medical records.

BMC Med Inform Decis Mak. 2019 Aug 8;19(Suppl 4):151. doi: 10.1186/s12911-019-0856-2.

Comparison of the PADUA and IMPROVE scores in assessing venous thromboembolism risk in 42,257 medical inpatients in China.

J Thromb Thrombolysis. 2024 Jun;57(5):775-783. doi: 10.1007/s11239-024-02979-y. Epub 2024 Apr 21.

Development and validation of machine learning models for postoperative venous thromboembolism prediction in colorectal cancer inpatients: a retrospective study.

J Gastrointest Oncol. 2023 Feb 28;14(1):220-232. doi: 10.21037/jgo-23-18. Epub 2023 Feb 15.

Prediction of venous thromboembolism with machine learning techniques in young-middle-aged inpatients.

Sci Rep. 2021 Jun 18;11(1):12868. doi: 10.1038/s41598-021-92287-9.

Assessment of the Risk of Venous Thromboembolism in Medical Inpatients using the Padua Prediction Score and Caprini Risk Assessment Model.

J Atheroscler Thromb. 2018 Nov 1;25(11):1091-1104. doi: 10.5551/jat.43653. Epub 2018 Mar 13.

Development and validation of machine learning models for predicting venous thromboembolism in colorectal cancer patients: A cohort study in China.

Int J Med Inform. 2025 Mar;195:105770. doi: 10.1016/j.ijmedinf.2024.105770. Epub 2024 Dec 19.

Machine learning constructs a diagnostic prediction model for calculous pyonephrosis.

Urolithiasis. 2024 Jun 19;52(1):96. doi: 10.1007/s00240-024-01587-y.

Machine Learning to Dynamically Predict In-Hospital Venous Thromboembolism After Inguinal Hernia Surgery: Results From the CHAT-1 Study.

Clin Appl Thromb Hemost. 2023 Jan-Dec;29:10760296231171082. doi: 10.1177/10760296231171082.

本文引用的文献

Comparing different venous thromboembolism risk assessment machine learning models in Chinese patients.

J Eval Clin Pract. 2020 Feb;26(1):26-34. doi: 10.1111/jep.13324. Epub 2019 Dec 15.

Ontology-based venous thromboembolism risk assessment model developing from medical records.

BMC Med Inform Decis Mak. 2019 Aug 8;19(Suppl 4):151. doi: 10.1186/s12911-019-0856-2.

Opportunities and obstacles for deep learning in biology and medicine.

J R Soc Interface. 2018 Apr;15(141). doi: 10.1098/rsif.2017.0387.

Prediction of venous thromboembolism using semantic and sentiment analyses of clinical narratives.

Comput Biol Med. 2018 Mar 1;94:1-10. doi: 10.1016/j.compbiomed.2017.12.026. Epub 2018 Jan 3.

Predicting Hospitalization and Outpatient Corticosteroid Use in Inflammatory Bowel Disease Patients Using Machine Learning.

Inflamm Bowel Dis. 2017 Dec 19;24(1):45-53. doi: 10.1093/ibd/izx007.

Update on the management of venous thromboembolism.

Cleve Clin J Med. 2017 Dec;84(12 Suppl 3):39-46. doi: 10.3949/ccjm.84.s3.04.

Epidemiology, Pathophysiology, Stratification, and Natural History of Pulmonary Embolism.

Tech Vasc Interv Radiol. 2017 Sep;20(3):135-140. doi: 10.1053/j.tvir.2017.07.002. Epub 2017 Jul 5.

Risk Assessment for Venous Thromboembolism in Chemotherapy-Treated Ambulatory Cancer Patients.

Med Decis Making. 2017 Feb;37(2):234-242. doi: 10.1177/0272989X16662654. Epub 2016 Aug 19.

Comparison between Caprini and Padua risk assessment models for hospitalized medical patients at risk for venous thromboembolism: a retrospective study.

Interact Cardiovasc Thorac Surg. 2016 Oct;23(4):538-43. doi: 10.1093/icvts/ivw158. Epub 2016 Jun 13.

Antithrombotic Therapy for VTE Disease: CHEST Guideline and Expert Panel Report.

Chest. 2016 Feb;149(2):315-352. doi: 10.1016/j.chest.2015.11.026. Epub 2016 Jan 7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种通过考虑模糊总体来建立的静脉血栓栓塞症新风险评估模型。

A new risk assessment model of venous thromboembolism by considering fuzzy population.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献