Department of Computer Science, Shanghai Jiao Tong University, Shanghai, China.
Shanghai Synyi Medical Technology Co., Ltd, Shanghai, China.
J Biomed Inform. 2021 Oct;122:103892. doi: 10.1016/j.jbi.2021.103892. Epub 2021 Aug 26.
Venous thromboembolism (VTE) is a common vascular disease and potentially fatal complication during hospitalization, and so the early identification of VTE risk is of significant importance. Compared with traditional scale assessments, machine learning methods provide new opportunities for precise early warning of VTE from clinical medical records. This research aimed to propose a two-stage hierarchical machine learning model for VTE risk prediction in patients from multiple departments. First, we built a machine learning prediction model that covered the entire hospital, based on all cohorts and common risk factors. Then, we took the prediction output of the first stage as an initial assessment score and then built specific models for each department. Over the duration of the study, a total of 9213 inpatients, including 1165 VTE-positive samples, were collected from four departments, which were split into developing and test datasets. The proposed model achieved an AUC of 0.879 in the department of oncology, which outperformed the first-stage model (0.730) and the department model (0.787). This was attributed to the fully usage of both the large sample size at the hospital level and variable abundance at the department level. Experimental results show that our model could effectively improve the prediction of hospital-acquired VTE risk before image diagnosis and provide decision support for further nursing and medical intervention.
静脉血栓栓塞症(VTE)是一种常见的血管疾病,也是住院期间潜在的致命并发症,因此早期识别 VTE 风险具有重要意义。与传统的量表评估相比,机器学习方法为从临床病历中精确预警 VTE 提供了新的机会。本研究旨在提出一种两阶段分层机器学习模型,用于预测来自多个科室的患者的 VTE 风险。首先,我们基于所有队列和共同风险因素,构建了一个覆盖整个医院的机器学习预测模型。然后,我们将第一阶段的预测输出作为初始评估分数,然后为每个科室构建特定的模型。在研究期间,共从四个科室收集了 9213 名住院患者,包括 1165 例 VTE 阳性样本,将其分为开发数据集和测试数据集。所提出的模型在肿瘤科的 AUC 为 0.879,优于第一阶段模型(0.730)和科室模型(0.787)。这归因于充分利用了医院层面的大样本量和科室层面的变量丰富性。实验结果表明,我们的模型可以有效地提高医院获得性 VTE 风险的预测能力,在影像诊断之前提供决策支持,并为进一步的护理和医疗干预提供依据。