Wan Tsz-Kin, Huang Rui-Xuan, Tulu Thomas Wetere, Liu Jun-Dong, Vodencarevic Asmir, Wong Chi-Wah, Chan Kei-Hang Katie
Department of Electrical Engineering, City University of Hong Kong, Hong Kong, China.
Department of Biomedical Sciences, City University of Hong Kong, Hong Kong, China.
Life (Basel). 2022 Apr 6;12(4):547. doi: 10.3390/life12040547.
(1) Background: Coronavirus disease 2019 (COVID-19) is a dominant, rapidly spreading respiratory disease. However, the factors influencing COVID-19 mortality still have not been confirmed. The pathogenesis of COVID-19 is unknown, and relevant mortality predictors are lacking. This study aimed to investigate COVID-19 mortality in patients with pre-existing health conditions and to examine the association between COVID-19 mortality and other morbidities. (2) Methods: De-identified data from 113,882, including 14,877 COVID-19 patients, were collected from the UK Biobank. Different types of data, such as disease history and lifestyle factors, from the COVID-19 patients, were input into the following three machine learning models: Deep Neural Networks (DNN), Random Forest Classifier (RF), eXtreme Gradient Boosting classifier (XGB) and Support Vector Machine (SVM). The Area under the Curve (AUC) was used to measure the experiment result as a performance metric. (3) Results: Data from 14,876 COVID-19 patients were input into the machine learning model for risk-level mortality prediction, with the predicted risk level ranging from 0 to 1. Of the three models used in the experiment, the RF model achieved the best result, with an AUC value of 0.86 (95% CI 0.84-0.88). (4) Conclusions: A risk-level prediction model for COVID-19 mortality was developed. Age, lifestyle, illness, income, and family disease history were identified as important predictors of COVID-19 mortality. The identified factors were related to COVID-19 mortality.
(1)背景:2019冠状病毒病(COVID-19)是一种主要的、迅速传播的呼吸道疾病。然而,影响COVID-19死亡率的因素尚未得到证实。COVID-19的发病机制尚不清楚,且缺乏相关的死亡率预测指标。本研究旨在调查患有基础疾病的患者的COVID-19死亡率,并检验COVID-19死亡率与其他疾病之间的关联。(2)方法:从英国生物银行收集了113882例去识别化数据,其中包括14877例COVID-19患者。将COVID-19患者的不同类型数据,如疾病史和生活方式因素,输入以下三种机器学习模型:深度神经网络(DNN)、随机森林分类器(RF)、极端梯度提升分类器(XGB)和支持向量机(SVM)。曲线下面积(AUC)用作衡量实验结果的性能指标。(3)结果:将14876例COVID-19患者的数据输入机器学习模型进行风险水平死亡率预测,预测风险水平范围为0至1。在实验使用的三种模型中,RF模型取得了最佳结果,AUC值为0.86(95%CI 0.84 - 0.88)。(4)结论:开发了一种COVID-19死亡率的风险水平预测模型。年龄、生活方式、疾病、收入和家族疾病史被确定为COVID-19死亡率的重要预测指标。所确定的因素与COVID-19死亡率相关。