Shi Yuchen, Qin Yanwen, Zheng Ze, Wang Ping, Liu Jinghua
Center for Coronary Artery Disease (CCAD), Beijing Anzhen Hospital, Capital Medical University, and Beijing Institute of Heart, Lung and Blood Vessel Diseases, Beijing, China.
Beijing Anzhen Hospital, Capital Medical University, Beijing Institute of Heart Lung and Blood Vessel Disease, The Key Laboratory of Remodeling-Related Cardiovascular Diseases, Ministry of Education, Beijing, China.
J Emerg Med. 2023 Jun 20. doi: 10.1016/j.jemermed.2023.06.012.
The COVID-19 pandemic presents a significant challenge to the global health care system. Implementing timely, accurate, and cost-effective screening approaches is crucial in preventing infections and saving lives by guiding disease management.
The study aimed to use machine learning algorithms to analyze clinical features from routine clinical data to identify risk factors and predict the mortality of COVID-19.
The data used in this research were originally collected for the study titled "Neurologic Syndromes Predict Higher In-Hospital Mortality in COVID-19." A total of 4711 patients with confirmed COVID-19 were enrolled consecutively from four hospitals. Three machine learning models, including random forest (RF), partial least squares discriminant analysis (PLS-DA), and support vector machine (SVM), were used to find risk factors and predict COVID-19 mortality.
The predictive models were developed based on three machine learning algorithms. The RF model was trained with 20 variables and had a receiver operating characteristic (ROC) value of 0.859 (95% confidence interval [CI] 0.804-0.920). The PLS-DA model was trained with 20 variables and had a ROC value of 0.775 (95% CI 0.694-0.833). The SVM model was trained with 10 variables and had a ROC value of 0.828 (95% CI 0.785-0.865). The nine variables that were present in all three models were age, procalcitonin, ferritin, C-reactive protein, troponin, blood urea nitrogen, mean arterial pressure, aspartate transaminase, and alanine transaminase.
This study developed and validated three machine learning prediction models for COVID-19 mortality based on accessible clinical features. The RF model showed the best performance among the three models. The nine variables identified in the models may warrant further investigation as potential prognostic indicators of severe COVID-19.
新冠疫情给全球医疗系统带来了巨大挑战。实施及时、准确且具有成本效益的筛查方法对于通过指导疾病管理来预防感染和挽救生命至关重要。
本研究旨在使用机器学习算法分析常规临床数据中的临床特征,以识别风险因素并预测新冠患者的死亡率。
本研究使用的数据最初是为名为“神经系统综合征预示新冠患者更高的院内死亡率”的研究收集的。从四家医院连续纳入了4711例确诊的新冠患者。使用三种机器学习模型,包括随机森林(RF)、偏最小二乘判别分析(PLS-DA)和支持向量机(SVM)来寻找风险因素并预测新冠死亡率。
基于三种机器学习算法开发了预测模型。RF模型使用20个变量进行训练,受试者操作特征(ROC)值为0.859(95%置信区间[CI]0.804 - 0.920)。PLS-DA模型使用20个变量进行训练,ROC值为0.775(95%CI 0.694 - 0.833)。SVM模型使用10个变量进行训练,ROC值为0.828(95%CI 0.785 - 0.865)。所有三个模型中都存在的九个变量是年龄、降钙素原、铁蛋白、C反应蛋白、肌钙蛋白、血尿素氮、平均动脉压、天冬氨酸转氨酶和丙氨酸转氨酶。
本研究基于可获取的临床特征开发并验证了三种用于预测新冠死亡率的机器学习模型。RF模型在这三种模型中表现最佳。模型中识别出的九个变量作为重症新冠潜在的预后指标可能值得进一步研究。