Karimi Zahra, Malak Jaleh S, Aghakhani Amirhossein, Najafi Mohammad S, Ariannejad Hamid, Zeraati Hojjat, Yekaninejad Mir S
Department of Epidemiology and Biostatistics, School of Public Health Tehran University of Medical Sciences Tehran Iran.
Department of Digital Health, School of Medicine Tehran University of Medical Sciences Tehran Iran.
Health Sci Rep. 2024 Sep 2;7(9):e70041. doi: 10.1002/hsr2.70041. eCollection 2024 Sep.
BACKGROUND & AIM: Timely identification of the patients requiring intensive care unit admission (ICU) could be life-saving. We aimed to compare different machine learning algorithms to predict the requirements for ICU admission in COVID-19 patients.
We screened all patients with COVID-19 at six academic hospitals in Tehran comprising our study population. A total of 44,112 COVID-19 patients (≥18 years old) were included, among which 7722 patients were hospitalized. We used a Random Forest algorithm to select significant variables. Then, prediction models were developed using the Support Vector Machine, Naıve Bayes, logistic regression, lightGBM, decision tree, and K-Nearest Neighbor algorithms. Sensitivity, specificity, accuracy, F1 score, and receiver operating characteristic-Area Under the Curve (AUC) were used to compare the prediction performance of different models.
Based on random Forest, the following predictors were selected: age, cardiac disease, cough, hypertension, diabetes, influenza & pneumonia, malignancy, and nervous system disease. Age was found to have the strongest association with ICU admission among COVID-19 patients. All six models achieved an AUC greater than 0.60. Naıve Bayes achieved the best predictive performance (AUC = 0.71).
Naïve Bayes and lightGBM demonstrated promising results in predicting ICU admission needs in COVID-19 patients. Machine learning models could help quickly identify high-risk patients upon entry and reduce mortality and morbidity among COVID-19 patients.
及时识别需要入住重症监护病房(ICU)的患者可能会挽救生命。我们旨在比较不同的机器学习算法,以预测新冠肺炎患者入住ICU的需求。
我们在德黑兰的六家学术医院对所有新冠肺炎患者进行了筛查,构成了我们的研究人群。总共纳入了44112例新冠肺炎患者(≥18岁),其中7722例患者住院治疗。我们使用随机森林算法来选择显著变量。然后,使用支持向量机、朴素贝叶斯、逻辑回归、轻梯度提升机、决策树和K近邻算法开发预测模型。使用灵敏度、特异性、准确性、F1分数和受试者工作特征曲线下面积(AUC)来比较不同模型的预测性能。
基于随机森林,选择了以下预测因素:年龄、心脏病、咳嗽、高血压、糖尿病、流感和肺炎、恶性肿瘤以及神经系统疾病。发现年龄与新冠肺炎患者入住ICU的关联最强。所有六个模型的AUC均大于0.60。朴素贝叶斯取得了最佳预测性能(AUC = 0.71)。
朴素贝叶斯和轻梯度提升机在预测新冠肺炎患者入住ICU需求方面显示出有前景的结果。机器学习模型可以帮助在患者入院时快速识别高危患者,并降低新冠肺炎患者的死亡率和发病率。