College of Computer Science and Technology, Jilin University, 130012, Changchun, China.
School of Biology and Engineering, Guizhou Medical University, 550025, Guiyang, Guizhou, China.
BMC Infect Dis. 2023 Sep 21;23(1):622. doi: 10.1186/s12879-023-08291-z.
Coronavirus disease 2019 (COVID-19) is a rapidly developing and sometimes lethal pulmonary disease. Accurately predicting COVID-19 mortality will facilitate optimal patient treatment and medical resource deployment, but the clinical practice still needs to address it. Both complete blood counts and cytokine levels were observed to be modified by COVID-19 infection. This study aimed to use inexpensive and easily accessible complete blood counts to build an accurate COVID-19 mortality prediction model. The cytokine fluctuations reflect the inflammatory storm induced by COVID-19, but their levels are not as commonly accessible as complete blood counts. Therefore, this study explored the possibility of predicting cytokine levels based on complete blood counts.
We used complete blood counts to predict cytokine levels. The predictive model includes an autoencoder, principal component analysis, and linear regression models. We used classifiers such as support vector machine and feature selection models such as adaptive boost to predict the mortality of COVID-19 patients.
Complete blood counts and original cytokine levels reached the COVID-19 mortality classification area under the curve (AUC) values of 0.9678 and 0.9111, respectively, and the cytokine levels predicted by the feature set alone reached the classification AUC value of 0.9844. The predicted cytokine levels were more significantly associated with COVID-19 mortality than the original values.
Integrating the predicted cytokine levels and complete blood counts improved a COVID-19 mortality prediction model using complete blood counts only. Both the cytokine level prediction models and the COVID-19 mortality prediction models are publicly available at http://www.healthinformaticslab.org/supp/resources.php .
2019 年冠状病毒病(COVID-19)是一种快速发展且有时致命的肺部疾病。准确预测 COVID-19 的死亡率将有助于为患者提供最佳治疗和医疗资源部署,但临床实践仍需加以解决。COVID-19 感染会改变全血细胞计数和细胞因子水平。本研究旨在使用廉价且易于获取的全血细胞计数来建立准确的 COVID-19 死亡率预测模型。细胞因子波动反映了 COVID-19 引起的炎症风暴,但它们的水平不如全血细胞计数常见。因此,本研究探讨了基于全血细胞计数预测细胞因子水平的可能性。
我们使用全血细胞计数来预测细胞因子水平。预测模型包括自动编码器、主成分分析和线性回归模型。我们使用支持向量机等分类器和自适应增强等特征选择模型来预测 COVID-19 患者的死亡率。
全血细胞计数和原始细胞因子水平的 COVID-19 死亡率分类曲线下面积(AUC)值分别达到 0.9678 和 0.9111,仅基于特征集预测的细胞因子水平达到分类 AUC 值 0.9844。预测的细胞因子水平与 COVID-19 死亡率的相关性明显强于原始值。
整合预测的细胞因子水平和全血细胞计数可以改进仅使用全血细胞计数的 COVID-19 死亡率预测模型。细胞因子水平预测模型和 COVID-19 死亡率预测模型均在 http://www.healthinformaticslab.org/supp/resources.php 上公开提供。