Department of Computer Science, University of Pisa, Pisa, Italy.
Department of Clinical and Experimental Medicine, University of Pisa, Pisa, Italy.
Sci Rep. 2021 Sep 16;11(1):18464. doi: 10.1038/s41598-021-97990-1.
With the outbreak of COVID-19 exerting a strong pressure on hospitals and health facilities, clinical decision support systems based on predictive models can help to effectively improve the management of the pandemic. We present a method for predicting mortality for COVID-19 patients. Starting from a large number of clinical variables, we select six of them with largest predictive power, using a feature selection method based on genetic algorithms and starting from a set of COVID-19 patients from the first wave. The algorithm is designed to reduce the impact of missing values in the set of variables measured, and consider only variables that show good accuracy on validation data. The final predictive model provides accuracy larger than 85% on test data, including a new patient cohort from the second COVID-19 wave, and on patients with imputed missing values. The selected clinical variables are confirmed to be relevant by recent literature on COVID-19.
随着 COVID-19 的爆发对医院和卫生机构造成巨大压力,基于预测模型的临床决策支持系统有助于有效管理大流行。我们提出了一种预测 COVID-19 患者死亡率的方法。从大量临床变量开始,我们使用基于遗传算法的特征选择方法从第一波 COVID-19 患者中选择了六个具有最大预测能力的变量。该算法旨在减少测量变量集中缺失值的影响,并仅考虑在验证数据上具有良好准确性的变量。最终的预测模型在测试数据上提供了超过 85%的准确性,包括来自第二波 COVID-19 的新患者队列,以及对具有缺失值插补的患者。所选临床变量通过最近关于 COVID-19 的文献得到证实与该疾病相关。