Kim Jong Ho, Kwon Young Suk, Baek Moon Seong
Department of Anaesthesiology and Pain Medicine, College of Medicine, Hallym University, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea.
Institute of New Frontier Research Team, Hallym University, Chuncheon 24253, Korea.
J Clin Med. 2021 May 18;10(10):2172. doi: 10.3390/jcm10102172.
Previous scoring models, such as the Acute Physiologic Assessment and Chronic Health Evaluation II (APACHE II) score, do not adequately predict the mortality of patients receiving mechanical ventilation in the intensive care unit. Therefore, this study aimed to apply machine learning algorithms to improve the prediction accuracy for 30-day mortality of mechanically ventilated patients. The data of 16,940 mechanically ventilated patients were divided into the training-validation (83%, = 13,988) and test (17%, = 2952) sets. Machine learning algorithms including balanced random forest, light gradient boosting machine, extreme gradient boost, multilayer perceptron, and logistic regression were used. We compared the area under the receiver operating characteristic curves (AUCs) of machine learning algorithms with those of the APACHE II and ProVent score results. The extreme gradient boost model showed the highest AUC (0.79 (0.77-0.80)) for the 30-day mortality prediction, followed by the balanced random forest model (0.78 (0.76-0.80)). The AUCs of these machine learning models as achieved by APACHE II and ProVent scores were higher than 0.67 (0.65-0.69), and 0.69 (0.67-0.71)), respectively. The most important variables in developing each machine learning model were APACHE II score, Charlson comorbidity index, and norepinephrine. The machine learning models have a higher AUC than conventional scoring systems, and can thus better predict the 30-day mortality of mechanically ventilated patients.
以往的评分模型,如急性生理与慢性健康状况评估II(APACHE II)评分,不能充分预测重症监护病房接受机械通气患者的死亡率。因此,本研究旨在应用机器学习算法提高机械通气患者30天死亡率的预测准确性。将16940例机械通气患者的数据分为训练验证集(83%,n = 13988)和测试集(17%,n = 2952)。使用了包括平衡随机森林、轻梯度提升机、极端梯度提升、多层感知器和逻辑回归在内的机器学习算法。我们将机器学习算法的受试者工作特征曲线下面积(AUC)与APACHE II和ProVent评分结果的AUC进行了比较。极端梯度提升模型在30天死亡率预测中显示出最高的AUC(0.79(0.77 - 0.80)),其次是平衡随机森林模型(0.78(0.76 - 0.80))。APACHE II和ProVent评分所获得的这些机器学习模型的AUC分别高于0.67(0.65 - 0.69)和0.69(0.67 - 0.71))。开发每个机器学习模型时最重要的变量是APACHE II评分、Charlson合并症指数和去甲肾上腺素。机器学习模型比传统评分系统具有更高的AUC,因此能够更好地预测机械通气患者的30天死亡率。