Department of Neurosurgery, Kobe University Graduate School of Medicine, Kobe, Hyogo, Japan.
Department of Neurosurgery, Hyogo Prefectural Kakogawa Medical Center, Kakogawa, Hyogo, Japan.
J Neurotrauma. 2020 Jan 1;37(1):202-210. doi: 10.1089/neu.2018.6276. Epub 2019 Sep 18.
Recently, successful predictions using machine learning (ML) algorithms have been reported in various fields. However, in traumatic brain injury (TBI) cohorts, few studies have examined modern ML algorithms. To develop a simple ML model for TBI outcome prediction, we conducted a performance comparison of nine algorithms: ridge regression, least absolute shrinkage and selection operator (LASSO) regression, random forest, gradient boosting, extra trees, decision tree, Gaussian naïve Bayes, multi-nomial naïve Bayes, and support vector machine. Fourteen feasible parameters were introduced in the ML models, including age, Glasgow Coma Scale (GCS), systolic blood pressure (SBP), abnormal pupillary response, major extracranial injury, computed tomography (CT) findings, and routinely collected laboratory values (glucose, C-reactive protein [CRP], and fibrin/fibrinogen degradation products [FDP]). Data from 232 patients with TBI were randomly divided into a training sample (80%) for hyperparameter tuning and validation sample (20%). The bootstrap method was used for validation. Random forest demonstrated the best performance for in-hospital poor outcome prediction and ridge regression for in-hospital mortality prediction: the mean statistical measures were 100% sensitivity, 72.3% specificity, 91.7% accuracy, and 0.895 area under the receiver operating characteristic curve (AUC); and 88.4% sensitivity, 88.2% specificity, 88.6% accuracy, and 0.875 AUC, respectively. Based on the feature selection method using the tree-based ensemble algorithm, age, Glasgow Coma Scale, fibrin/fibrinogen degradation products, and glucose were identified as the most important prognostic factors for poor outcome and mortality. Our results indicate the relatively good predictive performance of modern ML for TBI outcome. Further external validation is required for more heterogeneous samples to confirm our results.
最近,机器学习(ML)算法在各个领域的成功预测已经有报道。然而,在创伤性脑损伤(TBI)队列中,很少有研究检查现代 ML 算法。为了开发 TBI 结果预测的简单 ML 模型,我们对 9 种算法进行了性能比较:岭回归、最小绝对值收缩和选择算子(LASSO)回归、随机森林、梯度提升、额外树、决策树、高斯朴素贝叶斯、多项式朴素贝叶斯和支持向量机。在 ML 模型中引入了 14 个可行参数,包括年龄、格拉斯哥昏迷量表(GCS)、收缩压(SBP)、异常瞳孔反应、主要颅外损伤、计算机断层扫描(CT)发现和常规采集的实验室值(葡萄糖、C 反应蛋白[CRP]和纤维蛋白/纤维蛋白原降解产物[FDP])。232 名 TBI 患者的数据被随机分为训练样本(80%)进行超参数调整和验证样本(20%)。使用 bootstrap 方法进行验证。随机森林在住院不良预后预测中表现出最佳性能,岭回归在住院死亡率预测中表现最佳:平均统计指标分别为 100%的敏感性、72.3%的特异性、91.7%的准确性和 0.895 接收器操作特征曲线(AUC)下的面积;和 88.4%的敏感性、88.2%的特异性、88.6%的准确性和 0.875 AUC。基于基于树的集成算法的特征选择方法,年龄、格拉斯哥昏迷量表、纤维蛋白/纤维蛋白原降解产物和葡萄糖被确定为不良预后和死亡率的最重要预后因素。我们的结果表明,现代 ML 对 TBI 结果具有相对较好的预测性能。需要进一步的外部验证来确认我们的结果,以更具异质性的样本。