Department of Neurosurgery, Vanderbilt University, Nashville, Tennessee.
DataRobot Inc, Boston, Massachusetts.
Neurosurgery. 2019 Sep 1;85(3):384-393. doi: 10.1093/neuros/nyy343.
Current outcomes prediction tools are largely based on and limited by regression methods. Utilization of machine learning (ML) methods that can handle multiple diverse inputs could strengthen predictive abilities and improve patient outcomes. Inpatient length of stay (LOS) is one such outcome that serves as a surrogate for patient disease severity and resource utilization.
To develop a novel method to systematically rank, select, and combine ML algorithms to build a model that predicts LOS following craniotomy for brain tumor.
A training dataset of 41 222 patients who underwent craniotomy for brain tumor was created from the National Inpatient Sample. Twenty-nine ML algorithms were trained on 26 preoperative variables to predict LOS. Trained algorithms were ranked by calculating the root mean square logarithmic error (RMSLE) and top performing algorithms combined to form an ensemble. The ensemble was externally validated using a dataset of 4592 patients from the National Surgical Quality Improvement Program. Additional analyses identified variables that most strongly influence the ensemble model predictions.
The ensemble model predicted LOS with RMSLE of .555 (95% confidence interval, .553-.557) on internal validation and .631 on external validation. Nonelective surgery, preoperative pneumonia, sodium abnormality, or weight loss, and non-White race were the strongest predictors of increased LOS.
An ML ensemble model predicts LOS with good performance on internal and external validation, and yields clinical insights that may potentially improve patient outcomes. This systematic ML method can be applied to a broad range of clinical problems to improve patient care.
目前的结果预测工具主要基于并受限于回归方法。利用可以处理多种不同输入的机器学习(ML)方法,可以增强预测能力并改善患者的预后。住院时间(LOS)就是这样一个结果,它可以作为患者疾病严重程度和资源利用的替代指标。
开发一种新的方法,系统地对 ML 算法进行排名、选择和组合,以建立一种预测脑肿瘤开颅术后 LOS 的模型。
从国家住院患者样本中创建了一个 41222 名接受脑肿瘤开颅术患者的训练数据集。在 26 个术前变量上对 29 种 ML 算法进行了训练,以预测 LOS。通过计算均方根对数误差(RMSLE)对训练算法进行排名,并将表现最好的算法组合成一个集成。使用来自国家手术质量改进计划的 4592 名患者的数据集对集成进行外部验证。进一步的分析确定了对集成模型预测影响最大的变量。
集成模型在内部验证中预测 LOS 的 RMSLE 为.555(95%置信区间,.553-.557),在外部验证中为.631。非择期手术、术前肺炎、钠异常或体重减轻以及非白种人是 LOS 延长的最强预测因素。
基于 ML 的集成模型在内部和外部验证中都能很好地预测 LOS,并且提供了可能改善患者预后的临床见解。这种系统的 ML 方法可以应用于广泛的临床问题,以改善患者的护理。