Research Department, Hartford Hospital, 80 Seymour Street, ERD-223W, Hartford, CT, 06102, USA.
Division of Geriatric Medicine, Hartford Hospital, Hartford, CT, USA.
J Med Syst. 2018 Nov 14;42(12):261. doi: 10.1007/s10916-018-1109-0.
Delirium is a serious medical complication associated with poor outcomes. Given the complexity of the syndrome, prevention and early detection are critical in mitigating its effects. We used Confusion Assessment Method (CAM) screening and Electronic Health Record (EHR) data for 64,038 inpatient visits to train and test a model predicting delirium arising in hospital. Incident delirium was defined as the first instance of a positive CAM occurring at least 48 h into a hospital stay. A Random Forest machine learning algorithm was used with demographic data, comorbidities, medications, procedures, and physiological measures. The data set was randomly partitioned 80% / 20% for training and validating the predictive model, respectively. Of the 51,240 patients in the training set, 2774 (5.4%) experienced delirium during their hospital stay; and of the 12,798 patients in the validation set, 701 (5.5%) experienced delirium. Under-sampling of the delirium negative population was used to address the class imbalance. The Random Forest predictive model yielded an area under the receiver operating characteristic curve (ROC AUC) of 0.909 (95% CI 0.898 to 0.921). Important variables in the model included previously identified predisposing and precipitating risk factors. This machine learning approach displayed a high degree of accuracy and has the potential to provide a clinically useful predictive model for earlier intervention in those patients at greatest risk of developing delirium.
谵妄是一种与不良结局相关的严重医疗并发症。鉴于该综合征的复杂性,预防和早期发现对于减轻其影响至关重要。我们使用了 64038 次住院患者的意识混乱评估方法(CAM)筛查和电子健康记录(EHR)数据来训练和测试一种预测医院内发生谵妄的模型。新发谵妄的定义为在住院至少 48 小时后首次出现阳性 CAM。使用随机森林机器学习算法结合人口统计学数据、合并症、药物、程序和生理测量值。数据集随机分为 80%/20%用于训练和验证预测模型,分别。在训练集中的 51240 名患者中,有 2774 名(5.4%)在住院期间发生谵妄;在验证集中的 12798 名患者中,有 701 名(5.5%)发生谵妄。对谵妄阴性人群进行欠采样以解决类别不平衡问题。随机森林预测模型的受试者工作特征曲线下面积(ROC AUC)为 0.909(95%置信区间为 0.898 至 0.921)。模型中的重要变量包括先前确定的易患和诱发风险因素。这种机器学习方法具有很高的准确性,有可能为那些发生谵妄风险最高的患者提供一种有临床应用价值的预测模型,以便更早地进行干预。