Simon Gregory E, Johnson Eric, Shortreed Susan M, Ziebell Rebecca A, Rossom Rebecca C, Ahmedani Brian K, Coleman Karen J, Beck Arne, Lynch Frances L, Daida Yihe G
Kaiser Permanente Washington Health Research Institute, Seattle, WA, United States of America.
Kaiser Permanente Washington Health Research Institute, Seattle, WA, United States of America.
Gen Hosp Psychiatry. 2024 Mar-Apr;87:13-19. doi: 10.1016/j.genhosppsych.2024.01.009. Epub 2024 Jan 22.
Use health records data to predict suicide death following emergency department visits.
Electronic health records and insurance claims from seven health systems were used to: identify emergency department visits with mental health or self-harm diagnoses by members aged 11 or older; extract approximately 2500 potential predictors including demographic, historical, and baseline clinical characteristics; and ascertain subsequent deaths by self-harm. Logistic regression with lasso and random forest models predicted self-harm death over 90 days after each visit.
Records identified 2,069,170 eligible visits, 899 followed by suicide death within 90 days. The best-fitting logistic regression with lasso model yielded an area under the receiver operating curve of 0.823 (95% CI 0.810-0.836). Visits above the 95th percentile of predicted risk included 34.8% (95% CI 31.1-38.7) of subsequent suicide deaths and had a 0.303% (95% CI 0.261-0.346) suicide death rate over the following 90 days. Model performance was similar across subgroups defined by age, sex, race, and ethnicity.
Machine learning models using coded data from health records have moderate performance in predicting suicide death following emergency department visits for mental health or self-harm diagnosis and could be used to identify patients needing more systematic follow-up.
利用健康记录数据预测急诊科就诊后的自杀死亡情况。
使用来自七个医疗系统的电子健康记录和保险理赔数据来:识别11岁及以上成员有心理健康或自我伤害诊断的急诊科就诊情况;提取约2500个潜在预测因素,包括人口统计学、病史和基线临床特征;并确定随后的自我伤害死亡情况。采用套索逻辑回归和随机森林模型预测每次就诊后90天内的自我伤害死亡情况。
记录识别出2,069,170次符合条件的就诊,其中899例在90天内随后自杀死亡。拟合效果最佳的套索逻辑回归模型的受试者工作特征曲线下面积为0.823(95%置信区间0.810 - 0.836)。预测风险处于第95百分位数以上的就诊包括34.8%(95%置信区间31.1 - 38.7)的随后自杀死亡病例,且在接下来的90天内自杀死亡率为0.303%(95%置信区间0.261 - 0.346)。在按年龄、性别、种族和民族定义的亚组中,模型表现相似。
使用健康记录编码数据的机器学习模型在预测因心理健康或自我伤害诊断而到急诊科就诊后的自杀死亡情况方面具有中等表现,可用于识别需要更系统随访的患者。