College of Nursing, The University of Iowa, Iowa City, Iowa, USA.
Center for Home Care Policy & Research, VNS Health, New York, New York, USA.
J Am Med Inform Assoc. 2023 Sep 25;30(10):1622-1633. doi: 10.1093/jamia/ocad129.
Little is known about proactive risk assessment concerning emergency department (ED) visits and hospitalizations in patients with heart failure (HF) who receive home healthcare (HHC) services. This study developed a time series risk model for predicting ED visits and hospitalizations in patients with HF using longitudinal electronic health record data. We also explored which data sources yield the best-performing models over various time windows.
We used data collected from 9362 patients from a large HHC agency. We iteratively developed risk models using both structured (eg, standard assessment tools, vital signs, visit characteristics) and unstructured data (eg, clinical notes). Seven specific sets of variables included: (1) the Outcome and Assessment Information Set, (2) vital signs, (3) visit characteristics, (4) rule-based natural language processing-derived variables, (5) term frequency-inverse document frequency variables, (6) Bio-Clinical Bidirectional Encoder Representations from Transformers variables, and (7) topic modeling. Risk models were developed for 18 time windows (1-15, 30, 45, and 60 days) before an ED visit or hospitalization. Risk prediction performances were compared using recall, precision, accuracy, F1, and area under the receiver operating curve (AUC).
The best-performing model was built using a combination of all 7 sets of variables and the time window of 4 days before an ED visit or hospitalization (AUC = 0.89 and F1 = 0.69).
This prediction model suggests that HHC clinicians can identify patients with HF at risk for visiting the ED or hospitalization within 4 days before the event, allowing for earlier targeted interventions.
在接受家庭保健 (HHC) 服务的心力衰竭 (HF) 患者中,对于急诊就诊 (ED) 和住院的主动风险评估知之甚少。本研究使用纵向电子健康记录数据为 HF 患者开发了用于预测 ED 就诊和住院的时间序列风险模型。我们还探讨了在不同的时间窗口内,哪些数据源可以产生性能最佳的模型。
我们使用来自一家大型 HHC 机构的 9362 名患者的数据进行研究。我们使用结构化(例如标准评估工具、生命体征、就诊特征)和非结构化数据(例如临床记录)来迭代开发风险模型。共包含 7 组特定变量:(1)结果和评估信息集,(2)生命体征,(3)就诊特征,(4)基于规则的自然语言处理衍生变量,(5)词频-逆文档频率变量,(6)基于生物临床双向转换器表示的变量,以及(7)主题建模。为 ED 就诊或住院前的 18 个时间窗口(1-15、30、45 和 60 天)开发风险模型。使用召回率、精度、准确度、F1 和接收器操作曲线下的面积 (AUC) 来比较风险预测性能。
使用所有 7 组变量和 ED 就诊或住院前 4 天的时间窗口构建的模型性能最佳(AUC=0.89,F1=0.69)。
该预测模型表明,HHC 临床医生可以在事件发生前 4 天内识别出有 ED 就诊或住院风险的 HF 患者,以便更早地进行有针对性的干预。