Department of Emergency Medicine, Emory University School of Medicine, Atlanta, GA, USA.
Department of Surgery, Emory University School of Medicine, Atlanta, GA, USA; Health Services Research Center, Emory University School of Medicine, Atlanta, GA, USA.
Int J Med Inform. 2019 Sep;129:184-188. doi: 10.1016/j.ijmedinf.2019.06.008. Epub 2019 Jun 13.
Nursing triage documentation is the first free-form text data created at the start of an emergency department (ED) visit. These 1-3 unstructured sentences reflect the clinical impression of an experienced nurse and are key in gauging a patient's illness. We aimed to predict final ED disposition using three commonly-employed natural language processing (NLP) techniques of nursing triage notes in isolation from other data.
We constructed a retrospective cohort of all 260,842 consecutive ED encounters in 2015-16, from three clinically heterogeneous academically-affiliated EDs. After exclusion of 3964 encounters based on completeness of triage, and disposition data, we included 256,878 encounters. We defined the outcome as: 1) admission, transfer, or in-ED death [68,092 encounters] vs. 2) discharge, "left without being seen," and "left against medical advice" [188,786 encounters]. The dataset was divided into training and testing subsets. Neural network regression models were trained using bag-of-words, paragraph vectors, and topic distributions to predict disposition and were evaluated using the testing dataset.
Area under the curve for disposition using triage notes as bag-of-words, paragraph vectors, and topic distributions were 0.737 (95% CI: 0.734 - 0.740), 0.785 (95% CI: 0.782 - 0.788), and 0.687 (95% CI: 0.684 - 0.690), respectively.
Nursing triage notes can be used to predict final ED patient disposition, even when used separately from other clinical information. These findings have substantial implications for future studies, suggesting that free text from medical records may be considered as a critical predictor in research of patient outcomes.
护理分诊文档是急诊科就诊时创建的第一批非结构化文本数据。这 1-3 个非结构化句子反映了经验丰富护士的临床印象,是衡量患者疾病的关键。我们旨在使用三种常用的自然语言处理 (NLP) 技术来预测分诊记录,而无需其他数据。
我们构建了一个回顾性队列,包括 2015-16 年来自三个临床异质的学术附属急诊科的 260842 例连续急诊就诊。排除了 3964 例基于分诊和处置数据不完整的就诊后,我们纳入了 256878 例就诊。我们将结局定义为:1)入院、转科或急诊科内死亡[68092 例]与 2)出院、“未就诊离开”和“拒绝医疗建议离开”[188786 例]。数据集分为训练和测试子集。使用词袋、段落向量和主题分布的神经网络回归模型来训练预测处置,并使用测试数据集进行评估。
使用分诊记录作为词袋、段落向量和主题分布预测处置的曲线下面积分别为 0.737(95%CI:0.734-0.740)、0.785(95%CI:0.782-0.788)和 0.687(95%CI:0.684-0.690)。
护理分诊记录可用于预测最终急诊科患者的处置,即使与其他临床信息分开使用。这些发现对未来的研究具有重要意义,表明来自医疗记录的自由文本可能被视为患者结局研究中的关键预测因素。