自然语言处理在骨科手术部位感染识别中的应用。

Natural Language Processing for the Identification of Surgical Site Infections in Orthopaedics.

机构信息

University of Rochester, Rochester, New York.

出版信息

J Bone Joint Surg Am. 2019 Dec 18;101(24):2167-2174. doi: 10.2106/JBJS.19.00661.

Abstract

BACKGROUND

The identification of surgical site infections for infection surveillance in hospitals depends on the manual abstraction of medical records and, for research purposes, depends mainly on the use of administrative or claims data. The objective of this study was to determine whether automating the abstraction process with natural language processing (NLP)-based models that analyze the free-text notes of the medical record can identify surgical site infections with predictive abilities that match the manual abstraction process and that surpass surgical site infection identification from administrative data.

METHODS

We used surgical site infection surveillance data compiled by the infection prevention team to identify surgical site infections among patients undergoing orthopaedic surgical procedures at a tertiary care academic medical center from 2011 to 2017. We compiled a list of keywords suggestive of surgical site infections, and we used NLP to identify occurrences of these keywords and their grammatical variants in the free-text notes of the medical record. The key outcome was a binary indicator of whether a surgical site infection occurred. We estimated 7 incremental multivariable logistic regression models using a combination of administrative and NLP-derived variables. We split the analytic cohort into training (80%) and testing data sets (20%), and we used a tenfold cross-validation approach. The main analytic cohort included 172 surgical site infection cases and 200 controls that were repeatedly and randomly selected from a pool of 1,407 controls.

RESULTS

For Model 1 (variables from administrative data only), the sensitivity was 68% and the positive predictive value was 70%; for Model 4 (with NLP 5-grams [distinct sequences of 5 contiguous words] from the medical record), the sensitivity was 97% and the positive predictive value was 97%; and for Model 7 (a combination of Models 1 and 4), the sensitivity was 97% and the positive predictive value was 97%. Thus, NLP-based models identified 97% of surgical site infections identified by manual abstraction with high precision and 43% more surgical site infections compared with models that used administrative data only.

CONCLUSIONS

Models that used NLP keywords achieved predictive abilities that were comparable with the manual abstraction process and were superior to models that used administrative data only. NLP has the potential to automate and aid accurate surgical site infection identification and, thus, play an important role in their prevention.

CLINICAL RELEVANCE

This study examines NLP's potential to automate the identification of surgical site infections. This automation can potentially aid the prevention and early identification of these surgical complications, thereby reducing their adverse clinical and economic impact.

摘要

背景

医院感染监测中手术部位感染的识别依赖于病历的人工提取,而对于研究目的,则主要依赖于行政或索赔数据的使用。本研究的目的是确定使用基于自然语言处理(NLP)的模型自动提取过程是否可以通过分析病历的自由文本记录来识别手术部位感染,其预测能力与手动提取过程相匹配,并超过从行政数据中识别手术部位感染的能力。

方法

我们使用感染预防小组编制的手术部位感染监测数据,从 2011 年至 2017 年,在一家三级保健学术医疗中心对接受矫形外科手术的患者进行手术部位感染识别。我们编制了一个提示手术部位感染的关键词列表,并使用 NLP 识别病历自由文本记录中这些关键词及其语法变体的出现情况。主要结果是手术部位感染是否发生的二进制指标。我们使用行政和 NLP 衍生变量的组合估计了 7 个递增的多变量逻辑回归模型。我们将分析队列分为训练(80%)和测试数据集(20%),并使用十折交叉验证方法。主要分析队列包括 172 例手术部位感染病例和 200 例从 1407 例对照中反复随机选择的对照。

结果

对于模型 1(仅来自行政数据的变量),敏感性为 68%,阳性预测值为 70%;对于模型 4(来自病历的 NLP 5-gram[5 个连续单词的独特序列]),敏感性为 97%,阳性预测值为 97%;对于模型 7(模型 1 和 4 的组合),敏感性为 97%,阳性预测值为 97%。因此,基于 NLP 的模型以高精度识别 97%的手术部位感染,与仅使用行政数据的模型相比,还识别了 43%的更多手术部位感染。

结论

使用 NLP 关键词的模型达到了与手动提取过程相当的预测能力,并且优于仅使用行政数据的模型。NLP 有可能实现手术部位感染的自动识别,并辅助其准确识别,从而在预防这些手术并发症方面发挥重要作用。

临床相关性

本研究检查了 NLP 实现手术部位感染自动识别的潜力。这种自动化有可能有助于这些手术并发症的预防和早期识别,从而降低其不良临床和经济影响。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索