Department of Medicine, Veterans Affairs Maryland Health Care System, Baltimore, Maryland, USA.
PLoS One. 2013 Aug 13;8(8):e70944. doi: 10.1371/journal.pone.0070944. eCollection 2013.
Timely information about disease severity can be central to the detection and management of outbreaks of acute respiratory infections (ARI), including influenza. We asked if two resources: 1) free text, and 2) structured data from an electronic medical record (EMR) could complement each other to identify patients with pneumonia, an ARI severity landmark.
A manual EMR review of 2747 outpatient ARI visits with associated chest imaging identified x-ray reports that could support the diagnosis of pneumonia (kappa score = 0.88 (95% CI 0.82∶0.93)), along with attendant cases with Possible Pneumonia (adds either cough, sputum, fever/chills/night sweats, dyspnea or pleuritic chest pain) or with Pneumonia-in-Plan (adds pneumonia stated as a likely diagnosis by the provider). The x-ray reports served as a reference to develop a text classifier using machine-learning software that did not require custom coding. To identify pneumonia cases, the classifier was combined with EMR-based structured data and with text analyses aimed at ARI symptoms in clinical notes.
370 reference cases with Possible Pneumonia and 250 with Pneumonia-in-Plan were identified. The x-ray report text classifier increased the positive predictive value of otherwise identical EMR-based case-detection algorithms by 20-70%, while retaining sensitivities of 58-75%. These performance gains were independent of the case definitions and of whether patients were admitted to the hospital or sent home. Text analyses seeking ARI symptoms in clinical notes did not add further value.
Specialized software development is not required for automated text analyses to help identify pneumonia patients. These results begin to map an efficient, replicable strategy through which EMR data can be used to stratify ARI severity.
及时的疾病严重程度信息对于急性呼吸道感染(ARI),包括流感的爆发检测和管理至关重要。我们想知道两个资源:1)自由文本,2)电子病历(EMR)中的结构化数据,是否可以相互补充,以识别肺炎患者,这是 ARI 严重程度的一个标志。
对 2747 例门诊 ARI 就诊病例进行了手动 EMR 审查,这些病例均有相关的胸部影像学检查,发现了可以支持肺炎诊断的 X 射线报告(kappa 评分=0.88(95%置信区间 0.82∶0.93)),同时还有伴有疑似肺炎(添加咳嗽、咳痰、发热/寒战/盗汗、呼吸困难或胸膜炎性胸痛)或肺炎计划(添加提供者认为肺炎可能诊断)的病例。X 射线报告被用作开发使用机器学习软件的文本分类器的参考,该软件不需要定制编码。为了识别肺炎病例,该分类器与基于 EMR 的结构化数据以及针对临床记录中 ARI 症状的文本分析相结合。
确定了 370 例疑似肺炎的参考病例和 250 例肺炎计划病例。X 射线报告文本分类器将原本相同的基于 EMR 的病例检测算法的阳性预测值提高了 20-70%,同时保持了 58-75%的敏感性。这些性能提升与病例定义以及患者是否住院或出院无关。在临床记录中寻找 ARI 症状的文本分析并没有增加更多的价值。
自动化文本分析不需要专门的软件开发来帮助识别肺炎患者。这些结果开始为通过 EMR 数据分层 ARI 严重程度的有效、可复制策略提供了依据。