Department of Computer Science and Engineering, College of Engineering, The Ohio State University, Columbus, Ohio; Center for Health Services Research, Regenstrief Institute, Indianapolis, Indiana.
Center for Health Services Research, Regenstrief Institute, Indianapolis, Indiana; Department of Computer and Information Science, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania.
J Surg Res. 2024 Nov;303:699-708. doi: 10.1016/j.jss.2024.09.062. Epub 2024 Oct 24.
Peripheral arterial disease (PAD) is the leading cause of amputation in the United States. Despite affecting 8.5 million Americans and more than 200 million people globally, there are significant gaps in awareness by both patients and providers. Ongoing efforts to raise PAD awareness among both the public and health-care professionals have not met widespread success. Thus, there is a need for alternative methods for identifying PAD patients. One potentially promising strategy leverages natural language processing (NLP) to digitally screen patients for PAD. Prior approaches have applied keyword search (KWS) to billing codes or unstructured clinical narratives to identify patients with PAD. However, KWS is limited by its lack of flexibility, the need for manual algorithm development, inconsistent validation, and an inherent failure to capture patients with undiagnosed PAD. Recent advances in deep learning (DL) allow modern NLP models to learn a conceptual representation of the verbiage associated with PAD. This capability may overcome the characteristic constraints of applying strict rule-based algorithms (i.e., searching for a disease-defining set of keywords or billing codes) to real-world clinical data. Herein, we investigate the use of DL to identify patients with PAD from unstructured notes in the electronic health record (EHR).
Using EHR data from a statewide health information exchange, we first created a dataset of all patients with diagnostic or procedural codes (International Classification of Diseases version 9 or 10 or Current Procedural Terminology) for PAD. This study population was then subdivided into training (70%) and testing (30%) cohorts. We based ground truth labels (PAD versus no PAD) on the presence of a primary diagnostic or procedural billing code for PAD at the encounter level. We implemented our KWS-based identification strategy using the currently published state-of-the-art algorithm for identifying PAD cases from unstructured EHR data. We developed a DL model using a BioMed-RoBERTa base that was fine-tuned on the training cohort. We compared the performance of the KWS algorithm to our DL model on a binary classification task (PAD versus no PAD).
Our study included 484,363 encounters across 71,355 patients represented in 2,268,062 notes. For the task of correctly identifying PAD related notes in our testing set, the DL outperformed KWS on all model performance measures (Sens 0.70 versus 0.62; Spec 0.99 versus 0.94; PPV 0.82 versus 0.69; NPV 0.97 versus 0.96; Accuracy 0.96 versus 0.91; P value for all comparisons <0.001).
Our findings suggest that DL outperforms KWS for identifying PAD cases from clinical narratives. Future planned work derived from this project will develop models to stage patients based on clinical scoring systems.
外周动脉疾病(PAD)是美国截肢的主要原因。尽管影响了 850 万美国人和全球超过 2 亿人,但患者和提供者对 PAD 的认识都存在明显差距。为提高公众和医疗保健专业人员对 PAD 的认识而进行的持续努力并没有取得广泛的成功。因此,需要寻找识别 PAD 患者的替代方法。一种潜在的有前途的策略是利用自然语言处理(NLP)对患者进行 PAD 的数字筛查。先前的方法已经将关键字搜索(KWS)应用于计费代码或非结构化临床叙述,以识别患有 PAD 的患者。然而,KWS 受到其缺乏灵活性、手动算法开发的需要、不一致的验证以及无法捕获未确诊 PAD 患者的固有缺陷的限制。深度学习(DL)的最新进展使现代 NLP 模型能够学习与 PAD 相关的词汇的概念表示。这种能力可能克服将严格基于规则的算法(即,搜索疾病定义的关键字集或计费代码)应用于现实临床数据的特征限制。在此,我们研究了使用 DL 从电子健康记录(EHR)中的非结构化笔记中识别 PAD 患者。
我们使用来自全州健康信息交换的数据,首先创建了一个包含所有 PAD 诊断或程序代码(国际疾病分类第 9 版或第 10 版或当前程序术语)的患者的数据集。该研究人群随后分为训练(70%)和测试(30%)队列。我们基于主要诊断或程序计费代码在就诊级别上存在 PAD 的存在来确定(PAD 与无 PAD)的真实标签。我们使用当前发表的用于从非结构化 EHR 数据中识别 PAD 病例的最先进算法来实施我们的 KWS 识别策略。我们使用经过微调的基于 Biomed-RoBERTa 的基础开发了一个 DL 模型。我们在二进制分类任务(PAD 与无 PAD)上比较了 KWS 算法和我们的 DL 模型的性能。
我们的研究包括 484363 次就诊,涉及 71355 名患者,2268062 份记录。在我们的测试集中正确识别 PAD 相关记录的任务中,DL 在所有模型性能指标上都优于 KWS(敏感性 0.70 对 0.62;特异性 0.99 对 0.94;PPV 0.82 对 0.69;NPV 0.97 对 0.96;准确性 0.96 对 0.91;所有比较的 P 值均<0.001)。
我们的发现表明,DL 在从临床叙述中识别 PAD 病例方面优于 KWS。该项目的未来计划工作将开发基于临床评分系统对患者进行分期的模型。