Suppr超能文献

利用电子健康记录和相关医疗保险索赔数据的自然语言处理提高痛风发作自动识别的准确性。

Improving the accuracy of automated gout flare ascertainment using natural language processing of electronic health records and linked Medicare claims data.

机构信息

Division of Rheumatology, Inflammation, and Immunity, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA.

Department of Medicine, Harvard Medical School, Boston, Massachusetts, USA.

出版信息

Pharmacoepidemiol Drug Saf. 2024 Jan;33(1):e5684. doi: 10.1002/pds.5684. Epub 2023 Aug 31.

Abstract

BACKGROUND

We aimed to determine whether integrating concepts from the notes from the electronic health record (EHR) data using natural language processing (NLP) could improve the identification of gout flares.

METHODS

Using Medicare claims linked with EHR, we selected gout patients who initiated the urate-lowering therapy (ULT). Patients' 12-month baseline period and on-treatment follow-up were segmented into 1-month units. We retrieved EHR notes for months with gout diagnosis codes and processed notes for NLP concepts. We selected a random sample of 500 patients and reviewed each of their notes for the presence of a physician-documented gout flare. Months containing at least 1 note mentioning gout flares were considered months with events. We used 60% of patients to train predictive models with LASSO. We evaluated the models by the area under the curve (AUC) in the validation data and examined positive/negative predictive values (P/NPV).

RESULTS

We extracted and labeled 839 months of follow-up (280 with gout flares). The claims-only model selected 20 variables (AUC = 0.69). The NLP concept-only model selected 15 (AUC = 0.69). The combined model selected 32 claims variables and 13 NLP concepts (AUC = 0.73). The claims-only model had a PPV of 0.64 [0.50, 0.77] and an NPV of 0.71 [0.65, 0.76], whereas the combined model had a PPV of 0.76 [0.61, 0.88] and an NPV of 0.71 [0.65, 0.76].

CONCLUSION

Adding NLP concept variables to claims variables resulted in a small improvement in the identification of gout flares. Our data-driven claims-only model and our combined claims/NLP-concept model outperformed existing rule-based claims algorithms reliant on medication use, diagnosis, and procedure codes.

摘要

背景

我们旨在确定使用自然语言处理(NLP)整合电子健康记录(EHR)数据中的笔记概念是否可以提高痛风发作的识别率。

方法

我们使用与 EHR 相关联的医疗保险索赔数据,选择开始降低尿酸治疗(ULT)的痛风患者。患者的 12 个月基线期和治疗随访期被分割为 1 个月的单位。我们检索了有痛风诊断代码的月份的 EHR 笔记,并对笔记进行了 NLP 概念处理。我们随机选择了 500 名患者的样本,并审查了他们每个人的笔记,以确定是否有医生记录的痛风发作。包含至少 1 份提及痛风发作的笔记的月份被视为有事件的月份。我们使用 60%的患者使用 LASSO 训练预测模型。我们在验证数据中通过曲线下面积(AUC)评估模型,并检查阳性/阴性预测值(PPV/NPV)。

结果

我们提取并标记了 839 个月的随访(280 个月有痛风发作)。仅索赔模型选择了 20 个变量(AUC=0.69)。仅 NLP 概念模型选择了 15 个(AUC=0.69)。综合模型选择了 32 个索赔变量和 13 个 NLP 概念(AUC=0.73)。仅索赔模型的 PPV 为 0.64 [0.50, 0.77],NPV 为 0.71 [0.65, 0.76],而综合模型的 PPV 为 0.76 [0.61, 0.88],NPV 为 0.71 [0.65, 0.76]。

结论

将 NLP 概念变量添加到索赔变量中可以略微提高痛风发作的识别率。我们的数据驱动的仅索赔模型和综合的索赔/NLP 概念模型优于依赖药物使用、诊断和程序代码的现有基于规则的索赔算法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc54/10873073/1f3fe68926f4/nihms-1940122-f0001.jpg

相似文献

3
Validation of claims-based algorithms for gout flares.基于索赔的痛风发作算法的验证。
Pharmacoepidemiol Drug Saf. 2016 Jul;25(7):820-6. doi: 10.1002/pds.4044. Epub 2016 May 27.

本文引用的文献

1
2020 American College of Rheumatology Guideline for the Management of Gout.2020 年美国风湿病学会痛风管理指南。
Arthritis Rheumatol. 2020 Jun;72(6):879-895. doi: 10.1002/art.41247. Epub 2020 May 11.
7
Validation of claims-based algorithms for gout flares.基于索赔的痛风发作算法的验证。
Pharmacoepidemiol Drug Saf. 2016 Jul;25(7):820-6. doi: 10.1002/pds.4044. Epub 2016 May 27.
8
Gout.痛风
Lancet. 2016 Oct 22;388(10055):2039-2052. doi: 10.1016/S0140-6736(16)00346-9. Epub 2016 Apr 21.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验