Identification of an ANCA-Associated Vasculitis Cohort Using Deep Learning and Electronic Health Records.

Authors

Wang Liqin, Novoa-Laurentiev John, Cook Claire, Srivatsan Shruthi, Hua Yining, Yang Jie, Miloslavsky Eli, Choi Hyon K, Zhou Li, Wallace Zachary S

Affiliations

Division of General Internal Medicine and Primary Care, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA.

Rheumatology and Allergy Clinical Epidemiology Research Center and Division of Rheumatology, Allergy, and Immunology, and Mongan Institute, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA.

Publication information

medRxiv. 2024 Jun 10:2024.06.09.24308603. doi: 10.1101/2024.06.09.24308603.

DOI: 10.1101/2024.06.09.24308603
PMID: 38946986
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11213085/

Abstract

BACKGROUND

ANCA-associated vasculitis (AAV) is a rare but serious disease. Traditional case-identification methods using claims data can be time-intensive and may miss important subgroups. We hypothesized that a deep learning model analyzing electronic health records (EHR) can more accurately identify AAV cases.

METHODS

We examined the Mass General Brigham (MGB) repository of clinical documentation from December 1, 1979 to May 11, 2021, using expert-curated keywords and ICD codes to identify a large cohort of potential AAV cases. Three labeled datasets (I, II, III) were created, each containing note sections. We trained and evaluated a range of machine learning and deep learning algorithms for note-level classification, using metrics including positive predictive value (PPV), sensitivity, F-score, area under the receiver operating characteristic curve (AUROC), and area under the precision-recall curve (AUPRC). The deep learning model was further evaluated for its ability to classify AAV cases at the patient level, compared with rule-based algorithms in 2,000 randomly chosen samples.
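
The note-level metrics named above can be illustrated with a minimal sketch. It is not the authors' evaluation code: the labels, scores, and the 0.5 decision threshold below are hypothetical toy values, and the helper names are made up for illustration. AUROC is computed here through its pairwise-ranking interpretation (the probability that a randomly chosen positive scores higher than a randomly chosen negative).

```python
from typing import Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn) for binary labels coded as 0/1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def ppv(tp: int, fp: int) -> float:
    """Positive predictive value (precision): share of flagged items that are true cases."""
    return tp / (tp + fp) if (tp + fp) else 0.0


def sensitivity(tp: int, fn: int) -> float:
    """Sensitivity (recall): share of true cases that were flagged."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def f_score(precision: float, recall: float) -> float:
    """F1 score: harmonic mean of precision and recall."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


def auroc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = note section indicates AAV, 0 = it does not.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2, 0.1, 0.6]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

tp, fp, fn, tn = confusion_counts(y_true, y_pred)
precision = ppv(tp, fp)
recall = sensitivity(tp, fn)
print(precision, recall, f_score(precision, recall), auroc(y_true, scores))
```

PPV and sensitivity depend on the chosen threshold, whereas AUROC summarizes ranking quality across all thresholds, which is why a model can pair a high AUROC with a modest PPV when true cases are rare.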

RESULTS

Datasets I, II, and III comprised 6,000, 3,008, and 7,500 note sections, respectively. Deep learning achieved the highest AUROC in all three datasets, with scores of 0.983, 0.991, and 0.991. The deep learning approach also had among the highest PPVs across the three datasets (0.941, 0.954, and 0.800, respectively). In a test cohort of 2,000 cases, the deep learning model achieved a PPV of 0.262 and an estimated sensitivity of 0.975. Compared to the best rule-based algorithm, the deep learning model identified six additional AAV cases, representing 13% of the total.
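
The patient-level figures can be read with some back-of-envelope arithmetic. The sketch below is an interpretation aid, not part of the study: the patient identifiers are hypothetical, and the implied total is inferred only from the statement that six additional cases represent 13% of the total; the abstract does not report that total directly.

```python
from typing import Set


def additional_cases(model_cases: Set[str], rule_cases: Set[str]) -> Set[str]:
    """Cases found by the model but missed by the rule-based algorithm."""
    return model_cases - rule_cases


def implied_total(additional: int, share: float) -> float:
    """Total case count implied by 'additional cases are `share` of the total'."""
    return additional / share


def reviews_per_confirmed_case(ppv: float) -> float:
    """Average number of flagged patients reviewed per confirmed case at a given PPV."""
    return 1.0 / ppv


# Hypothetical patient IDs, for illustration only.
model_found = {"p01", "p02", "p03", "p04"}
rule_found = {"p01", "p02"}
print(sorted(additional_cases(model_found, rule_found)))  # ['p03', 'p04']

# Figures taken from the abstract: 6 additional cases = 13% of total; PPV = 0.262.
print(implied_total(6, 0.13))             # roughly 46 cases
print(reviews_per_confirmed_case(0.262))  # roughly 3.8 flagged patients per true case
```

Read this way, a PPV of 0.262 means that roughly one in four flagged patients is a true case, while the estimated sensitivity of 0.975 indicates that few true cases are missed.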

CONCLUSION

The deep learning model effectively classifies clinical note sections for AAV diagnosis. Its application to EHR notes can potentially uncover additional cases missed by traditional rule-based methods.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdd2/11213085/4a0a561426a1/nihpp-2024.06.09.24308603v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fdd2/11213085/b7b545adea6a/nihpp-2024.06.09.24308603v1-f0001.jpg
