• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

开发和验证放射学报告的自然语言处理算法,以与国际疾病分类第 10 版编码相比,用于识别住院医疗患者中的静脉血栓栓塞症。

Developing and validating natural language processing algorithms for radiology reports compared to ICD-10 codes for identifying venous thromboembolism in hospitalized medical patients.

机构信息

St. Michael's Hospital, Unity Health Toronto, Toronto, ON, Canada; Department of Medicine, University of Toronto, Toronto, ON, Canada; Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, ON, Canada.

Department of Medicine, University of Toronto, Toronto, ON, Canada.

出版信息

Thromb Res. 2022 Jan;209:51-58. doi: 10.1016/j.thromres.2021.11.020. Epub 2021 Nov 27.

DOI:10.1016/j.thromres.2021.11.020
PMID:34871982
Abstract

BACKGROUND

Identifying venous thromboembolism (VTE) from large clinical and administrative databases is important for research and quality improvement.

OBJECTIVE

To develop and validate natural language processing (NLP) algorithms to identify VTE from radiology reports among general internal medicine (GIM) inpatients.

METHODS

This cross-sectional study included GIM hospitalizations between April 1, 2010 and March 31, 2017 at 5 hospitals in Toronto, Ontario, Canada. We developed NLP algorithms to identify pulmonary embolism (PE) and deep venous thrombosis (DVT) from radiologist reports of thoracic computed tomography (CT), extremity compression ultrasound (US), and nuclear ventilation-perfusion (VQ) scans in a training dataset of 1551 hospitalizations. We compared the accuracy of our NLP algorithms, the previously-published "simpleNLP" tool, and administrative discharge diagnosis codes (ICD-10-CA) for PE and DVT to the "gold standard" manual review in a separate random sample of 4000 GIM hospitalizations.

RESULTS

Our NLP algorithms were highly accurate for identifying DVT from US, with sensitivity 0.94, positive predictive value (PPV) 0.90, and Area Under the Receiver-Operating-Characteristic Curve (AUC) 0.96; and in identifying PE from CT, with sensitivity 0.91, PPV 0.89, and AUC 0.96. Administrative diagnosis codes and the simple NLP tool were less accurate for DVT (ICD-10-CA sensitivity 0.63, PPV 0.43, AUC 0.81; simpleNLP sensitivity 0.41, PPV 0.36, AUC 0.66) and PE (ICD-10-CA sensitivity 0.83, PPV 0.70, AUC 0.91; simpleNLP sensitivity 0.89, PPV 0.62, AUC 0.92).

CONCLUSIONS

Administrative diagnosis codes are unreliable in identifying VTE in hospitalized patients. We developed highly accurate NLP algorithms to identify VTE from radiology reports in a multicentre sample and have made the algorithms freely available to the academic community with a user-friendly tool (https://lks-chart.github.io/CHARTextract-docs/08-downloads/rulesets.html#venous-thromboembolism-vte-rulesets).

摘要

背景

从大型临床和管理数据库中识别静脉血栓栓塞症(VTE)对于研究和质量改进非常重要。

目的

开发和验证自然语言处理(NLP)算法,以从多伦多 5 家医院的综合内科(GIM)住院患者的放射学报告中识别 VTE。

方法

这项横断面研究纳入了 2010 年 4 月 1 日至 2017 年 3 月 31 日期间在加拿大安大略省多伦多的 5 家医院的 GIM 住院患者。我们开发了 NLP 算法,以从胸部计算机断层扫描(CT)、四肢压缩超声(US)和核通气-灌注(VQ)扫描的放射科报告中识别肺栓塞(PE)和深静脉血栓形成(DVT),该算法在 1551 例住院患者的训练数据集中进行了验证。我们比较了 NLP 算法、之前发表的“simpleNLP”工具以及行政出院诊断代码(ICD-10-CA)在 4000 例 GIM 住院患者的独立随机样本中对 PE 和 DVT 的准确性,以与“金标准”手动审查进行比较。

结果

我们的 NLP 算法对 US 识别 DVT 的准确性很高,其敏感性为 0.94,阳性预测值(PPV)为 0.90,受试者工作特征曲线下面积(AUC)为 0.96;对 CT 识别 PE 的敏感性为 0.91,PPV 为 0.89,AUC 为 0.96。行政诊断代码和 simpleNLP 工具对 DVT 的准确性较低(ICD-10-CA 的敏感性为 0.63,PPV 为 0.43,AUC 为 0.81;simpleNLP 的敏感性为 0.41,PPV 为 0.36,AUC 为 0.66)和 PE(ICD-10-CA 的敏感性为 0.83,PPV 为 0.70,AUC 为 0.91;simpleNLP 的敏感性为 0.89,PPV 为 0.62,AUC 为 0.92)。

结论

行政诊断代码在识别住院患者的 VTE 方面不可靠。我们开发了高度准确的 NLP 算法,可从多中心样本的放射学报告中识别 VTE,并通过易于使用的工具(https://lks-chart.github.io/CHARTextract-docs/08-downloads/rulesets.html#venous-thromboembolism-vte-rulesets)向学术界免费提供这些算法。

相似文献

1
Developing and validating natural language processing algorithms for radiology reports compared to ICD-10 codes for identifying venous thromboembolism in hospitalized medical patients.开发和验证放射学报告的自然语言处理算法,以与国际疾病分类第 10 版编码相比,用于识别住院医疗患者中的静脉血栓栓塞症。
Thromb Res. 2022 Jan;209:51-58. doi: 10.1016/j.thromres.2021.11.020. Epub 2021 Nov 27.
2
Natural Language Processing in a Clinical Decision Support System for the Identification of Venous Thromboembolism: Algorithm Development and Validation.临床决策支持系统中自然语言处理用于识别静脉血栓栓塞症:算法开发与验证。
J Med Internet Res. 2023 Apr 24;25:e43153. doi: 10.2196/43153.
3
The validity of ICD codes coupled with imaging procedure codes for identifying acute venous thromboembolism using administrative data.使用管理数据时,国际疾病分类(ICD)编码与影像检查程序编码相结合用于识别急性静脉血栓栓塞症的有效性。
Vasc Med. 2015 Aug;20(4):364-8. doi: 10.1177/1358863X15573839. Epub 2015 Apr 1.
4
A novel method of adverse event detection can accurately identify venous thromboembolisms (VTEs) from narrative electronic health record data.一种新型不良事件检测方法能够从叙述性电子健康记录数据中准确识别静脉血栓栓塞症(VTEs)。
J Am Med Inform Assoc. 2015 Jan;22(1):155-65. doi: 10.1136/amiajnl-2014-002768. Epub 2014 Oct 20.
5
The use of natural language processing on pediatric diagnostic radiology reports in the electronic health record to identify deep venous thrombosis in children.利用自然语言处理技术对电子健康记录中的儿科诊断放射学报告进行分析,以识别儿童深静脉血栓。
J Thromb Thrombolysis. 2017 Oct;44(3):281-290. doi: 10.1007/s11239-017-1532-y.
6
A comparison of natural language processing to ICD-10 codes for identification and characterization of pulmonary embolism.用于识别和表征肺栓塞的自然语言处理与国际疾病分类第10版(ICD - 10)编码的比较。
Thromb Res. 2021 Jul;203:190-195. doi: 10.1016/j.thromres.2021.04.020. Epub 2021 May 6.
7
Natural Language Processing Performance for the Identification of Venous Thromboembolism in an Integrated Healthcare System.自然语言处理在集成医疗保健系统中识别静脉血栓栓塞症的性能。
Clin Appl Thromb Hemost. 2021 Jan-Dec;27:10760296211013108. doi: 10.1177/10760296211013108.
8
Automated Extraction of VTE Events From Narrative Radiology Reports in Electronic Health Records: A Validation Study.从电子健康记录中的叙述性放射学报告自动提取静脉血栓栓塞事件:一项验证研究。
Med Care. 2017 Oct;55(10):e73-e80. doi: 10.1097/MLR.0000000000000346.
9
Validity of Using Inpatient and Outpatient Administrative Codes to Identify Acute Venous Thromboembolism: The CVRN VTE Study.使用住院和门诊管理代码识别急性静脉血栓栓塞的有效性:CVRN VTE研究
Med Care. 2017 Dec;55(12):e137-e143. doi: 10.1097/MLR.0000000000000524.
10
Identifying venous thromboembolism and major bleeding in emergency room discharges using administrative data.利用行政数据识别急诊室出院患者中的静脉血栓栓塞症和大出血情况。
Thromb Res. 2015 Dec;136(6):1195-8. doi: 10.1016/j.thromres.2015.10.035. Epub 2015 Oct 29.

引用本文的文献

1
Validating adverse events in administrative healthcare data in ireland: a retrospective chart review study.验证爱尔兰医疗保健管理数据中的不良事件:一项回顾性图表审查研究。
BMC Health Serv Res. 2025 Aug 20;25(1):1113. doi: 10.1186/s12913-025-13201-x.
2
Role of Artificial Intelligence in the Diagnosis and Management of Pulmonary Embolism: A Comprehensive Review.人工智能在肺栓塞诊断与管理中的作用:全面综述
Diagnostics (Basel). 2025 Apr 1;15(7):889. doi: 10.3390/diagnostics15070889.
3
Healthy Plant-Based Diet, Genetic Predisposition, and the Risk of Incident Venous Thromboembolism.
健康的植物性饮食、遗传易感性与静脉血栓栓塞症的发病风险
JACC Adv. 2024 Oct 16;3(12):101318. doi: 10.1016/j.jacadv.2024.101318. eCollection 2024 Dec.
4
Machine learning in cancer-associated thrombosis: hype or hope in untangling the clot.癌症相关血栓形成中的机器学习:解开血栓之谜是炒作还是希望。
Bleeding Thromb Vasc Biol. 2024;3(Suppl 1). doi: 10.4081/btvb.2024.123. Epub 2024 May 16.
5
Derivation and external validation of a portable method to identify patients with pulmonary embolism from radiology reports: The READ-PE algorithm.从放射学报告中识别肺栓塞患者的便携式方法的推导和外部验证:READ-PE 算法。
Thromb Res. 2024 Sep;241:109105. doi: 10.1016/j.thromres.2024.109105. Epub 2024 Jul 26.
6
Automated vs. manual coding of neuroimaging reports via natural language processing, using the international classification of diseases, tenth revision.通过自然语言处理,使用《国际疾病分类》第十版,对神经影像学报告进行自动编码与人工编码的比较
Heliyon. 2024 May 7;10(10):e30106. doi: 10.1016/j.heliyon.2024.e30106. eCollection 2024 May 30.
7
A scoping review of the methodological approaches used in retrospective chart reviews to validate adverse event rates in administrative data.回顾性图表审查中用于验证行政数据中不良事件发生率的方法学方法的范围综述。
Int J Qual Health Care. 2024 May 10;36(2). doi: 10.1093/intqhc/mzae037.
8
Evaluation of venous thromboembolism risk assessment models for hospital inpatients: the VTEAM evidence synthesis.医院住院患者静脉血栓栓塞风险评估模型的评估:VTEAM 证据综合评价。
Health Technol Assess. 2024 Apr;28(20):1-166. doi: 10.3310/AWTW6200.
9
ClotCatcher: a novel natural language model to accurately adjudicate venous thromboembolism from radiology reports.ClotCatcher:一种新颖的自然语言模型,可准确从放射学报告中判断静脉血栓栓塞。
BMC Med Inform Decis Mak. 2023 Nov 16;23(1):262. doi: 10.1186/s12911-023-02369-z.
10
Validation of an Algorithm to Identify Venous Thromboembolism in Health Insurance Claims Data Among Patients with Rheumatoid Arthritis.一种用于识别类风湿关节炎患者健康保险理赔数据中静脉血栓栓塞症的算法的验证
Clin Epidemiol. 2023 Jun 1;15:671-682. doi: 10.2147/CLEP.S402360. eCollection 2023.