自然语言处理技术从放射学报告中识别肺炎。

Natural Language Processing to identify pneumonia from radiology reports.

机构信息

Group Health Research Institute, Seattle, WA 98101, USA.

出版信息

Pharmacoepidemiol Drug Saf. 2013 Aug;22(8):834-41. doi: 10.1002/pds.3418. Epub 2013 Apr 1.

DOI:10.1002/pds.3418

PMID:23554109

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3811072/

Abstract

PURPOSE

This study aimed to develop Natural Language Processing (NLP) approaches to supplement manual outcome validation, specifically to validate pneumonia cases from chest radiograph reports.

METHODS

We trained one NLP system, ONYX, using radiograph reports from children and adults that were previously manually reviewed. We then assessed its validity on a test set of 5000 reports. We aimed to substantially decrease manual review, not replace it entirely, and so, we classified reports as follows: (1) consistent with pneumonia; (2) inconsistent with pneumonia; or (3) requiring manual review because of complex features. We developed processes tailored either to optimize accuracy or to minimize manual review. Using logistic regression, we jointly modeled sensitivity and specificity of ONYX in relation to patient age, comorbidity, and care setting. We estimated positive and negative predictive value (PPV and NPV) assuming pneumonia prevalence in the source data.

RESULTS

Tailored for accuracy, ONYX identified 25% of reports as requiring manual review (34% of true pneumonias and 18% of non-pneumonias). For the remainder, ONYX's sensitivity was 92% (95% CI 90-93%), specificity 87% (86-88%), PPV 74% (72-76%), and NPV 96% (96-97%). Tailored to minimize manual review, ONYX classified 12% as needing manual review. For the remainder, ONYX had sensitivity 75% (72-77%), specificity 95% (94-96%), PPV 86% (83-88%), and NPV 91% (90-91%).

CONCLUSIONS

For pneumonia validation, ONYX can replace almost 90% of manual review while maintaining low to moderate misclassification rates. It can be tailored for different outcomes and study needs and thus warrants exploration in other settings.

摘要

目的

本研究旨在开发自然语言处理（NLP）方法来补充手动结果验证，特别是验证胸部 X 光报告中的肺炎病例。

方法

我们使用先前经过手动审查的儿童和成人的 X 光报告来训练一个名为 ONYX 的 NLP 系统。然后，我们在一个包含 5000 份报告的测试集中评估了它的有效性。我们的目标是大量减少手动审查，而不是完全取代它，因此，我们将报告分类如下：（1）与肺炎一致；（2）与肺炎不一致；或（3）由于复杂特征需要手动审查。我们开发了针对准确性或最小化手动审查的过程。我们使用逻辑回归联合建模了 ONYX 与患者年龄、合并症和护理环境的敏感性和特异性。我们根据源数据中的肺炎患病率估计了阳性和阴性预测值（PPV 和 NPV）。

结果

针对准确性进行调整的 ONYX 将 25%的报告标记为需要手动审查（34%的真肺炎和 18%的非肺炎）。对于其余的报告，ONYX 的敏感性为 92%（95%CI 90-93%），特异性为 87%（86-88%），PPV 为 74%（72-76%），NPV 为 96%（96-97%）。针对最小化手动审查进行调整的 ONYX 将 12%的报告标记为需要手动审查。对于其余的报告，ONYX 的敏感性为 75%（72-77%），特异性为 95%（94-96%），PPV 为 86%（83-88%），NPV 为 91%（90-91%）。

结论

对于肺炎验证，ONYX 可以替代近 90%的手动审查，同时保持较低至中等的分类错误率。它可以根据不同的结果和研究需求进行调整，因此值得在其他环境中进行探索。

相似文献

Natural Language Processing to identify pneumonia from radiology reports.自然语言处理技术从放射学报告中识别肺炎。

Pharmacoepidemiol Drug Saf. 2013 Aug;22(8):834-41. doi: 10.1002/pds.3418. Epub 2013 Apr 1.

A Methodological Approach to Validate Pneumonia Encounters from Radiology Reports Using Natural Language Processing.一种使用自然语言处理技术验证放射学报告中肺炎病例的方法学途径。

Methods Inf Med. 2022 May;61(1-02):38-45. doi: 10.1055/a-1817-7008. Epub 2022 Apr 5.

Development and Validation of a Natural Language Processing Tool to Identify Patients Treated for Pneumonia across VA Emergency Departments.一种用于识别退伍军人事务部急诊科接受肺炎治疗患者的自然语言处理工具的开发与验证

Appl Clin Inform. 2018 Jan;9(1):122-128. doi: 10.1055/s-0038-1626725. Epub 2018 Feb 21.

The Clinical Utility of Chest Radiography for Identifying Pneumonia: Accounting for Diagnostic Uncertainty in Radiology Reports.胸部 X 线摄影在识别肺炎中的临床效用：考虑放射学报告中的诊断不确定性。

AJR Am J Roentgenol. 2019 Dec;213(6):1207-1212. doi: 10.2214/AJR.19.21521. Epub 2019 Sep 11.

Automated identification of pneumonia in chest radiograph reports in critically ill patients.自动识别危重症患者胸部 X 光报告中的肺炎。

BMC Med Inform Decis Mak. 2013 Aug 15;13:90. doi: 10.1186/1472-6947-13-90.

Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer.用于识别可疑肺癌横断面影像报告的自然语言处理与人工编码的比较

JCO Clin Cancer Inform. 2018 Dec;2:1-7. doi: 10.1200/CCI.17.00069.

Extracting information on pneumonia in infants using natural language processing of radiology reports.利用放射学报告的自然语言处理提取婴儿肺炎相关信息。

J Biomed Inform. 2005 Aug;38(4):314-21. doi: 10.1016/j.jbi.2005.02.003. Epub 2005 Mar 30.

Validation of natural language processing to determine the presence and size of abdominal aortic aneurysms in a large integrated health system.验证自然语言处理技术在大型综合健康系统中确定腹主动脉瘤的存在和大小的准确性。

J Vasc Surg. 2021 Aug;74(2):459-466.e3. doi: 10.1016/j.jvs.2020.12.090. Epub 2021 Feb 4.

A Transparent and Adaptable Method to Extract Colonoscopy and Pathology Data Using Natural Language Processing.一种使用自然语言处理提取结肠镜检查和病理学数据的透明且可适应的方法。

J Med Syst. 2020 Jul 31;44(9):151. doi: 10.1007/s10916-020-01604-8.

Enhancing Comparative Effectiveness Research With Automated Pediatric Pneumonia Detection in a Multi-Institutional Clinical Repository: A PHIS+ Pilot Study.利用多机构临床知识库中的自动儿科肺炎检测增强比较效果研究：一项PHIS+试点研究

J Med Internet Res. 2017 May 15;19(5):e162. doi: 10.2196/jmir.6887.

引用本文的文献

Community-acquired pneumonia identification from electronic health records in the absence of a gold standard: A Bayesian latent class analysis.在缺乏金标准的情况下从电子健康记录中识别社区获得性肺炎：一项贝叶斯潜在类别分析。

PLOS Digit Health. 2025 Jul 21;4(7):e0000936. doi: 10.1371/journal.pdig.0000936. eCollection 2025 Jul.

Natural Language Processing Applied to Clinical Documentation in Post-acute Care Settings: A Scoping Review.自然语言处理在急性后护理环境中临床文档中的应用：一项范围综述

J Am Med Dir Assoc. 2024 Jan;25(1):69-83. doi: 10.1016/j.jamda.2023.09.006. Epub 2023 Oct 11.

Artificial Intelligence to Improve Patient Understanding of Radiology Reports.人工智能提高患者对放射科报告的理解。

Yale J Biol Med. 2023 Sep 29;96(3):407-417. doi: 10.59249/NKOY5498. eCollection 2023 Sep.

Automated, machine learning-based alerts increase epilepsy surgery referrals: A randomized controlled trial.基于自动化、机器学习的警报可增加癫痫手术转诊：一项随机对照试验。

Epilepsia. 2023 Jul;64(7):1791-1799. doi: 10.1111/epi.17629. Epub 2023 May 27.

The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia.开发一种新型自然语言处理工具，用于识别患有肺炎的儿科胸部X光片报告。

Front Digit Health. 2023 Feb 22;5:1104604. doi: 10.3389/fdgth.2023.1104604. eCollection 2023.

Clinical Concept-Based Radiology Reports Classification Pipeline for Lung Carcinoma.基于临床概念的肺癌放射学报告分类流水线。

J Digit Imaging. 2023 Jun;36(3):812-826. doi: 10.1007/s10278-023-00787-z. Epub 2023 Feb 14.

Identifying oral disease variables associated with pneumonia emergence by application of machine learning to integrated medical and dental big data to inform eHealth approaches.通过将机器学习应用于整合的医学和牙科大数据来识别与肺炎发生相关的口腔疾病变量，以为电子健康方法提供信息。

Front Dent Med. 2022;3. doi: 10.3389/fdmed.2022.1005140. Epub 2022 Sep 22.

Development and evaluation of an interoperable natural language processing system for identifying pneumonia across clinical settings of care and institutions.一种用于在不同临床护理环境和机构中识别肺炎的可互操作自然语言处理系统的开发与评估。

JAMIA Open. 2022 Dec 30;5(4):ooac114. doi: 10.1093/jamiaopen/ooac114. eCollection 2022 Dec.

Natural Language Processing and Graph Theory: Making Sense of Imaging Records in a Novel Representation Frame.自然语言处理与图论：在一种新型表示框架中理解影像记录

JMIR Med Inform. 2022 Dec 21;10(12):e40534. doi: 10.2196/40534.

Implementation of Machine Learning Pipelines for Clinical Practice: Development and Validation Study.用于临床实践的机器学习管道的实施：开发与验证研究。

JMIR Med Inform. 2022 Dec 16;10(12):e37833. doi: 10.2196/37833.

本文引用的文献

Automating classification of free-text electronic health records for epidemiological studies.自动化电子健康记录的自由文本分类用于流行病学研究。

Pharmacoepidemiol Drug Saf. 2012 Jun;21(6):651-8. doi: 10.1002/pds.3205. Epub 2012 Jan 24.

Use of opioids or benzodiazepines and risk of pneumonia in older adults: a population-based case-control study.阿片类药物或苯二氮䓬类药物的使用与老年人肺炎风险的关系：一项基于人群的病例对照研究。

J Am Geriatr Soc. 2011 Oct;59(10):1899-907. doi: 10.1111/j.1532-5415.2011.03586.x. Epub 2011 Sep 13.

Automated identification of postoperative complications within an electronic medical record using natural language processing.利用自然语言处理技术在电子病历中自动识别术后并发症。

JAMA. 2011 Aug 24;306(8):848-55. doi: 10.1001/jama.2011.1204.

Association of community-acquired pneumonia with antipsychotic drug use in elderly patients: a nested case-control study.社区获得性肺炎与老年患者抗精神病药物使用的关联：一项巢式病例对照研究。

Ann Intern Med. 2010 Apr 6;152(7):418-25, W139-40. doi: 10.7326/0003-4819-152-7-201004060-00006.

Acid-suppressive medication use and the risk for hospital-acquired pneumonia.抑酸药物的使用与医院获得性肺炎的风险

JAMA. 2009 May 27;301(20):2120-8. doi: 10.1001/jama.2009.722.

ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports.语境：一种从临床报告中确定否定、体验者和时间状态的算法。

J Biomed Inform. 2009 Oct;42(5):839-51. doi: 10.1016/j.jbi.2009.05.002. Epub 2009 May 10.

NLP-based identification of pneumonia cases from free-text radiological reports.基于自然语言处理技术从自由文本放射学报告中识别肺炎病例

AMIA Annu Symp Proc. 2008 Nov 6;2008:172-6.

Influenza vaccination and risk of community-acquired pneumonia in immunocompetent elderly people: a population-based, nested case-control study.免疫功能正常老年人的流感疫苗接种与社区获得性肺炎风险：一项基于人群的巢式病例对照研究。

Lancet. 2008 Aug 2;372(9636):398-405. doi: 10.1016/S0140-6736(08)61160-5.

Impact of the introduction of pneumococcal conjugate vaccine on rates of community acquired pneumonia in children and adults.肺炎球菌结合疫苗的引入对儿童和成人社区获得性肺炎发病率的影响。

Vaccine. 2008 Sep 8;26(38):4947-54. doi: 10.1016/j.vaccine.2008.07.016. Epub 2008 Jul 26.

CDC/NHSN surveillance definition of health care-associated infection and criteria for specific types of infections in the acute care setting.疾病控制与预防中心/国家医疗安全网络（CDC/NHSN）对医疗保健相关感染的监测定义以及急性护理环境中特定类型感染的标准。

Am J Infect Control. 2008 Jun;36(5):309-32. doi: 10.1016/j.ajic.2008.03.002.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验