Natural Language Processing to Identify Pulmonary Nodules and Extract Nodule Characteristics From Radiology Reports.

Affiliation

Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, CA.

Publication information

Chest. 2021 Nov;160(5):1902-1914. doi: 10.1016/j.chest.2021.05.048. Epub 2021 Jun 4.

DOI:10.1016/j.chest.2021.05.048

PMID:34089738

Abstract

BACKGROUND

There is an urgent need for population-based studies on managing patients with pulmonary nodules.

RESEARCH QUESTION

Is it possible to identify pulmonary nodules and associated characteristics using an automated method?

STUDY DESIGN AND METHODS

We revised and refined an existing natural language processing (NLP) algorithm to identify radiology transcripts with pulmonary nodules and greatly expanded its functionality to identify the characteristics of the largest nodule, when present, including size, lobe, laterality, attenuation, calcification, and edge. We compared NLP results with a reference standard of manual transcript review in a random test sample of 200 radiology transcripts. We applied the final automated method to a larger cohort of patients who underwent chest CT scan in an integrated health care system from 2006 to 2016, and described their demographic and clinical characteristics.
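A rule-based extractor of this kind can be illustrated with a minimal sketch. This is not the authors' algorithm, whose rules the abstract does not describe. The regular expressions, the function name `extract_largest_nodule`, and the output fields are all assumptions chosen for illustration, and the sketch covers only nodule presence, size, laterality, and lobe.

```python
import re

# All patterns below are illustrative assumptions, not the published rules.
SENTENCE_SPLIT = re.compile(r"(?<=[.!?])\s+")  # a period inside "1.2 cm" does not split
NODULE = re.compile(r"\bnodules?\b", re.I)
NEGATED = re.compile(r"\b(?:no|without|negative for)\b[^.]*?\bnodules?\b", re.I)
SIZE = re.compile(r"(\d+(?:\.\d+)?)\s*(mm|cm)\b", re.I)
SIDE = re.compile(r"\b(right|left)\b", re.I)
LOBE = re.compile(r"\b(upper|middle|lower)\s+lobe\b", re.I)


def extract_largest_nodule(report):
    """Return presence, size (mm), laterality, and lobe of the largest nodule."""
    result = {"has_nodule": False, "size_mm": None, "laterality": None, "lobe": None}
    for sentence in SENTENCE_SPLIT.split(report):
        # Skip sentences with no nodule mention or with a simple negation.
        if not NODULE.search(sentence) or NEGATED.search(sentence):
            continue
        result["has_nodule"] = True
        # Normalise every size in the sentence to millimetres.
        sizes = [float(v) * (10 if u.lower() == "cm" else 1)
                 for v, u in SIZE.findall(sentence)]
        if not sizes:
            continue
        size = max(sizes)
        # Keep the attributes of the sentence that holds the largest size so far.
        if result["size_mm"] is None or size > result["size_mm"]:
            side = SIDE.search(sentence)
            lobe = LOBE.search(sentence)
            result.update(
                size_mm=size,
                laterality=side.group(1).lower() if side else None,
                lobe=lobe.group(1).lower() if lobe else None,
            )
    return result


example = ("There is a 6 mm solid nodule in the right upper lobe. "
           "A 1.2 cm spiculated nodule is seen in the left lower lobe. "
           "No pleural effusion.")
print(extract_largest_nodule(example))
```

The negation rule here is deliberately crude: a sentence such as "no change in the 6 mm nodule" would be wrongly skipped, which is the kind of case a production algorithm has to handle and a manual reference standard is meant to catch.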

RESULTS

In the test sample, the NLP algorithm had very high sensitivity (98.6%; 95% CI, 95.0%-99.8%) and specificity (100%; 95% CI, 93.9%-100%) for identifying pulmonary nodules. For attenuation, edge, and calcification, the NLP algorithm achieved similar accuracies, and it correctly identified the diameter of the largest nodule in 135 of 141 cases (95.7%; 95% CI, 91.0%-98.4%). In the larger cohort, the NLP algorithm found 217,771 reports with nodules among 717,304 chest CT reports (30.4%). From 2006 to 2016, the number of reports with nodules increased by 150%, and the mean size of the largest nodule gradually decreased from 11 to 8.9 mm. Radiologists documented the laterality and lobe (90%-95%) more often than the attenuation, calcification, and edge characteristics (11%-14%).
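The proportions and intervals above can be sanity-checked with a short sketch of an exact (Clopper-Pearson) binomial confidence interval. The abstract does not state which interval method the authors used, so the choice of method is an assumption. The 135-of-141 count comes from the text; the 59-of-59 count for specificity is an assumption, inferred from 200 sampled reports minus 141 nodule cases and not stated in the abstract.

```python
from math import comb


def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


def solve_increasing(f, target, iters=200):
    """Bisection on [0, 1] for an increasing function f with f(p) = target."""
    lo, hi = 0.0, 1.0
    for _ in range(iters):
        mid = (lo + hi) / 2
        if f(mid) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def clopper_pearson(x, n, alpha=0.05):
    """Exact two-sided confidence interval for x successes in n trials."""
    # Lower bound: the p at which P(X >= x) equals alpha / 2.
    lower = 0.0 if x == 0 else solve_increasing(
        lambda p: 1 - binom_cdf(x - 1, n, p), alpha / 2)
    # Upper bound: the p at which P(X <= x) equals alpha / 2.
    upper = 1.0 if x == n else solve_increasing(
        lambda p: 1 - binom_cdf(x, n, p), 1 - alpha / 2)
    return lower, upper


# Diameter accuracy, using the counts given in the text.
diameter_ci = clopper_pearson(135, 141)
# Specificity, using the ASSUMED count of 59 correct out of 59 reports without nodules.
specificity_ci = clopper_pearson(59, 59)
# Share of chest CT reports with nodules in the larger cohort.
nodule_share = round(100 * 217771 / 717304, 1)  # 30.4

print(diameter_ci, specificity_ci, nodule_share)
```

With all 59 assumed negatives classified correctly, the lower bound reduces to 0.025 raised to the power 1/59, which is about 93.9% and agrees with the reported interval.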

INTERPRETATION

The NLP algorithm identified pulmonary nodules and associated characteristics with high accuracy. In our community practice settings, the documentation of nodule characteristics is incomplete. Our results call for better documentation of nodule findings. The NLP algorithm can be used in population-based studies to identify pulmonary nodules, avoiding labor-intensive chart review.
