用于识别可疑肺癌横断面影像报告的自然语言处理与人工编码的比较

Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer.

作者信息

Wadia Roxanne, Akgun Kathleen, Brandt Cynthia, Fenton Brenda T, Levin Woody, Marple Andrew H, Garla Vijay, Rose Michal G, Taddei Tamar, Taylor Caroline

机构信息

Roxanne Wadia, Kathleen Akgun, Cynthia Brandt, Brenda T. Fenton, Andrew H. Marple, Vijay Garla, Michal G. Rose, and Tamar Taddei, Yale University School of Medicine, New Haven; and Roxanne Wadia, Kathleen Akgun, Cynthia Brandt, Brenda T. Fenton, Woody Levin, Michal G. Rose, Tamar Taddei, and Caroline Taylor, Veterans Affairs Connecticut Healthcare System, West Haven, CT.

出版信息

JCO Clin Cancer Inform. 2018 Dec;2:1-7. doi: 10.1200/CCI.17.00069.

DOI:10.1200/CCI.17.00069

PMID:30652545

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6873962/

Abstract

PURPOSE

To compare the accuracy and reliability of a natural language processing (NLP) algorithm with manual coding by radiologists, and the combination of the two methods, for the identification of patients whose computed tomography (CT) reports raised the concern for lung cancer.

METHODS

An NLP algorithm was developed using Clinical Text Analysis and Knowledge Extraction System (cTAKES) with the Yale cTAKES Extensions and trained to differentiate between language indicating benign lesions and lesions concerning for lung cancer. A random sample of 450 chest CT reports performed at Veterans Affairs Connecticut Healthcare System between January 2014 and July 2015 was selected. A reference standard was created by the manual review of reports to determine if the text stated that follow-up was needed for concern for cancer. The NLP algorithm was applied to all reports and compared with case identification using the manual coding by the radiologists.

RESULTS

A total of 450 reports representing 428 patients were analyzed. NLP had higher sensitivity and lower specificity than manual coding (77.3% v 51.5% and 72.5% v 82.5%, respectively). NLP and manual coding had similar positive predictive values (88.4% v 88.9%), and NLP had a higher negative predictive value than manual coding (54% v 38.5%). When NLP and manual coding were combined, sensitivity increased to 92.3%, with a decrease in specificity to 62.85%. Combined NLP and manual coding had a positive predictive value of 87.0% and a negative predictive value of 75.2%.

CONCLUSION

Our NLP algorithm was more sensitive than manual coding of CT chest reports for the identification of patients who required follow-up for suspicion of lung cancer. The combination of NLP and manual coding is a sensitive way to identify patients who need further workup for lung cancer.

摘要

目的

比较自然语言处理（NLP）算法与放射科医生手动编码以及两种方法相结合，在识别计算机断层扫描（CT）报告提示肺癌可能性的患者时的准确性和可靠性。

方法

使用临床文本分析与知识提取系统（cTAKES）及耶鲁cTAKES扩展开发了一种NLP算法，并进行训练以区分表示良性病变和提示肺癌病变的语言。选取了2014年1月至2015年7月在康涅狄格州退伍军人事务医疗系统进行的450份胸部CT报告的随机样本。通过对报告进行人工审核创建参考标准，以确定文本中是否表明因怀疑癌症需要进行随访。将NLP算法应用于所有报告，并与放射科医生的手动编码进行病例识别比较。

结果

共分析了代表428例患者的450份报告。NLP的敏感性高于手动编码，特异性低于手动编码（分别为77.3%对51.5%和72.5%对82.5%）。NLP和手动编码的阳性预测值相似（88.4%对88.9%），且NLP的阴性预测值高于手动编码（54%对38.5%）。当NLP和手动编码相结合时，敏感性提高到92.3%，特异性降至62.85%。NLP与手动编码相结合的阳性预测值为87.0%，阴性预测值为75.2%。

结论

我们的NLP算法在识别因怀疑肺癌需要随访的患者方面比CT胸部报告的手动编码更敏感。NLP与手动编码相结合是识别需要进一步检查肺癌患者的一种敏感方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12f6/6873962/7e3a454c0de7/CCI.17.00069f1.jpg

相似文献

Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer.用于识别可疑肺癌横断面影像报告的自然语言处理与人工编码的比较

JCO Clin Cancer Inform. 2018 Dec;2:1-7. doi: 10.1200/CCI.17.00069.

Evaluating the accuracy of lung-RADS score extraction from radiology reports: Manual entry versus natural language processing.评估从放射学报告中提取肺影像报告和数据系统（Lung-RADS）评分的准确性：手动录入与自然语言处理。

Int J Med Inform. 2024 Nov;191:105580. doi: 10.1016/j.ijmedinf.2024.105580. Epub 2024 Jul 31.

Natural Language Processing to Identify Pulmonary Nodules and Extract Nodule Characteristics From Radiology Reports.自然语言处理技术在放射学报告中识别肺结节并提取结节特征的应用。

Chest. 2021 Nov;160(5):1902-1914. doi: 10.1016/j.chest.2021.05.048. Epub 2021 Jun 4.

Natural Language Processing for the Identification of Incidental Lung Nodules in Computed Tomography Reports: A Quality Control Tool.自然语言处理在计算机断层扫描报告中识别偶然肺结节的应用：一种质量控制工具。

JCO Glob Oncol. 2023 Sep;9:e2300191. doi: 10.1200/GO.23.00191.

Automated Outcome Classification of Computed Tomography Imaging Reports for Pediatric Traumatic Brain Injury.小儿创伤性脑损伤计算机断层扫描成像报告的自动结果分类

Acad Emerg Med. 2016 Feb;23(2):171-8. doi: 10.1111/acem.12859. Epub 2016 Jan 14.

Automatic extraction of imaging observation and assessment categories from breast magnetic resonance imaging reports with natural language processing.基于自然语言处理的乳腺磁共振成像报告中成像观察和评估类别的自动提取。

Chin Med J (Engl). 2019 Jul 20;132(14):1673-1680. doi: 10.1097/CM9.0000000000000301.

Automated outcome classification of emergency department computed tomography imaging reports.急诊 CT 影像报告的自动化结果分类。

Acad Emerg Med. 2013 Aug;20(8):848-54. doi: 10.1111/acem.12174.

Natural language processing to identify ureteric stones in radiology reports.利用自然语言处理技术在放射学报告中识别输尿管结石。

J Med Imaging Radiat Oncol. 2019 Jun;63(3):307-310. doi: 10.1111/1754-9485.12861. Epub 2019 Feb 5.

The implementation of natural language processing to extract index lesions from breast magnetic resonance imaging reports.运用自然语言处理技术从乳腺磁共振成像报告中提取病灶索引。

BMC Med Inform Decis Mak. 2019 Dec 30;19(1):288. doi: 10.1186/s12911-019-0997-3.

Using natural language processing to extract mammographic findings.使用自然语言处理技术提取乳腺钼靶检查结果。

J Biomed Inform. 2015 Apr;54:77-84. doi: 10.1016/j.jbi.2015.01.010. Epub 2015 Feb 3.

引用本文的文献

Automatic Abstraction of Computed Tomography Imaging Indication Using Natural Language Processing for Evaluation of Surveillance Patterns in Long-Term Lung Cancer Survivors.使用自然语言处理自动提取计算机断层扫描成像指征以评估长期肺癌幸存者的监测模式

JCO Clin Cancer Inform. 2025 Jul;9:e2400279. doi: 10.1200/CCI-24-00279. Epub 2025 Jul 23.

Developing and Validating an Automatic Support System for Tumor Coding in Pathology Reports in Spanish.开发并验证一个用于西班牙语病理报告中肿瘤编码的自动支持系统。

JCO Clin Cancer Inform. 2025 Feb;9:e2400124. doi: 10.1200/CCI.24.00124. Epub 2025 Feb 24.

Enhancing diagnosis of benign lesions and lung cancer through ensemble text and breath analysis: a retrospective cohort study.通过集成文本和呼吸分析提高良性病变和肺癌的诊断：一项回顾性队列研究。

Sci Rep. 2024 Apr 16;14(1):8731. doi: 10.1038/s41598-024-59474-w.

Extracting cancer concepts from clinical notes using natural language processing: a systematic review.使用自然语言处理从临床笔记中提取癌症概念：系统评价。

BMC Bioinformatics. 2023 Oct 29;24(1):405. doi: 10.1186/s12859-023-05480-0.

Natural Language Processing Applications for Computer-Aided Diagnosis in Oncology.用于肿瘤学计算机辅助诊断的自然语言处理应用

Diagnostics (Basel). 2023 Jan 12;13(2):286. doi: 10.3390/diagnostics13020286.

A framework for a consistent and reproducible evaluation of manual review for patient matching algorithms.用于对患者匹配算法的人工审核进行一致且可重现的评估的框架。

J Am Med Inform Assoc. 2022 Nov 14;29(12):2105-2109. doi: 10.1093/jamia/ocac175.

Conversion of Automated 12-Lead Electrocardiogram Interpretations to OMOP CDM Vocabulary.将自动化 12 导联心电图解释转换为 OMOP CDM 词汇表。

Appl Clin Inform. 2022 Aug;13(4):880-890. doi: 10.1055/s-0042-1756427. Epub 2022 Sep 21.

Assessment of Electronic Health Record for Cancer Research and Patient Care Through a Scoping Review of Cancer Natural Language Processing.通过癌症自然语言处理的范围综述评估癌症研究和患者护理的电子健康记录。

JCO Clin Cancer Inform. 2022 Jul;6:e2200006. doi: 10.1200/CCI.22.00006.

Automating Access to Real-World Evidence.实现真实世界证据获取的自动化。

JTO Clin Res Rep. 2022 May 17;3(6):100340. doi: 10.1016/j.jtocrr.2022.100340. eCollection 2022 Jun.

Performance of a rule-based semi-automated method to optimize chart abstraction for surveillance imaging among patients treated for non-small cell lung cancer.基于规则的半自动方法在优化非小细胞肺癌治疗患者监测成像图表提取方面的性能。

BMC Med Inform Decis Mak. 2022 Jun 3;22(1):148. doi: 10.1186/s12911-022-01863-0.

本文引用的文献

An Automated Method for Identifying Individuals with a Lung Nodule Can Be Feasibly Implemented Across Health Systems.一种用于识别肺结节患者的自动化方法可在各医疗系统中切实可行地实施。

EGEMS (Wash DC). 2016 Aug 26;4(1):1254. doi: 10.13063/2327-9214.1254. eCollection 2016.

Increasing Prevalence Expectation in Thoracic Radiology Leads to Overcall.胸部放射学中不断增加的患病率预期导致过度诊断。

Acad Radiol. 2016 Mar;23(3):284-9. doi: 10.1016/j.acra.2015.11.007. Epub 2016 Jan 7.

Cancer statistics, 2016.癌症统计数据，2016 年。

CA Cancer J Clin. 2016 Jan-Feb;66(1):7-30. doi: 10.3322/caac.21332. Epub 2016 Jan 7.

Lung Cancer Statistics.肺癌统计数据。

Adv Exp Med Biol. 2016;893:1-19. doi: 10.1007/978-3-319-24223-1_1.

Recent Trends in the Identification of Incidental Pulmonary Nodules.近年来偶然发现的肺结节的鉴定趋势。

Am J Respir Crit Care Med. 2015 Nov 15;192(10):1208-14. doi: 10.1164/rccm.201505-0990OC.

The effect of a lung cancer care coordination program on timeliness of care.肺癌护理协调计划对护理及时性的影响。

Clin Lung Cancer. 2013 Sep;14(5):527-34. doi: 10.1016/j.cllc.2013.04.004. Epub 2013 Jul 1.

Automated detection using natural language processing of radiologists recommendations for additional imaging of incidental findings.利用自然语言处理自动检测放射科医生对偶然发现进行额外成像检查的建议。

Ann Emerg Med. 2013 Aug;62(2):162-9. doi: 10.1016/j.annemergmed.2013.02.001. Epub 2013 Mar 30.

Automated identification of patients with pulmonary nodules in an integrated health system using administrative health plan data, radiology reports, and natural language processing.利用管理式医疗保健计划数据、放射学报告和自然语言处理技术，在综合卫生系统中自动识别肺部结节患者。

J Thorac Oncol. 2012 Aug;7(8):1257-62. doi: 10.1097/JTO.0b013e31825bd9f5.

Benefits and harms of CT screening for lung cancer: a systematic review.CT 筛查肺癌的获益与危害：系统评价。

JAMA. 2012 Jun 13;307(22):2418-29. doi: 10.1001/jama.2012.5521.

Using nurse navigation to improve timeliness of lung cancer care at a veterans hospital.在一家退伍军人医院利用护士导航来提高肺癌护理的及时性。

Clin J Oncol Nurs. 2012 Feb;16(1):29-36. doi: 10.1188/12.CJON.29-36.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于识别可疑肺癌横断面影像报告的自然语言处理与人工编码的比较

Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer.

作者信息

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献