Automated Detection of Cancer-Suspicious Findings in Japanese Radiology Reports with Natural Language Processing: A Multicenter Study.

Author information

Sugimoto Kento, Wada Shoya, Konishi Shozo, Sato Junya, Okada Katsuki, Kido Shoji, Tomiyama Noriyuki, Matsumura Yasushi, Takeda Toshihiro

Affiliations

Department of Medical Informatics, Osaka University Graduate School of Medicine, 2-2 Yamadaoka, Suita, 565-0871, Osaka, Japan.

Department of Transformative System for Medical Information, Osaka University Graduate School of Medicine, 2-2 Yamadaoka, Suita, 565-0871, Osaka, Japan.

Publication information

J Imaging Inform Med. 2025 Jan 22. doi: 10.1007/s10278-024-01338-w.

DOI:10.1007/s10278-024-01338-w

PMID:39843717

Abstract

Missed critical imaging findings, particularly those indicating cancer, are a common issue that can delay patient follow-up and treatment. To address this, we developed a rule-based natural language processing (NLP) algorithm to detect cancer-suspicious findings in Japanese radiology reports. The dataset consisted of chest and abdomen CT reports from six institutions. Reports from our institution were used for algorithm development and internal evaluation, while reports from the other five institutions were used for external evaluation. To create the gold standard, two experienced physicians annotated the reports. Performance was evaluated using precision, recall, and F1 score, with 1000 bootstrap iterations. BERT was used as a baseline deep learning model, and its performance was compared with that of the proposed rule-based method. For report-level detection, the overall precision, recall, and F1 score of the rule-based algorithm were 0.886, 0.886, and 0.883, respectively, all higher than those of the deep learning algorithm (0.851, 0.679, and 0.733). The overall results include both internal and external validation data. For the internal validation set, the precision, recall, and F1 score were 0.929, 0.929, and 0.927, respectively. For the external validation set, they were 0.875, 0.879, and 0.873, respectively, demonstrating generalizability. In conclusion, the rule-based NLP algorithm achieved high performance in detecting cancer-suspicious findings in multi-institutional CT reports.
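The evaluation described in the abstract (precision, recall, and F1 score over 1000 bootstrap iterations) can be illustrated with a minimal sketch. This is not the authors' code: it assumes binary report-level labels (1 = cancer-suspicious, 0 = not) and a percentile bootstrap, and the function names `prf1` and `bootstrap_prf1` are hypothetical. The abstract does not state how the metrics were averaged or how intervals were derived, so those choices are assumptions here.

```python
import random


def prf1(y_true, y_pred):
    """Precision, recall, and F1 for binary labels (1 = cancer-suspicious)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def bootstrap_prf1(y_true, y_pred, n_iter=1000, seed=0):
    """Resample reports with replacement and summarize each metric.

    Returns {metric: (mean, lower, upper)} where lower/upper are the
    2.5th and 97.5th percentiles of the bootstrap distribution
    (an assumed interval choice, not taken from the paper).
    """
    rng = random.Random(seed)
    n = len(y_true)
    samples = []
    for _ in range(n_iter):
        idx = [rng.randrange(n) for _ in range(n)]
        samples.append(prf1([y_true[i] for i in idx],
                            [y_pred[i] for i in idx]))
    summary = {}
    for k, name in enumerate(("precision", "recall", "f1")):
        vals = sorted(s[k] for s in samples)
        mean = sum(vals) / n_iter
        lower = vals[int(0.025 * n_iter)]
        upper = vals[min(n_iter - 1, int(0.975 * n_iter))]
        summary[name] = (mean, lower, upper)
    return summary


if __name__ == "__main__":
    # Toy labels for illustration only; not data from the study.
    gold = [1, 1, 1, 0, 0, 0, 1, 0, 1, 0]
    pred = [1, 1, 0, 0, 0, 1, 1, 0, 1, 0]
    for metric, (mean, lower, upper) in bootstrap_prf1(gold, pred).items():
        print(f"{metric}: {mean:.3f} (95% CI {lower:.3f}-{upper:.3f})")
```

Fixing the seed makes the resampling reproducible, which helps when comparing two systems (for example, a rule-based method and a BERT baseline) on the same set of reports.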
