基于深度学习的结构化放射学报告自然语言处理对肿瘤学结果的评估

Deep Learning-based Assessment of Oncologic Outcomes from Natural Language Processing of Structured Radiology Reports.

作者信息

Fink Matthias A, Kades Klaus, Bischoff Arved, Moll Martin, Schnell Merle, Küchler Maike, Köhler Gregor, Sellner Jan, Heussel Claus Peter, Kauczor Hans-Ulrich, Schlemmer Heinz-Peter, Maier-Hein Klaus, Weber Tim F, Kleesiek Jens

机构信息

Clinic for Diagnostic and Interventional Radiology (M.A.F., A.B., M.M., M.S., M.K., C.P.H., H.U.K., T.F.W.) and Pattern Analysis and Learning Group, Department of Radiation Oncology (K.M.H.), Heidelberg University Hospital, Im Neuenheimer Feld 420, 69120 Heidelberg, Germany; Translational Lung Research Center Heidelberg (TLRC), Member of the German Center for Lung Research (DZL), Heidelberg, Germany (M.A.F., A.B., M.M., M.S., M.K., C.P.H., H.U.K., T.F.W.); Faculty of Mathematics and Computer Science (K.K.) and Department of Diagnostic and Interventional Radiology with Nuclear Medicine, Heidelberg Thoracic Clinic (C.P.H.), Heidelberg University, Heidelberg, Germany; Division of Medical Image Computing (K.K., G.K., K.M.H.), Department of Computer Assisted Medical Interventions (CAMI) (J.S.), and Department of Radiology (H.P.S.), German Cancer Research Center (DKFZ), Heidelberg, Germany; German Cancer Consortium (DKTK), Partner Sites Essen and Heidelberg, Heidelberg, Germany (H.P.S., K.M.H., J.K.); and Institute for Artificial Intelligence in Medicine (IKIM), University Medicine Essen, Essen, Germany (J.K.).

出版信息

Radiol Artif Intell. 2022 Jul 20;4(5):e220055. doi: 10.1148/ryai.220055. eCollection 2022 Sep.

DOI:10.1148/ryai.220055

PMID:36204531

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9530771/

Abstract

PURPOSE

To train a deep natural language processing (NLP) model, using data mined structured oncology reports (SOR), for rapid tumor response category (TRC) classification from free-text oncology reports (FTOR) and to compare its performance with human readers and conventional NLP algorithms.

MATERIALS AND METHODS

In this retrospective study, databases of three independent radiology departments were queried for SOR and FTOR dated from March 2018 to August 2021. An automated data mining and curation pipeline was developed to extract Response Evaluation Criteria in Solid Tumors-related TRCs for SOR for ground truth definition. The deep NLP bidirectional encoder representations from transformers (BERT) model and three feature-rich algorithms were trained on SOR to predict TRCs in FTOR. Models' F1 scores were compared against scores of radiologists, medical students, and radiology technologist students. Lexical and semantic analyses were conducted to investigate human and model performance on FTOR.

RESULTS

Oncologic findings and TRCs were accurately mined from 9653 of 12 833 (75.2%) queried SOR, yielding oncology reports from 10 455 patients (mean age, 60 years ± 14 [SD]; 5303 women) who met inclusion criteria. On 802 FTOR in the test set, BERT achieved better TRC classification results (F1, 0.70; 95% CI: 0.68, 0.73) than the best-performing reference linear support vector classifier (F1, 0.63; 95% CI: 0.61, 0.66) and technologist students (F1, 0.65; 95% CI: 0.63, 0.67), had similar performance to medical students (F1, 0.73; 95% CI: 0.72, 0.75), but was inferior to radiologists (F1, 0.79; 95% CI: 0.78, 0.81). Lexical complexity and semantic ambiguities in FTOR influenced human and model performance, revealing maximum F1 score drops of -0.17 and -0.19, respectively.

CONCLUSION

The developed deep NLP model reached the performance level of medical students but not radiologists in curating oncologic outcomes from radiology FTOR. Neural Networks, Computer Applications-Detection/Diagnosis, Oncology, Research Design, Staging, Tumor Response, Comparative Studies, Decision Analysis, Experimental Investigations, Observer Performance, Outcomes Analysis © RSNA, 2022.

摘要

目的

使用从结构化肿瘤学报告（SOR）中挖掘的数据训练一个深度自然语言处理（NLP）模型，用于从自由文本肿瘤学报告（FTOR）中快速进行肿瘤反应类别（TRC）分类，并将其性能与人类读者和传统NLP算法进行比较。

材料与方法

在这项回顾性研究中，查询了三个独立放射科的数据库，获取2018年3月至2021年8月期间的SOR和FTOR。开发了一个自动化数据挖掘和整理管道，以提取SOR中与实体瘤相关的TRC的反应评估标准，用于定义地面真值。在SOR上训练深度NLP双向编码器表征来自变压器（BERT）模型和三种特征丰富的算法，以预测FTOR中的TRC。将模型的F1分数与放射科医生、医学生和放射技术专业学生的分数进行比较。进行了词汇和语义分析，以研究人类和模型在FTOR上的表现。

结果

从12833份查询的SOR中的9653份（75.2%）中准确挖掘出肿瘤学发现和TRC，得到了10455名符合纳入标准患者（平均年龄60岁±14[标准差]；5303名女性）的肿瘤学报告。在测试集中的802份FTOR上，BERT实现了比表现最佳的参考线性支持向量分类器（F1，0.63；95%CI：0.61，0.66）和技术专业学生（F1，0.65；95%CI：0.63，0.67）更好的TRC分类结果（F1，0.70；95%CI：0.68，0.73），与医学生（F1，0.73；95%CI：0.72，0.75）表现相似，但不如放射科医生（F1，0.79；95%CI：0.78，0.81）。FTOR中的词汇复杂性和语义模糊性影响了人类和模型的表现，分别显示F1分数最大下降-0.17和-0.19。

结论

所开发的深度NLP模型在从放射学FTOR中整理肿瘤学结果方面达到了医学生的性能水平，但未达到放射科医生的水平。神经网络、计算机应用-检测/诊断、肿瘤学、研究设计、分期、肿瘤反应、比较研究、决策分析、实验研究、观察者表现、结果分析 ©RSNA，2022。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e688/9530771/2960abd1666d/ryai.220055.VA.jpg

相似文献

Deep Learning-based Assessment of Oncologic Outcomes from Natural Language Processing of Structured Radiology Reports.基于深度学习的结构化放射学报告自然语言处理对肿瘤学结果的评估

Radiol Artif Intell. 2022 Jul 20;4(5):e220055. doi: 10.1148/ryai.220055. eCollection 2022 Sep.

Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT).使用基于转换器的双向编码器表示 (BERT) 和领域内预训练 (IDPT) 对耳鸣患者的可操作放射学报告进行自动文本分类。

BMC Med Inform Decis Mak. 2022 Jul 30;22(1):200. doi: 10.1186/s12911-022-01946-y.

Automatic Diagnosis Labeling of Cardiovascular MRI by Using Semisupervised Natural Language Processing of Text Reports.利用文本报告的半监督自然语言处理对心血管磁共振成像进行自动诊断标注

Radiol Artif Intell. 2021 Nov 24;4(1):e210085. doi: 10.1148/ryai.210085. eCollection 2022 Jan.

Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework.基于 BERT（来自 Transformers 的双向编码器表示）的深度学习方法在提取中文放射学报告证据中的应用：计算机辅助肝癌诊断框架的开发。

J Med Internet Res. 2021 Jan 12;23(1):e19689. doi: 10.2196/19689.

A Question-and-Answer System to Extract Data From Free-Text Oncological Pathology Reports (CancerBERT Network): Development Study.从自由文本肿瘤病理学报告（CancerBERT 网络）中提取数据的问答系统：开发研究。

J Med Internet Res. 2022 Mar 23;24(3):e27210. doi: 10.2196/27210.

Automatic detection of actionable radiology reports using bidirectional encoder representations from transformers.使用来自 Transformer 的双向编码器表示自动检测可操作的放射学报告。

BMC Med Inform Decis Mak. 2021 Sep 11;21(1):262. doi: 10.1186/s12911-021-01623-6.

RadBERT: Adapting Transformer-based Language Models to Radiology.RadBERT：使基于Transformer的语言模型适用于放射学领域。

Radiol Artif Intell. 2022 Jun 15;4(4):e210258. doi: 10.1148/ryai.210258. eCollection 2022 Jul.

Deep Learning Approach for Negation and Speculation Detection for Automated Important Finding Flagging and Extraction in Radiology Report: Internal Validation and Technique Comparison Study.用于放射学报告中自动重要发现标记和提取的否定与推测检测的深度学习方法：内部验证与技术比较研究

JMIR Med Inform. 2023 Apr 25;11:e46348. doi: 10.2196/46348.

Natural language processing deep learning models for the differential between high-grade gliomas and metastasis: what if the key is how we report them?自然语言处理深度学习模型在高级别胶质瘤和转移瘤鉴别中的应用：如果关键在于我们如何报告这些结果呢？

Eur Radiol. 2024 Mar;34(3):2113-2120. doi: 10.1007/s00330-023-10202-4. Epub 2023 Sep 4.

Information extraction from weakly structured radiological reports with natural language queries.利用自然语言查询从弱结构放射学报告中提取信息。

Eur Radiol. 2024 Jan;34(1):330-337. doi: 10.1007/s00330-023-09977-3. Epub 2023 Jul 28.

引用本文的文献

Clinical applications of large language models in medicine and surgery: A scoping review.大型语言模型在医学与外科中的临床应用：一项范围综述

J Int Med Res. 2025 Jul;53(7):3000605251347556. doi: 10.1177/03000605251347556. Epub 2025 Jul 4.

A Narrative Review on the Application of Large Language Models to Support Cancer Care and Research.关于应用大语言模型支持癌症护理与研究的叙述性综述。

Yearb Med Inform. 2024 Aug;33(1):90-98. doi: 10.1055/s-0044-1800726. Epub 2025 Apr 8.

Foundation Models in Radiology: What, How, Why, and Why Not.放射学中的基础模型：是什么、如何、为何以及为何不。

Radiology. 2025 Feb;314(2):e240597. doi: 10.1148/radiol.240597.

Automated MRI pituitary structured reporting from free-text using a fine-tuned Llama model: a feasibility study.使用微调的Llama模型从自由文本自动生成MRI垂体结构化报告：一项可行性研究。

Jpn J Radiol. 2025 May;43(5):770-778. doi: 10.1007/s11604-024-01721-1. Epub 2024 Dec 28.

Based on Medicine, The Now and Future of Large Language Models.基于医学，大语言模型的现状与未来。

Cell Mol Bioeng. 2024 Sep 16;17(4):263-277. doi: 10.1007/s12195-024-00820-3. eCollection 2024 Aug.

A Large Language Model to Detect Negated Expressions in Radiology Reports.一种用于检测放射学报告中否定表达的大语言模型。

J Imaging Inform Med. 2025 Jun;38(3):1297-1303. doi: 10.1007/s10278-024-01274-9. Epub 2024 Sep 25.

A scoping review of large language model based approaches for information extraction from radiology reports.基于大语言模型从放射学报告中提取信息的方法的范围综述。

NPJ Digit Med. 2024 Aug 24;7(1):222. doi: 10.1038/s41746-024-01219-0.

Extraction of Radiological Characteristics From Free-Text Imaging Reports Using Natural Language Processing Among Patients With Ischemic and Hemorrhagic Stroke: Algorithm Development and Validation.使用自然语言处理从缺血性和出血性中风患者的自由文本影像报告中提取放射学特征：算法开发与验证

JMIR AI. 2023 Jun 6;2:e42884. doi: 10.2196/42884.

Artificial Intelligence-Assisted Cancer Status Detection in Radiology Reports.人工智能辅助放射学报告中的癌症状态检测。

Cancer Res Commun. 2024 Apr 9;4(4):1041-1049. doi: 10.1158/2767-9764.CRC-24-0064.

Year 2022 in Medical Natural Language Processing: Availability of Language Models as a Step in the Democratization of NLP in the Biomedical Area.2022 年医学自然语言处理：语言模型的可用性是生物医学领域 NLP 民主化的一步。

Yearb Med Inform. 2023 Aug;32(1):244-252. doi: 10.1055/s-0043-1768752. Epub 2023 Dec 26.

本文引用的文献

Applications of natural language processing in radiology: A systematic review.自然语言处理在放射学中的应用：一项系统综述。

Int J Med Inform. 2022 Jul;163:104779. doi: 10.1016/j.ijmedinf.2022.104779. Epub 2022 Apr 26.

CT Angiography Clot Burden Score from Data Mining of Structured Reports for Pulmonary Embolism.CT 血管造影血栓负担评分来自肺栓塞结构化报告的数据挖掘。

Radiology. 2022 Jan;302(1):175-184. doi: 10.1148/radiol.2021211013. Epub 2021 Sep 28.

Automated Organ-Level Classification of Free-Text Pathology Reports to Support a Radiology Follow-up Tracking Engine.用于支持放射学随访跟踪引擎的自由文本病理报告的自动器官水平分类

Radiol Artif Intell. 2019 Aug 7;1(5):e180052. doi: 10.1148/ryai.2019180052. eCollection 2019 Sep.

Joint Imaging Platform for Federated Clinical Data Analytics.联合成像平台，用于联邦临床数据分析。

JCO Clin Cancer Inform. 2020 Nov;4:1027-1038. doi: 10.1200/CCI.20.00045.

Improving radiologic communication in oncology: a single-centre experience with structured reporting for cancer patients.改善肿瘤学中的放射学沟通：癌症患者结构化报告的单中心经验。

Insights Imaging. 2020 Sep 29;11(1):106. doi: 10.1186/s13244-020-00907-1.

Use of Natural Language Processing to Assess Frequency of Functional Status Documentation for Patients Newly Diagnosed With Colorectal Cancer.使用自然语言处理评估新诊断为结直肠癌患者的功能状态文档记录频率。

JAMA Oncol. 2020 Oct 1;6(10):1628-1630. doi: 10.1001/jamaoncol.2020.2708.

Preparing Medical Imaging Data for Machine Learning.医学影像数据的机器学习准备

Radiology. 2020 Apr;295(1):4-15. doi: 10.1148/radiol.2020192224. Epub 2020 Feb 18.

Redefining the structure of structured reporting in radiology.重新定义放射学结构化报告的结构。

Insights Imaging. 2020 Feb 4;11(1):10. doi: 10.1186/s13244-019-0831-6.

Natural Language Processing Approaches to Detect the Timeline of Metastatic Recurrence of Breast Cancer.用于检测乳腺癌转移复发时间线的自然语言处理方法

JCO Clin Cancer Inform. 2019 Oct;3:1-12. doi: 10.1200/CCI.19.00034.

Use of Natural Language Processing to Extract Clinical Cancer Phenotypes from Electronic Medical Records.利用自然语言处理从电子病历中提取临床癌症表型

Cancer Res. 2019 Nov 1;79(21):5463-5470. doi: 10.1158/0008-5472.CAN-19-0579. Epub 2019 Aug 8.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于深度学习的结构化放射学报告自然语言处理对肿瘤学结果的评估

Deep Learning-based Assessment of Oncologic Outcomes from Natural Language Processing of Structured Radiology Reports.

作者信息

机构信息

出版信息

PURPOSE

MATERIALS AND METHODS

RESULTS

CONCLUSION

目的

材料与方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献