Das Avisha, Talati Ish A, Chaves Juan Manuel Zambrano, Rubin Daniel, Banerjee Imon
Arizona Advanced AI & Innovation (A3I) Hub, Mayo Clinic Arizona, Phoenix, AZ, USA.
Department of Radiology, Stanford University, Stanford, CA, USA.
NPJ Digit Med. 2025 May 8;8(1):257. doi: 10.1038/s41746-025-01522-4.
Critical findings in radiology reports are life-threatening conditions that must be communicated promptly to physicians for timely patient management. Although challenging, advances in natural language processing (NLP), particularly large language models (LLMs), now enable the automated identification of key findings in verbose reports. Given the scarcity of labeled critical-findings data, we implemented a two-phase, weakly supervised fine-tuning approach on 15,000 unlabeled Mayo Clinic reports. The fine-tuned model then automatically extracted critical terms on internal (Mayo Clinic, n = 80) and external (MIMIC-III, n = 123) test datasets, validated against expert annotations. Model performance was further assessed on 5000 MIMIC-IV reports using the LLM-aided metrics G-eval and Prometheus. Both manual and LLM-based evaluations showed improved task alignment with weak supervision. The pipeline and model, publicly available under an academic license, can aid critical-finding extraction for research and clinical use (https://github.com/dasavisha/CriticalFindings_Extract).
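To illustrate the weak-supervision idea described in the abstract, the sketch below shows how rule-based pseudo-labels might be generated from unlabeled reports (phase 1) to supply training pairs for fine-tuning an LLM extractor (phase 2). This is a minimal, hypothetical example: the term list, report texts, and function names are assumptions, not the authors' actual labeling rules or model.

```python
# Hypothetical sketch of weakly supervised pseudo-labeling for critical
# findings. The term list and reports below are illustrative only.
CRITICAL_TERMS = [
    "pneumothorax",
    "pulmonary embolism",
    "aortic dissection",
    "free air",
    "intracranial hemorrhage",
]

def weak_label(report: str) -> list[str]:
    """Return critical terms found in a report via case-insensitive matching."""
    text = report.lower()
    return [term for term in CRITICAL_TERMS if term in text]

def build_pseudo_dataset(reports: list[str]) -> list[dict]:
    """Pair each report with its weakly labeled findings as fine-tuning data."""
    return [{"input": r, "findings": weak_label(r)} for r in reports]

reports = [
    "CT chest: small right apical pneumothorax, no pleural effusion.",
    "Unremarkable abdominal ultrasound.",
]
dataset = build_pseudo_dataset(reports)
print(dataset[0]["findings"])  # → ['pneumothorax']
print(dataset[1]["findings"])  # → []
```

In a two-phase setup such as the one described, these noisy pseudo-labeled pairs would replace scarce expert annotations as supervision for the first fine-tuning phase, with later validation against expert-annotated test sets.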