从放射学报告中进行弱监督空间关系提取。

Weakly supervised spatial relation extraction from radiology reports.

作者信息

Datta Surabhi, Roberts Kirk

机构信息

School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas, USA.

出版信息

JAMIA Open. 2023 Apr 22;6(2):ooad027. doi: 10.1093/jamiaopen/ooad027. eCollection 2023 Jul.

DOI:10.1093/jamiaopen/ooad027

PMID:37096148

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10122604/

Abstract

OBJECTIVE

Weak supervision holds significant promise to improve clinical natural language processing by leveraging domain resources and expertise instead of large manually annotated datasets alone. Here, our objective is to evaluate a weak supervision approach to extract spatial information from radiology reports.

MATERIALS AND METHODS

Our weak supervision approach is based on data programming that uses rules (or labeling functions) relying on domain-specific dictionaries and radiology language characteristics to generate weak labels. The labels correspond to different spatial relations that are critical to understanding radiology reports. These weak labels are then used to fine-tune a pretrained Bidirectional Encoder Representations from Transformers (BERT) model.

RESULTS

Our weakly supervised BERT model provided satisfactory results in extracting spatial relations without manual annotations for training (spatial trigger F1: 72.89, relation F1: 52.47). When this model is further fine-tuned on manual annotations (relation F1: 68.76), performance surpasses the fully supervised state-of-the-art.

DISCUSSION

To our knowledge, this is the first work to automatically create detailed weak labels corresponding to radiological information of clinical significance. Our data programming approach is (1) adaptable as the labeling functions can be updated with relatively little manual effort to incorporate more variations in radiology language reporting formats and (2) generalizable as these functions can be applied across multiple radiology subdomains in most cases.

CONCLUSIONS

We demonstrate a weakly supervision model performs sufficiently well in identifying a variety of relations from radiology text without manual annotations, while exceeding state-of-the-art results when annotated data are available.

摘要

目的

弱监督通过利用领域资源和专业知识而非仅依靠大型人工标注数据集，在改善临床自然语言处理方面具有巨大潜力。在此，我们的目标是评估一种从放射学报告中提取空间信息的弱监督方法。

材料与方法

我们的弱监督方法基于数据编程，该编程使用依赖于特定领域词典和放射学语言特征的规则（或标注函数）来生成弱标签。这些标签对应于理解放射学报告至关重要的不同空间关系。然后，这些弱标签用于微调预训练的来自变换器的双向编码器表征（BERT）模型。

结果

我们的弱监督BERT模型在无需人工标注进行训练的情况下，在提取空间关系方面取得了令人满意的结果（空间触发F1值：72.89，关系F1值：52.47）。当该模型在人工标注上进一步微调时（关系F1值：68.76），性能超过了完全监督的当前最优方法。

讨论

据我们所知，这是第一项自动创建与具有临床意义的放射学信息相对应的详细弱标签的工作。我们的数据编程方法具有以下特点：（1）具有适应性，因为标注函数可以通过相对较少的人工努力进行更新，以纳入放射学语言报告格式中的更多变化；（2）具有通用性，因为在大多数情况下，这些函数可以应用于多个放射学子领域。

结论

我们证明了一个弱监督模型在无需人工标注的情况下，从放射学文本中识别各种关系方面表现良好，而在有标注数据时，其性能超过了当前最优结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f0/10122604/f8cdcefbb1f6/ooad027f1.jpg

相似文献

Weakly supervised spatial relation extraction from radiology reports.从放射学报告中进行弱监督空间关系提取。

JAMIA Open. 2023 Apr 22;6(2):ooad027. doi: 10.1093/jamiaopen/ooad027. eCollection 2023 Jul.

Language model-based labeling of German thoracic radiology reports.基于语言模型的德国胸部放射学报告标注

Rofo. 2025 Jan;197(1):55-64. doi: 10.1055/a-2287-5054. Epub 2024 Apr 25.

Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework.基于 BERT（来自 Transformers 的双向编码器表示）的深度学习方法在提取中文放射学报告证据中的应用：计算机辅助肝癌诊断框架的开发。

J Med Internet Res. 2021 Jan 12;23(1):e19689. doi: 10.2196/19689.

Information extraction from weakly structured radiological reports with natural language queries.利用自然语言查询从弱结构放射学报告中提取信息。

Eur Radiol. 2024 Jan;34(1):330-337. doi: 10.1007/s00330-023-09977-3. Epub 2023 Jul 28.

Extracting comprehensive clinical information for breast cancer using deep learning methods.利用深度学习方法提取乳腺癌全面临床信息。

Int J Med Inform. 2019 Dec;132:103985. doi: 10.1016/j.ijmedinf.2019.103985. Epub 2019 Oct 2.

Ontology-driven and weakly supervised rare disease identification from clinical notes.基于本体的临床笔记辅助下的弱监督罕见病识别。

BMC Med Inform Decis Mak. 2023 May 5;23(1):86. doi: 10.1186/s12911-023-02181-9.

Fine-grained spatial information extraction in radiology as two-turn question answering.放射学中细粒度空间信息提取作为两阶段问答

Int J Med Inform. 2021 Nov 6;158:104628. doi: 10.1016/j.ijmedinf.2021.104628.

Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT).使用基于转换器的双向编码器表示 (BERT) 和领域内预训练 (IDPT) 对耳鸣患者的可操作放射学报告进行自动文本分类。

BMC Med Inform Decis Mak. 2022 Jul 30;22(1):200. doi: 10.1186/s12911-022-01946-y.

Understanding spatial language in radiology: Representation framework, annotation, and spatial relation extraction from chest X-ray reports using deep learning.理解放射学中的空间语言：使用深度学习从胸部X光报告中进行表示框架、标注和空间关系提取。

J Biomed Inform. 2020 Aug;108:103473. doi: 10.1016/j.jbi.2020.103473. Epub 2020 Jun 18.

Extracting Pulmonary Nodules and Nodule Characteristics from Radiology Reports of Lung Cancer Screening Patients Using Transformer Models.使用Transformer模型从肺癌筛查患者的放射学报告中提取肺结节及结节特征

J Healthc Inform Res. 2024 May 17;8(3):463-477. doi: 10.1007/s41666-024-00166-5. eCollection 2024 Sep.

引用本文的文献

Information Extraction from Lumbar Spine MRI Radiology Reports Using GPT4: Accuracy and Benchmarking Against Research-Grade Comprehensive Scoring.使用GPT4从腰椎MRI放射学报告中提取信息：准确性及与研究级综合评分的基准对比

Diagnostics (Basel). 2025 Apr 4;15(7):930. doi: 10.3390/diagnostics15070930.

Year 2023 in Biomedical Natural Language Processing: a Tribute to Large Language Models and Generative AI.2023年生物医学自然语言处理领域：向大语言模型和生成式人工智能致敬。

Yearb Med Inform. 2024 Aug;33(1):241-248. doi: 10.1055/s-0044-1800751. Epub 2025 Apr 8.

Leveraging large language models for knowledge-free weak supervision in clinical natural language processing.在临床自然语言处理中利用大语言模型进行无知识弱监督。

Sci Rep. 2025 Mar 10;15(1):8241. doi: 10.1038/s41598-024-68168-2.

A scoping review of large language model based approaches for information extraction from radiology reports.基于大语言模型从放射学报告中提取信息的方法的范围综述。

NPJ Digit Med. 2024 Aug 24;7(1):222. doi: 10.1038/s41746-024-01219-0.

Leveraging Large Language Models for Knowledge-free Weak Supervision in Clinical Natural Language Processing.利用大语言模型进行临床自然语言处理中的无知识弱监督

Res Sq. 2024 Jun 28:rs.3.rs-4559971. doi: 10.21203/rs.3.rs-4559971/v1.

Scalable Approach to Consumer Wearable Postmarket Surveillance: Development and Validation Study.消费者可穿戴设备上市后监测的可扩展方法：开发与验证研究

JMIR Med Inform. 2024 Apr 4;12:e51171. doi: 10.2196/51171.

本文引用的文献

Classifying the lifestyle status for Alzheimer's disease from clinical notes using deep learning with weak supervision.使用基于弱监督的深度学习对临床笔记进行阿尔茨海默病生活方式状况分类。

BMC Med Inform Decis Mak. 2022 Jul 7;22(Suppl 1):88. doi: 10.1186/s12911-022-01819-4.

Strategies to Address the Lack of Labeled Data for Supervised Machine Learning Training With Electronic Health Records: Case Study for the Extraction of Symptoms From Clinical Notes.应对电子健康记录监督式机器学习训练中标记数据不足的策略：从临床笔记中提取症状的案例研究

JMIR Med Inform. 2022 Mar 14;10(3):e32903. doi: 10.2196/32903.

Rare Disease Identification from Clinical Notes with Ontologies and Weak Supervision.利用本体和弱监督从临床记录中识别罕见病。

Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:2294-2298. doi: 10.1109/EMBC46164.2021.9630043.

Extracting and Learning Fine-Grained Labels from Chest Radiographs.从胸部X光片中提取和学习细粒度标签。

AMIA Annu Symp Proc. 2021 Jan 25;2020:1190-1199. eCollection 2020.

Ontology-driven weak supervision for clinical entity classification in electronic health records.基于本体的电子健康记录中临床实体分类的弱监督方法。

Nat Commun. 2021 Apr 1;12(1):2017. doi: 10.1038/s41467-021-22328-4.

Multi-task weak supervision enables anatomically-resolved abnormality detection in whole-body FDG-PET/CT.多任务弱监督实现了全身 FDG-PET/CT 解剖解析的异常检测。

Nat Commun. 2021 Mar 25;12(1):1880. doi: 10.1038/s41467-021-22018-1.

Using weak supervision and deep learning to classify clinical notes for identification of current suicidal ideation.利用弱监督和深度学习对临床记录进行分类，以识别当前的自杀意念。

J Psychiatr Res. 2021 Apr;136:95-102. doi: 10.1016/j.jpsychires.2021.01.052. Epub 2021 Feb 2.

A Hybrid Deep Learning Approach for Spatial Trigger Extraction from Radiology Reports.一种用于从放射学报告中提取空间触发词的混合深度学习方法。

Proc Conf Empir Methods Nat Lang Process. 2020 Nov;2020:50-55. doi: 10.18653/v1/2020.splu-1.6.

Rad-SpatialNet: A Frame-based Resource for Fine-Grained Spatial Relations in Radiology Reports.Rad-SpatialNet：用于放射学报告中细粒度空间关系的基于框架的资源。

LREC Int Conf Lang Resour Eval. 2020 May;2020:2251-2260.

A corpus-driven standardization framework for encoding clinical problems with HL7 FHIR.一种用于使用HL7 FHIR对临床问题进行编码的语料库驱动标准化框架。

J Biomed Inform. 2020 Oct;110:103541. doi: 10.1016/j.jbi.2020.103541. Epub 2020 Aug 16.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

从放射学报告中进行弱监督空间关系提取。

Weakly supervised spatial relation extraction from radiology reports.

作者信息

机构信息

出版信息

OBJECTIVE

MATERIALS AND METHODS

RESULTS

DISCUSSION

CONCLUSIONS

目的

材料与方法

结果

讨论

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献