• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

放射学中细粒度空间信息提取作为两阶段问答

Fine-grained spatial information extraction in radiology as two-turn question answering.

作者信息

Datta Surabhi, Roberts Kirk

机构信息

School of Biomedical Informatics, The University of Texas Health Science Center, Houston, TX, United States.

出版信息

Int J Med Inform. 2021 Nov 6;158:104628. doi: 10.1016/j.ijmedinf.2021.104628.

DOI:10.1016/j.ijmedinf.2021.104628
PMID:34839119
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9072592/
Abstract

OBJECTIVES

Radiology reports contain important clinical information that can be used to automatically construct fine-grained labels for applications requiring deep phenotyping. We propose a two-turn question answering (QA) method based on a transformer language model, BERT, for extracting detailed spatial information from radiology reports. We aim to demonstrate the advantage that a multi-turn QA framework provides over sequence-based methods for extracting fine-grained information.

METHODS

Our proposed method identifies spatial and descriptor information by answering queries given a radiology report text. We frame the extraction problem such that all the main radiology entities (e.g., finding, device, anatomy) and the spatial trigger terms (denoting the presence of a spatial relation between finding/device and anatomical location) are identified in the first turn. In the subsequent turn, various other contextual information that acts as important spatial roles with respect to a spatial trigger term are extracted along with identifying the spatial and other descriptor terms qualifying a radiological entity. The queries are constructed using separate templates for the two turns and we employ two query variations in the second turn.

RESULTS

When compared to the best-reported work on this task using a traditional sequence tagging method, the two-turn QA model exceeds its performance on every component. This includes promising improvements of 12, 13, and 12 points in the average F1 scores for identifying the spatial triggers, Figure, and Ground frame elements, respectively.

DISCUSSION

Our experiments suggest that incorporating domain knowledge in the query (a general description about a frame element) helps in obtaining better results for some of the spatial and descriptive frame elements, especially in the case of the clinical pre-trained BERT model. We further highlight that the two-turn QA approach fits well for extracting information for complex schema where the objective is to identify all the frame elements linked to each spatial trigger and finding/device/anatomy entity, thereby enabling the extraction of more comprehensive information in the radiology domain.

CONCLUSION

Extracting fine-grained spatial information from text in the form of answering natural language queries holds potential in achieving better results when compared to more standard sequence labeling-based approaches.

摘要

目的

放射学报告包含重要的临床信息,可用于为需要深度表型分析的应用自动构建细粒度标签。我们提出了一种基于变压器语言模型BERT的两阶段问答(QA)方法,用于从放射学报告中提取详细的空间信息。我们旨在证明多阶段QA框架在提取细粒度信息方面比基于序列的方法具有优势。

方法

我们提出的方法通过回答给定放射学报告文本的查询来识别空间和描述符信息。我们构建提取问题,以便在第一阶段识别所有主要的放射学实体(例如,发现、设备、解剖结构)和空间触发词(表示发现/设备与解剖位置之间存在空间关系)。在随后的阶段,除了识别限定放射学实体的空间和其他描述符术语外,还提取相对于空间触发词起重要空间作用的各种其他上下文信息。这两个阶段使用单独的模板构建查询,并且在第二阶段我们采用两种查询变体。

结果

与使用传统序列标记方法在该任务上报告的最佳工作相比,两阶段QA模型在每个组件上都超过了其性能。这包括在识别空间触发词、图和地框架元素的平均F1分数方面分别有12、13和12分的显著提高。

讨论

我们的实验表明,在查询中纳入领域知识(关于框架元素的一般描述)有助于为一些空间和描述性框架元素获得更好的结果,特别是在临床预训练的BERT模型的情况下。我们进一步强调,两阶段QA方法非常适合为复杂模式提取信息,其目标是识别与每个空间触发词以及发现/设备/解剖结构实体相关联的所有框架元素,从而能够在放射学领域提取更全面的信息。

结论

与更标准的基于序列标记的方法相比,以回答自然语言查询的形式从文本中提取细粒度空间信息具有取得更好结果的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a698/9072592/e5a778a5190c/nihms-1759745-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a698/9072592/f8d4be29002d/nihms-1759745-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a698/9072592/41645f2e6bae/nihms-1759745-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a698/9072592/e5a778a5190c/nihms-1759745-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a698/9072592/f8d4be29002d/nihms-1759745-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a698/9072592/41645f2e6bae/nihms-1759745-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a698/9072592/e5a778a5190c/nihms-1759745-f0004.jpg

相似文献

1
Fine-grained spatial information extraction in radiology as two-turn question answering.放射学中细粒度空间信息提取作为两阶段问答
Int J Med Inform. 2021 Nov 6;158:104628. doi: 10.1016/j.ijmedinf.2021.104628.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Short-Term Memory Impairment短期记忆障碍
4
Event-Based Clinical Finding Extraction from Radiology Reports with Pre-trained Language Model.基于事件的放射学报告临床发现提取与预训练语言模型。
J Digit Imaging. 2023 Feb;36(1):91-104. doi: 10.1007/s10278-022-00717-5. Epub 2022 Oct 17.
5
Sexual Harassment and Prevention Training性骚扰与预防培训
6
Gender differences in the context of interventions for improving health literacy in migrants: a qualitative evidence synthesis.移民健康素养提升干预措施背景下的性别差异:一项定性证据综合分析
Cochrane Database Syst Rev. 2024 Dec 12;12(12):CD013302. doi: 10.1002/14651858.CD013302.pub2.
7
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
8
Sentences, entities, and keyphrases extraction from consumer health forums using multi-task learning.使用多任务学习从消费者健康论坛中提取句子、实体和关键短语。
J Biomed Semantics. 2025 May 6;16(1):8. doi: 10.1186/s13326-025-00329-2.
9
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.
10
Factors that influence caregivers' and adolescents' views and practices regarding human papillomavirus (HPV) vaccination for adolescents: a qualitative evidence synthesis.影响照顾者和青少年对青少年人乳头瘤病毒(HPV)疫苗接种的看法及做法的因素:一项定性证据综合分析
Cochrane Database Syst Rev. 2025 Apr 15;4(4):CD013430. doi: 10.1002/14651858.CD013430.pub2.

引用本文的文献

1
Uncertainty-aware automatic TNM staging classification for [F] Fluorodeoxyglucose PET-CT reports for lung cancer utilising transformer-based language models and multi-task learning.利用基于Transformer的语言模型和多任务学习对[F]氟脱氧葡萄糖PET-CT肺癌报告进行不确定性感知自动TNM分期分类。
BMC Med Inform Decis Mak. 2024 Dec 18;24(1):396. doi: 10.1186/s12911-024-02814-7.
2
Question Answering for Electronic Health Records: Scoping Review of Datasets and Models.电子健康记录问答:数据集和模型的范围综述。
J Med Internet Res. 2024 Oct 30;26:e53636. doi: 10.2196/53636.
3
A scoping review of large language model based approaches for information extraction from radiology reports.

本文引用的文献

1
Biomedical named entity recognition using BERT in the machine reading comprehension framework.基于机器阅读理解框架的 BERT 在生物医学命名实体识别中的应用。
J Biomed Inform. 2021 Jun;118:103799. doi: 10.1016/j.jbi.2021.103799. Epub 2021 May 6.
2
Extracting and Learning Fine-Grained Labels from Chest Radiographs.从胸部X光片中提取和学习细粒度标签。
AMIA Annu Symp Proc. 2021 Jan 25;2020:1190-1199. eCollection 2020.
3
Extracting clinical terms from radiology reports with deep learning.深度学习从放射学报告中提取临床术语。
基于大语言模型从放射学报告中提取信息的方法的范围综述。
NPJ Digit Med. 2024 Aug 24;7(1):222. doi: 10.1038/s41746-024-01219-0.
4
SELF-SUPERVISED LEARNING WITH RADIOLOGY REPORTS, A COMPARATIVE ANALYSIS OF STRATEGIES FOR LARGE VESSEL OCCLUSION AND BRAIN CTA IMAGES.基于放射学报告的自监督学习:大血管闭塞与脑CT血管造影图像策略的比较分析
Proc IEEE Int Symp Biomed Imaging. 2023 Apr;2023. doi: 10.1109/isbi53787.2023.10230623. Epub 2023 Sep 1.
5
quEHRy: a question answering system to query electronic health records.QueHRy:一个问答系统,用于查询电子健康记录。
J Am Med Inform Assoc. 2023 May 19;30(6):1091-1102. doi: 10.1093/jamia/ocad050.
J Biomed Inform. 2021 Apr;116:103729. doi: 10.1016/j.jbi.2021.103729. Epub 2021 Mar 9.
4
A Hybrid Deep Learning Approach for Spatial Trigger Extraction from Radiology Reports.一种用于从放射学报告中提取空间触发词的混合深度学习方法。
Proc Conf Empir Methods Nat Lang Process. 2020 Nov;2020:50-55. doi: 10.18653/v1/2020.splu-1.6.
5
Rad-SpatialNet: A Frame-based Resource for Fine-Grained Spatial Relations in Radiology Reports.Rad-SpatialNet:用于放射学报告中细粒度空间关系的基于框架的资源。
LREC Int Conf Lang Resour Eval. 2020 May;2020:2251-2260.
6
Applying a deep learning-based sequence labeling approach to detect attributes of medical concepts in clinical text.应用基于深度学习的序列标注方法来检测临床文本中医疗概念的属性。
BMC Med Inform Decis Mak. 2019 Dec 5;19(Suppl 5):236. doi: 10.1186/s12911-019-0937-2.
7
A validated natural language processing algorithm for brain imaging phenotypes from radiology reports in UK electronic health records.从英国电子健康记录中的放射学报告中提取脑影像学表型的验证自然语言处理算法。
BMC Med Inform Decis Mak. 2019 Sep 9;19(1):184. doi: 10.1186/s12911-019-0908-7.
8
Enhancing clinical concept extraction with contextual embeddings.利用上下文嵌入增强临床概念提取。
J Am Med Inform Assoc. 2019 Nov 1;26(11):1297-1304. doi: 10.1093/jamia/ocz096.
9
Automated Detection of Measurements and Their Descriptors in Radiology Reports Using a Hybrid Natural Language Processing Algorithm.使用混合自然语言处理算法自动检测放射学报告中的测量值及其描述符。
J Digit Imaging. 2019 Aug;32(4):544-553. doi: 10.1007/s10278-019-00237-9.
10
Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning.利用机器学习从放射学报告中提取完整的结构化信息。
J Digit Imaging. 2019 Aug;32(4):554-564. doi: 10.1007/s10278-019-00234-y.