基于主题捕获和局部实体池化的生物医学文档级关系抽取

Biomedical document-level relation extraction with thematic capture and localized entity pooling.

作者信息

Li Yuqing, Shao Xinhui

机构信息

Department of Mathematics, College of Sciences, Northeastern University, Shenyang, China.

出版信息

J Biomed Inform. 2024 Dec;160:104756. doi: 10.1016/j.jbi.2024.104756. Epub 2024 Nov 30.

DOI:10.1016/j.jbi.2024.104756

PMID:39622399

Abstract

In contrast to sentence-level relational extraction, document-level relation extraction poses greater challenges as a document typically contains multiple entities, and one entity may be associated with multiple other entities. Existing methods often rely on graph structures to capture path representations between entity pairs. However, this paper introduces a novel approach called local entity pooling that solely relies on the pre-training model to identify the bridge entity related to the current entity pair and generate the reasoning path representation. This technique effectively mitigates the multi-entity problem. Additionally, the model leverages the multi-entity and multi-label characteristics of the document to acquire the document's thematic representation, thereby enhancing the document-level relation extraction task. Experimental evaluations conducted on two biomedical datasets, CDR and GDA. Our TCLEP (Thematic Capture and Localized Entity Pooling) model achieved the Macro-F1 scores of 71.7% and 85.3%, respectively. Simultaneously, we incorporated local entity pooling and thematic capture modules into the state-of-the-art model, resulting in performance improvements of 1.5% and 0.2% on the respective datasets. These results highlight the advanced performance of our proposed approach.

摘要

与句子级关系抽取相比，文档级关系抽取带来了更大的挑战，因为文档通常包含多个实体，并且一个实体可能与多个其他实体相关联。现有方法通常依赖图结构来捕获实体对之间的路径表示。然而，本文介绍了一种名为局部实体池化的新颖方法，该方法仅依靠预训练模型来识别与当前实体对相关的桥梁实体并生成推理路径表示。这种技术有效地缓解了多实体问题。此外，该模型利用文档的多实体和多标签特征来获取文档的主题表示，从而增强文档级关系抽取任务。在两个生物医学数据集CDR和GDA上进行了实验评估。我们的TCLEP（主题捕获和局部实体池化）模型分别取得了71.7%和85.3%的宏F1分数。同时，我们将局部实体池化和主题捕获模块纳入到最先进的模型中，在各自的数据集上分别带来了1.5%和0.2%的性能提升。这些结果突出了我们所提出方法的先进性能。

相似文献

Biomedical document-level relation extraction with thematic capture and localized entity pooling.基于主题捕获和局部实体池化的生物医学文档级关系抽取

J Biomed Inform. 2024 Dec;160:104756. doi: 10.1016/j.jbi.2024.104756. Epub 2024 Nov 30.

BAMRE: Joint extraction model of Chinese medical entities and relations based on Biaffine transformation with relation attention.基于关系注意力的双线性变换的中文医疗实体和关系联合抽取模型。

J Biomed Inform. 2024 Oct;158:104733. doi: 10.1016/j.jbi.2024.104733. Epub 2024 Oct 3.

SyRACT: zero-shot biomedical document-level relation extraction with synergistic RAG and CoT.SyRACT：基于协同检索增强生成（RAG）和思维链（CoT）的零样本生物医学文档级关系抽取

Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf356.

Relation Extraction in Biomedical Texts: A Cross-Sentence Approach.生物医学文本中的关系抽取：一种跨句子方法。

IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):2156-2166. doi: 10.1109/TCBB.2024.3451348. Epub 2024 Dec 10.

HEART: Learning better representation of EHR data with a heterogeneous relation-aware transformer.心脏：使用异构关系感知转换器学习更好的 EHR 数据表示。

J Biomed Inform. 2024 Nov;159:104741. doi: 10.1016/j.jbi.2024.104741. Epub 2024 Oct 29.

PLRTE: Progressive learning for biomedical relation triplet extraction using large language models.基于大语言模型的生物医学关系三元组抽取的渐进式学习方法（PLRTE）。

J Biomed Inform. 2024 Nov;159:104738. doi: 10.1016/j.jbi.2024.104738. Epub 2024 Oct 18.

SSGU-CD: A combined semantic and structural information graph U-shaped network for document-level Chemical-Disease interaction extraction.SSGU-CD：一种用于文档级化学-疾病交互作用提取的结合语义和结构信息图 U 形网络。

J Biomed Inform. 2024 Sep;157:104719. doi: 10.1016/j.jbi.2024.104719. Epub 2024 Aug 29.

Augmenting biomedical named entity recognition with general-domain resources.利用通用领域资源增强生物医学命名实体识别。

J Biomed Inform. 2024 Nov;159:104731. doi: 10.1016/j.jbi.2024.104731. Epub 2024 Oct 4.

Extracting adverse drug events from clinical Notes: A systematic review of approaches used.从临床记录中提取药物不良事件：对所用方法的系统评价

J Biomed Inform. 2024 Mar;151:104603. doi: 10.1016/j.jbi.2024.104603. Epub 2024 Feb 6.

Enhancing biomedical relation extraction with directionality.通过方向性增强生物医学关系提取

Bioinformatics. 2025 Jul 1;41(Supplement_1):i68-i76. doi: 10.1093/bioinformatics/btaf226.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于主题捕获和局部实体池化的生物医学文档级关系抽取

Biomedical document-level relation extraction with thematic capture and localized entity pooling.

作者信息

机构信息

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献