BioEGRE：一种基于 BioELECTRA 和图指针神经网络的生物医学关系抽取的语言拓扑增强方法。

BioEGRE: a linguistic topology enhanced method for biomedical relation extraction based on BioELECTRA and graph pointer neural network.

机构信息

Academy of Military Medical Sciences, Beijing, 100039, China.

出版信息

BMC Bioinformatics. 2023 Dec 19;24(1):486. doi: 10.1186/s12859-023-05601-9.

DOI:10.1186/s12859-023-05601-9

PMID:38114906

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10731880/

Abstract

BACKGROUND

Automatic and accurate extraction of diverse biomedical relations from literature is a crucial component of bio-medical text mining. Currently, stacking various classification networks on pre-trained language models to perform fine-tuning is a common framework to end-to-end solve the biomedical relation extraction (BioRE) problem. However, the sequence-based pre-trained language models underutilize the graphical topology of language to some extent. In addition, sequence-oriented deep neural networks have limitations in processing graphical features.

RESULTS

In this paper, we propose a novel method for sentence-level BioRE task, BioEGRE (BioELECTRA and Graph pointer neural net-work for Relation Extraction), aimed at leveraging the linguistic topological features. First, the biomedical literature is preprocessed to retain sentences involving pre-defined entity pairs. Secondly, SciSpaCy is employed to conduct dependency parsing; sentences are modeled as graphs based on the parsing results; BioELECTRA is utilized to generate token-level representations, which are modeled as attributes of nodes in the sentence graphs; a graph pointer neural network layer is employed to select the most relevant multi-hop neighbors to optimize representations; a fully-connected neural network layer is employed to generate the sentence-level representation. Finally, the Softmax function is employed to calculate the probabilities. Our proposed method is evaluated on three BioRE tasks: a multi-class (CHEMPROT) and two binary tasks (GAD and EU-ADR). The results show that our method achieves F1-scores of 79.97% (CHEMPROT), 83.31% (GAD), and 83.51% (EU-ADR), surpassing the performance of existing state-of-the-art models.

CONCLUSION

The experimental results on 3 biomedical benchmark datasets demonstrate the effectiveness and generalization of BioEGRE, which indicates that linguistic topology and a graph pointer neural network layer explicitly improve performance for BioRE tasks.

摘要

背景

从文献中自动、准确地提取多样化的生物医学关系是生物医学文本挖掘的关键组成部分。目前，基于预训练语言模型堆叠各种分类网络进行微调是端到端解决生物医学关系抽取（BioRE）问题的常用框架。然而，基于序列的预训练语言模型在某种程度上未能充分利用语言的图形拓扑结构。此外，面向序列的深度神经网络在处理图形特征方面存在局限性。

结果

在本文中，我们提出了一种新颖的句子级 BioRE 任务方法，即 BioEGRE（BioELECTRA 和图形指针神经网络关系提取），旨在利用语言的拓扑特征。首先，对生物医学文献进行预处理，以保留涉及预定义实体对的句子。其次，使用 SciSpaCy 进行依存句法分析；根据解析结果将句子建模为图形；使用 BioELECTRA 生成令牌级表示，将其建模为句子图形节点的属性；使用图形指针神经网络层选择最相关的多跳邻居进行优化表示；使用全连接神经网络层生成句子级表示。最后，使用 Softmax 函数计算概率。我们的方法在三个 BioRE 任务上进行了评估：多类（CHEMPROT）和两类（GAD 和 EU-ADR）。结果表明，我们的方法在 CHEMPROT、GAD 和 EU-ADR 三个生物医学基准数据集上的 F1 得分分别达到了 79.97%、83.31%和 83.51%，超过了现有最先进模型的性能。

结论

在 3 个生物医学基准数据集上的实验结果表明，BioEGRE 具有有效性和泛化能力，表明语言拓扑结构和图形指针神经网络层可以显著提高 BioRE 任务的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbf9/10731880/86341e8f8dce/12859_2023_5601_Fig1_HTML.jpg

相似文献

BioEGRE: a linguistic topology enhanced method for biomedical relation extraction based on BioELECTRA and graph pointer neural network.

BMC Bioinformatics. 2023 Dec 19;24(1):486. doi: 10.1186/s12859-023-05601-9.

BioByGANS: biomedical named entity recognition by fusing contextual and syntactic features through graph attention network in node classification framework.

BMC Bioinformatics. 2022 Nov 22;23(1):501. doi: 10.1186/s12859-022-05051-9.

Extracting biomedical relation from cross-sentence text using syntactic dependency graph attention network.

J Biomed Inform. 2023 Aug;144:104445. doi: 10.1016/j.jbi.2023.104445. Epub 2023 Jul 17.

Integrating graph convolutional networks to enhance prompt learning for biomedical relation extraction.

J Biomed Inform. 2024 Sep;157:104717. doi: 10.1016/j.jbi.2024.104717. Epub 2024 Aug 28.

Multi-View Graph Neural Architecture Search for Biomedical Entity and Relation Extraction.

IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):1221-1233. doi: 10.1109/TCBB.2022.3205113. Epub 2023 Apr 3.

Enriching contextualized language model from knowledge graph for biomedical information extraction.

Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa110.

A hybrid model based on neural networks for biomedical relation extraction.

J Biomed Inform. 2018 May;81:83-92. doi: 10.1016/j.jbi.2018.03.011. Epub 2018 Mar 27.

Exploiting graph kernels for high performance biomedical relation extraction.

J Biomed Semantics. 2018 Jan 30;9(1):7. doi: 10.1186/s13326-017-0168-3.

ADPG: Biomedical entity recognition based on Automatic Dependency Parsing Graph.

J Biomed Inform. 2023 Apr;140:104317. doi: 10.1016/j.jbi.2023.104317. Epub 2023 Feb 17.

Relation Extraction in Biomedical Texts Based on Multi-Head Attention Model With Syntactic Dependency Feature: Modeling Study.

JMIR Med Inform. 2022 Oct 20;10(10):e41136. doi: 10.2196/41136.

引用本文的文献

BioGSF: a graph-driven semantic feature integration framework for biomedical relation extraction.

Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf025.

本文引用的文献

BioByGANS: biomedical named entity recognition by fusing contextual and syntactic features through graph attention network in node classification framework.

BMC Bioinformatics. 2022 Nov 22;23(1):501. doi: 10.1186/s12859-022-05051-9.

BioGPT: generative pre-trained transformer for biomedical text generation and mining.

Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac409.

A span-based joint model for extracting entities and relations of bacteria biotopes.

Bioinformatics. 2021 Dec 22;38(1):220-227. doi: 10.1093/bioinformatics/btab593.

Syntactically-informed word representations from graph neural network.

Neurocomputing (Amst). 2020 Nov 6;413:431-443. doi: 10.1016/j.neucom.2020.06.070.

Chemical-protein interaction extraction via Gaussian probability distribution and external biomedical knowledge.

Bioinformatics. 2020 Aug 1;36(15):4323-4330. doi: 10.1093/bioinformatics/btaa491.

A Comprehensive Survey on Graph Neural Networks.

IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):4-24. doi: 10.1109/TNNLS.2020.2978386. Epub 2021 Jan 4.

Attention guided capsule networks for chemical-protein interaction extraction.

J Biomed Inform. 2020 Mar;103:103392. doi: 10.1016/j.jbi.2020.103392. Epub 2020 Feb 15.

Neural network-based approaches for biomedical relation classification: A review.

J Biomed Inform. 2019 Nov;99:103294. doi: 10.1016/j.jbi.2019.103294. Epub 2019 Sep 23.

BioBERT: a pre-trained biomedical language representation model for biomedical text mining.

Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.

BioWordVec, improving biomedical word embeddings with subword information and MeSH.

Sci Data. 2019 May 10;6(1):52. doi: 10.1038/s41597-019-0055-0.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

BioEGRE：一种基于 BioELECTRA 和图指针神经网络的生物医学关系抽取的语言拓扑增强方法。

BioEGRE: a linguistic topology enhanced method for biomedical relation extraction based on BioELECTRA and graph pointer neural network.

机构信息

Academy of Military Medical Sciences, Beijing, 100039, China.

出版信息

BMC Bioinformatics. 2023 Dec 19;24(1):486. doi: 10.1186/s12859-023-05601-9.

DOI:10.1186/s12859-023-05601-9

PMID:38114906

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10731880/

Abstract

BACKGROUND

RESULTS

CONCLUSION

摘要

背景

结果

结论

在 3 个生物医学基准数据集上的实验结果表明，BioEGRE 具有有效性和泛化能力，表明语言拓扑结构和图形指针神经网络层可以显著提高 BioRE 任务的性能。

BioEGRE：一种基于 BioELECTRA 和图指针神经网络的生物医学关系抽取的语言拓扑增强方法。

BioEGRE: a linguistic topology enhanced method for biomedical relation extraction based on BioELECTRA and graph pointer neural network.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

BioEGRE：一种基于 BioELECTRA 和图指针神经网络的生物医学关系抽取的语言拓扑增强方法。

BioEGRE: a linguistic topology enhanced method for biomedical relation extraction based on BioELECTRA and graph pointer neural network.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献