• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于依存图和译码器增强型变压器模型的生物医学关系抽取

Biomedical Relation Extraction Using Dependency Graph and Decoder-Enhanced Transformer Model.

作者信息

Kim Seonho, Yoon Juntae, Kwon Ohyoung

机构信息

Department of Computer Science and Engineering, Sogang University, Seoul 04107, Republic of Korea.

VAIV Company, Seoul 04107, Republic of Korea.

出版信息

Bioengineering (Basel). 2023 May 12;10(5):586. doi: 10.3390/bioengineering10050586.

DOI:10.3390/bioengineering10050586
PMID:37237656
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10215465/
Abstract

The identification of drug-drug and chemical-protein interactions is essential for understanding unpredictable changes in the pharmacological effects of drugs and mechanisms of diseases and developing therapeutic drugs. In this study, we extract drug-related interactions from the DDI (Drug-Drug Interaction) Extraction-2013 Shared Task dataset and the BioCreative ChemProt (Chemical-Protein) dataset using various transfer transformers. We propose BERT that uses a graph attention network (GAT) to take into account the local structure of sentences and embedding features of nodes under the self-attention scheme and investigate whether incorporating syntactic structure can help relation extraction. In addition, we suggest T5, which adapts the autoregressive generation task of the T5 (text-to-text transfer transformer) to the relation classification problem by removing the self-attention layer in the decoder block. Furthermore, we evaluated the potential of biomedical relation extraction of GPT-3 (Generative Pre-trained Transformer) using GPT-3 variant models. As a result, T5, which is a model with a tailored decoder designed for classification problems within the T5 architecture, demonstrated very promising performances for both tasks. We achieved an accuracy of 91.15% in the DDI dataset and an accuracy of 94.29% for the CPR (Chemical-Protein Relation) class group in ChemProt dataset. However, BERT did not show a significant performance improvement in the aspect of relation extraction. We demonstrated that transformer-based approaches focused only on relationships between words are implicitly eligible to understand language well without additional knowledge such as structural information.

摘要

药物与药物以及化学物质与蛋白质相互作用的识别对于理解药物药理作用的不可预测变化、疾病机制以及开发治疗药物至关重要。在本研究中,我们使用各种迁移变换器从药物 - 药物相互作用(DDI)提取2013共享任务数据集和生物创意化学蛋白质(化学物质 - 蛋白质)数据集中提取与药物相关的相互作用。我们提出了BERT,它在自注意力机制下使用图注意力网络(GAT)来考虑句子的局部结构和节点的嵌入特征,并研究纳入句法结构是否有助于关系提取。此外,我们提出了T5,它通过去除解码器块中的自注意力层,将T5(文本到文本迁移变换器)的自回归生成任务应用于关系分类问题。此外,我们使用GPT - 3变体模型评估了GPT - 3(生成式预训练变换器)在生物医学关系提取方面的潜力。结果,T5作为一种在T5架构内为分类问题设计了定制解码器的模型,在这两项任务中都表现出非常有前景的性能。我们在DDI数据集中达到了91.15%的准确率,在化学蛋白质数据集中的化学物质 - 蛋白质关系(CPR)类组中达到了94.29%的准确率。然而,BERT在关系提取方面并没有表现出显著的性能提升。我们证明了仅关注单词之间关系的基于变换器的方法在没有诸如结构信息等额外知识的情况下,隐含地有能力很好地理解语言。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/ea5485eda355/bioengineering-10-00586-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/7089cdf167c3/bioengineering-10-00586-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/2769af4e547f/bioengineering-10-00586-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/f6f3bc0effa7/bioengineering-10-00586-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/fd2616220132/bioengineering-10-00586-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/15d7d4ba35ee/bioengineering-10-00586-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/30ebb477ad75/bioengineering-10-00586-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/ea5485eda355/bioengineering-10-00586-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/7089cdf167c3/bioengineering-10-00586-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/2769af4e547f/bioengineering-10-00586-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/f6f3bc0effa7/bioengineering-10-00586-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/fd2616220132/bioengineering-10-00586-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/15d7d4ba35ee/bioengineering-10-00586-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/30ebb477ad75/bioengineering-10-00586-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5045/10215465/ea5485eda355/bioengineering-10-00586-g008.jpg

相似文献

1
Biomedical Relation Extraction Using Dependency Graph and Decoder-Enhanced Transformer Model.基于依存图和译码器增强型变压器模型的生物医学关系抽取
Bioengineering (Basel). 2023 May 12;10(5):586. doi: 10.3390/bioengineering10050586.
2
Extracting biomedical relation from cross-sentence text using syntactic dependency graph attention network.基于句法依存图注意力网络的跨句文本生物医学关系抽取
J Biomed Inform. 2023 Aug;144:104445. doi: 10.1016/j.jbi.2023.104445. Epub 2023 Jul 17.
3
VAIV bio-discovery service using transformer model and retrieval augmented generation.基于 Transformer 模型和检索增强生成的 VAIV 生物发现服务。
BMC Bioinformatics. 2024 Aug 21;25(1):273. doi: 10.1186/s12859-024-05903-6.
4
A Graph Convolutional Network-Based Method for Chemical-Protein Interaction Extraction: Algorithm Development.一种基于图卷积网络的化学-蛋白质相互作用提取方法:算法开发
JMIR Med Inform. 2020 May 19;8(5):e17643. doi: 10.2196/17643.
5
Enhancing Biomedical Relation Extraction with Transformer Models using Shortest Dependency Path Features and Triplet Information.利用最短依赖路径特征和三元组信息增强基于 Transformer 的生物医学关系抽取
J Biomed Inform. 2021 Oct;122:103893. doi: 10.1016/j.jbi.2021.103893. Epub 2021 Sep 2.
6
BERT-GT: cross-sentence n-ary relation extraction with BERT and Graph Transformer.BERT-GT:使用BERT和图变换器进行跨句子n元关系提取
Bioinformatics. 2021 Apr 5;36(24):5678-5685. doi: 10.1093/bioinformatics/btaa1087.
7
Generative Pre-trained Transformer (GPT) based model with relative attention for de novo drug design.基于生成式预训练转换器(GPT)的相对注意力模型在从头设计药物中的应用。
Comput Biol Chem. 2023 Oct;106:107911. doi: 10.1016/j.compbiolchem.2023.107911. Epub 2023 Jun 29.
8
An attentive joint model with transformer-based weighted graph convolutional network for extracting adverse drug event relation.基于注意力机制的联合模型与基于转换器的加权图卷积网络在提取药物不良反应关系中的应用。
J Biomed Inform. 2022 Jan;125:103968. doi: 10.1016/j.jbi.2021.103968. Epub 2021 Dec 4.
9
BioEGRE: a linguistic topology enhanced method for biomedical relation extraction based on BioELECTRA and graph pointer neural network.BioEGRE:一种基于 BioELECTRA 和图指针神经网络的生物医学关系抽取的语言拓扑增强方法。
BMC Bioinformatics. 2023 Dec 19;24(1):486. doi: 10.1186/s12859-023-05601-9.
10
Relation Extraction in Biomedical Texts Based on Multi-Head Attention Model With Syntactic Dependency Feature: Modeling Study.基于具有句法依存特征的多头注意力模型的生物医学文本关系抽取:建模研究
JMIR Med Inform. 2022 Oct 20;10(10):e41136. doi: 10.2196/41136.

引用本文的文献

1
VAIV bio-discovery service using transformer model and retrieval augmented generation.基于 Transformer 模型和检索增强生成的 VAIV 生物发现服务。
BMC Bioinformatics. 2024 Aug 21;25(1):273. doi: 10.1186/s12859-024-05903-6.
2
Enhanced Data Mining and Visualization of Sensory-Graph-Modeled Datasets through Summarization.通过汇总增强感官图模型数据集的数据挖掘与可视化
Sensors (Basel). 2024 Jul 14;24(14):4554. doi: 10.3390/s24144554.
3
Enhancing Health and Public Health through Machine Learning: Decision Support for Smarter Choices.

本文引用的文献

1
Integrating heterogeneous knowledge graphs into drug-drug interaction extraction from the literature.将异质知识图谱整合到文献中的药物-药物相互作用提取中。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac754.
2
Comparative Toxicogenomics Database (CTD): update 2023.比较毒理学基因组数据库(CTD):2023 年更新。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1257-D1262. doi: 10.1093/nar/gkac833.
3
Enhancing Biomedical Relation Extraction with Transformer Models using Shortest Dependency Path Features and Triplet Information.
通过机器学习促进健康与公共卫生:为更明智的选择提供决策支持。
Bioengineering (Basel). 2023 Jul 2;10(7):792. doi: 10.3390/bioengineering10070792.
利用最短依赖路径特征和三元组信息增强基于 Transformer 的生物医学关系抽取
J Biomed Inform. 2021 Oct;122:103893. doi: 10.1016/j.jbi.2021.103893. Epub 2021 Sep 2.
4
Drug-Drug Interaction Extraction via Recurrent Hybrid Convolutional Neural Networks with an Improved Focal Loss.基于改进焦点损失的循环混合卷积神经网络的药物-药物相互作用提取
Entropy (Basel). 2019 Jan 8;21(1):37. doi: 10.3390/e21010037.
5
Extracting drug-drug interactions from texts with BioBERT and multiple entity-aware attentions.利用BioBERT和多种实体感知注意力从文本中提取药物-药物相互作用。
J Biomed Inform. 2020 Jun;106:103451. doi: 10.1016/j.jbi.2020.103451. Epub 2020 May 23.
6
Keep up with the latest coronavirus research.跟上冠状病毒的最新研究进展。
Nature. 2020 Mar;579(7798):193. doi: 10.1038/d41586-020-00694-1.
7
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
8
Drug-drug interaction extraction from biomedical texts using long short-term memory network.基于长短时记忆网络的生物医学文献中药物-药物相互作用提取
J Biomed Inform. 2018 Oct;86:15-24. doi: 10.1016/j.jbi.2018.08.005. Epub 2018 Aug 21.
9
Drug-Drug Interaction Extraction via Convolutional Neural Networks.通过卷积神经网络进行药物-药物相互作用提取
Comput Math Methods Med. 2016;2016:6918381. doi: 10.1155/2016/6918381. Epub 2016 Jan 31.
10
IUPHAR-DB: the IUPHAR database of G protein-coupled receptors and ion channels.IUPHAR-DB:IUPHAR G蛋白偶联受体和离子通道数据库。
Nucleic Acids Res. 2009 Jan;37(Database issue):D680-5. doi: 10.1093/nar/gkn728. Epub 2008 Oct 23.