Suppr超能文献

利用BioBERT和多种实体感知注意力从文本中提取药物-药物相互作用。

Extracting drug-drug interactions from texts with BioBERT and multiple entity-aware attentions.

作者信息

Zhu Yu, Li Lishuang, Lu Hongbin, Zhou Anqiao, Qin Xueyang

机构信息

School of Computer Science and Technology, Dalian University of Technology, 116024 Dalian, China.

出版信息

J Biomed Inform. 2020 Jun;106:103451. doi: 10.1016/j.jbi.2020.103451. Epub 2020 May 23.

Abstract

Drug-drug interactions (DDIs) extraction is one of the important tasks in the field of biomedical relation extraction, which plays an important role in the field of pharmacovigilance. Previous neural network based models have achieved good performance in DDIs extraction. However, most of the previous models did not make good use of the information of drug entity names, which can help to judge the relation between drugs. This is mainly because drug names are often very complex, leading to the fact that neural network models cannot understand their semantics directly. To address this issue, we propose a DDIs extraction model using multiple entity-aware attentions with various entity information. We use an output-modified bidirectional transformer (BioBERT) and a bidirectional gated recurrent unit layer (BiGRU) to obtain the vector representation of sentences. The vectors of drug description documents encoded by Doc2Vec are used as drug description information, which is an external knowledge to our model. Then we construct three different kinds of entity-aware attentions to get the sentence representations with entity information weighted, including attentions using the drug description information. The outputs of attention layers are concatenated and fed into a multi-layer perception layer. Finally, we get the result by a softmax classifier. The F-score is used to evaluate our model, which is also adopted by most previous DDIs extraction models. We evaluate our proposed model on the DDIExtraction 2013 corpus, which is the benchmark corpus of this domain, and achieves the state-of-the-art result (80.9% in F-score).

摘要

药物-药物相互作用(DDIs)提取是生物医学关系提取领域的重要任务之一,在药物警戒领域发挥着重要作用。以往基于神经网络的模型在DDIs提取方面取得了良好的性能。然而,大多数先前的模型没有充分利用药物实体名称的信息,而这些信息有助于判断药物之间的关系。这主要是因为药物名称通常非常复杂,导致神经网络模型无法直接理解其语义。为了解决这个问题,我们提出了一种使用多种实体感知注意力和各种实体信息的DDIs提取模型。我们使用输出修正的双向变换器(BioBERT)和双向门控循环单元层(BiGRU)来获得句子的向量表示。由Doc2Vec编码的药物描述文档的向量用作药物描述信息,这是我们模型的外部知识。然后我们构建三种不同类型的实体感知注意力,以获得加权实体信息的句子表示,包括使用药物描述信息的注意力。注意力层的输出被连接起来并输入到一个多层感知器层。最后,我们通过一个softmax分类器得到结果。F分数用于评估我们的模型,大多数先前的DDIs提取模型也采用了该指标。我们在DDIExtraction 2013语料库上评估我们提出的模型,该语料库是该领域的基准语料库,并取得了当前最优的结果(F分数为80.9%)。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验