van der Vegt Anton H, Zuccon Guido, Koopman Bevan
The University of Queensland, St Lucia, Qld, Australia.
AMIA Annu Symp Proc. 2020 Mar 4;2019:1216-1225. eCollection 2019.
Relationships between disorders and their associated tests, treatments and symptoms underpin essential information needs of clinicians and can support biomedical knowledge bases, information retrieval and ultimately clinical decision support. These relationships exist in the biomedical literature, however they are not directly available and have to be extracted from the text. Existing, automated biomedical relationship extraction methods tend to be narrow in scope, e.g., protein-protein interactions, and pertain to intra-sentence relationships. The proposed approach targets intra and inter-sentence, disorder-centric relationship extraction. It employs an LSTM deep learning model that utilises a novel, sequential feature set, including medical concept embeddings. The LSTM model outperforms rule based and co-occurrence models by at least +78% in F1 score, suggesting that inter-sentence relationships are an important subset of all disorder-centric relations and that our approach shows promise for inter-sentence relationship extraction in this and possibly other domains.
疾病与其相关检测、治疗方法及症状之间的关系是临床医生基本信息需求的基础,并且能够支持生物医学知识库、信息检索以及最终的临床决策支持。这些关系存在于生物医学文献中,然而它们并非直接可得,必须从文本中提取。现有的自动化生物医学关系提取方法往往范围较窄,例如蛋白质 - 蛋白质相互作用,并且涉及句内关系。所提出的方法针对以疾病为中心的句内和句间关系提取。它采用了一种长短期记忆(LSTM)深度学习模型,该模型利用了一种新颖的顺序特征集,包括医学概念嵌入。LSTM模型在F1分数上比基于规则和共现的模型至少高出78%,这表明句间关系是所有以疾病为中心的关系中的一个重要子集,并且我们的方法在这个领域以及可能的其他领域中显示出句间关系提取的潜力。