School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China.
School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA.
BMC Med Inform Decis Mak. 2019 Jan 31;19(Suppl 1):22. doi: 10.1186/s12911-019-0736-9.
Extracting relations between important clinical entities is critical but very challenging for natural language processing (NLP) in the medical domain. Researchers have applied deep learning-based approaches to clinical relation extraction; but most of them consider sentence sequence only, without modeling syntactic structures. The aim of this study was to utilize a deep neural network to capture the syntactic features and further improve the performances of relation extraction in clinical notes.
We propose a novel neural approach to model shortest dependency path (SDP) between target entities together with the sentence sequence for clinical relation extraction. Our neural network architecture consists of three modules: (1) sentence sequence representation module using bidirectional long short-term memory network (Bi-LSTM) to capture the features in the sentence sequence; (2) SDP representation module implementing the convolutional neural network (CNN) and Bi-LSTM network to capture the syntactic context for target entities using SDP information; and (3) classification module utilizing a fully-connected layer with Softmax function to classify the relation type between target entities.
Using the 2010 i2b2/VA relation extraction dataset, we compared our approach with other baseline methods. Our experimental results show that the proposed approach achieved significant improvements over comparable existing methods, demonstrating the effectiveness of utilizing syntactic structures in deep learning-based relation extraction. The F-measure of our method reaches 74.34% which is 2.5% higher than the method without using syntactic features.
We propose a new neural network architecture by modeling SDP along with sentence sequence to extract multi-relations from clinical text. Our experimental results show that the proposed approach significantly improve the performances on clinical notes, demonstrating the effectiveness of syntactic structures in deep learning-based relation extraction.
从自然语言处理(NLP)的角度来看,提取医学领域重要临床实体之间的关系至关重要,但极具挑战性。研究人员已经将深度学习方法应用于临床关系提取,但其中大多数方法只考虑句子序列,而没有对语法结构进行建模。本研究旨在利用深度神经网络捕捉语法特征,进一步提高临床记录中关系提取的性能。
我们提出了一种新的神经网络方法,用于建模目标实体之间的最短依赖路径(SDP)和句子序列,以进行临床关系提取。我们的神经网络架构由三个模块组成:(1)使用双向长短期记忆网络(Bi-LSTM)的句子序列表示模块,用于捕获句子序列中的特征;(2)SDP 表示模块,实现卷积神经网络(CNN)和 Bi-LSTM 网络,使用 SDP 信息捕获目标实体的语法上下文;(3)分类模块,利用全连接层和 Softmax 函数对目标实体之间的关系类型进行分类。
在使用 2010 年 i2b2/VA 关系提取数据集的实验中,我们将我们的方法与其他基线方法进行了比较。实验结果表明,与其他可比方法相比,我们的方法取得了显著的改进,证明了在基于深度学习的关系提取中利用语法结构的有效性。我们的方法的 F 度量达到 74.34%,比不使用语法特征的方法高 2.5%。
我们提出了一种新的神经网络架构,通过对 SDP 与句子序列进行建模,从临床文本中提取多关系。实验结果表明,我们的方法显著提高了临床记录的性能,证明了语法结构在基于深度学习的关系提取中的有效性。