Suppr超能文献

基于改进标注规则和双向长短期记忆网络-条件随机场的电子病历实体关系抽取

Entity relation extraction from electronic medical records based on improved annotation rules and BiLSTM-CRF.

作者信息

Chen Tingyin, Hu Yongmei

机构信息

Department of Network and Information, Xiangya Hospital, Central South University, Changsha, China.

Mobile Health Ministry of Education-China Mobile Joint Laboratory, Changsha, China.

出版信息

Ann Transl Med. 2021 Sep;9(18):1415. doi: 10.21037/atm-21-3828.

Abstract

BACKGROUND

Extracting entities and their relationships from electronic medical records (EMRs) is an important research direction in the development of medical informatization. Recently, a method was proposed to transform entity relation extraction into entity recognition by using annotation rules, and then solve the problem of relation extraction by an entity recognition model. However, this method cannot deal with one-to-many entity relationship problems.

METHODS

This paper combined the bidirectional long- and short-term memory-conditional random field (BiLSTM-CRF) deep learning model with an improvement of sequence annotation rules, hided relationships between entities in entity labels, then the problem of one-to-many named entity relation extraction in EMRs was transformed into entity recognition based on relation sets, and entity extraction was carried out through the entity recognition model.

RESULTS

Entity extraction was achieved through the entity recognition model. The result of entity recognition was transformed into the corresponding entity relationship, thus completing the task of one-to-many entity relation extraction by the improved annotation rules, the accuracy rate of proposed method reaches 83.46%, the recall rate is 81.12%, and the value of comprehensive index F1 is 0.8227.

CONCLUSIONS

Through the annotation analysis of EMRs, our experimental results show that the improved annotation rules can effectively complete the task of one-to-many medical entity relation extraction from EMRs.

摘要

背景

从电子病历(EMR)中提取实体及其关系是医学信息化发展中的一个重要研究方向。最近,有人提出一种方法,通过使用注释规则将实体关系提取转化为实体识别,然后通过实体识别模型解决关系提取问题。然而,这种方法无法处理一对多的实体关系问题。

方法

本文将双向长短期记忆条件随机场(BiLSTM-CRF)深度学习模型与序列注释规则的改进相结合,在实体标签中隐藏实体之间的关系,从而将电子病历中一对多命名实体关系提取问题转化为基于关系集的实体识别,并通过实体识别模型进行实体提取。

结果

通过实体识别模型实现了实体提取。将实体识别结果转化为相应的实体关系,从而通过改进的注释规则完成一对多实体关系提取任务,所提方法的准确率达到83.46%,召回率为81.12%,综合指标F1值为0.8227。

结论

通过对电子病历的注释分析,我们的实验结果表明,改进的注释规则能够有效完成从电子病历中提取一对多医学实体关系的任务。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09ee/8506757/5c9468a9d1e0/atm-09-18-1415-f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验