临床电子病历中实体关系联合抽取的建模。

Modeling of joint extraction of entity relationships in clinical electronic medical records.

机构信息

School of computer science and technology, Zhejiang Sci-Tech University, Hangzhou 310018, China.

School of Information Science and Engineering, Zhejiang Sci-Tech University, Hangzhou 310018, China.

出版信息

Comput Biol Med. 2024 Nov;182:109161. doi: 10.1016/j.compbiomed.2024.109161. Epub 2024 Sep 18.

DOI:10.1016/j.compbiomed.2024.109161

PMID:39298887

Abstract

The advancement of medical informatization necessitates extracting entities and their relationships from electronic medical records. Presently, research on electronic medical records predominantly concentrates on single-entity relationship extraction. However, clinical electronic medical records frequently exhibit overlapping complex entity relationships, thereby heightening the challenge of information extraction. To rectify the absence of a clinical medical relationship extraction dataset, this study utilizes electronic medical records from 584 patients in a hospital to create a compact clinical medical relationship extraction dataset. To address the pipelined relationship extraction model's limitation in overlooking the one-to-many correlation problem between entities and relationships, this paper introduces a cascading relationship extraction model. This model integrates the MacBERT pre-training model, gated recurrent network, and multi-head self-attention mechanism to enhance the extraction of text features. Simultaneously, adversarial learning is incorporated to bolster the model's robustness. In scenarios involving one-to-many relationships between entities, a two-phase task is employed. Initially, the main entity is predicted, followed by predicting the associated object and their correspondences. Employing this cascade-structured approach enables the model to flexibly manage intricate entity relationships, thereby enhancing extraction accuracy. Experimental results demonstrate the model's efficiency, yielding F1-scores of 82.8%, 76.8%, and 88.2% for fulfilling relational extraction requirements and tasks on DuIE, CHIP-CDEE, and private datasets, respectively. These scores represent improvements over the benchmark model. The findings indicate the model's applicability in practical domains, particularly in tasks such as biomedical information extraction.

摘要

医疗信息化的发展需要从电子病历中提取实体及其关系。目前，电子病历的研究主要集中在单一实体关系的提取上。然而，临床电子病历经常表现出重叠的复杂实体关系，从而增加了信息提取的难度。为了解决临床医学关系提取数据集的缺乏，本研究利用来自一家医院的 584 名患者的电子病历创建了一个紧凑的临床医学关系提取数据集。为了解决流水线关系提取模型忽略实体和关系之间一对一多相关问题的局限性，本文引入了级联关系提取模型。该模型集成了 MacBERT 预训练模型、门控循环网络和多头自注意力机制，以增强对文本特征的提取。同时，引入对抗学习来增强模型的鲁棒性。在实体之间存在一对多关系的情况下，采用两阶段任务。首先，预测主要实体，然后预测相关对象及其对应关系。使用这种级联结构方法，模型可以灵活地管理复杂的实体关系，从而提高提取准确性。实验结果表明，该模型的效率很高，在 DuIE、CHIP-CDEE 和私有数据集上的关系提取要求和任务的 F1 分数分别为 82.8%、76.8%和 88.2%，优于基准模型。这些结果表明该模型在实际领域中的适用性，特别是在生物医学信息提取等任务中。

相似文献

Modeling of joint extraction of entity relationships in clinical electronic medical records.临床电子病历中实体关系联合抽取的建模。

Comput Biol Med. 2024 Nov;182:109161. doi: 10.1016/j.compbiomed.2024.109161. Epub 2024 Sep 18.

BAMRE: Joint extraction model of Chinese medical entities and relations based on Biaffine transformation with relation attention.基于关系注意力的双线性变换的中文医疗实体和关系联合抽取模型。

J Biomed Inform. 2024 Oct;158:104733. doi: 10.1016/j.jbi.2024.104733. Epub 2024 Oct 3.

Extracting entities with attributes in clinical text via joint deep learning.通过联合深度学习从临床文本中提取具有属性的实体。

J Am Med Inform Assoc. 2019 Dec 1;26(12):1584-1591. doi: 10.1093/jamia/ocz158.

PRTA:Joint extraction of medical nested entities and overlapping relation via parameter sharing progressive recognition and targeted assignment decoding scheme.PRTA：通过参数共享递进式识别和目标分配解码方案联合提取医学嵌套实体和重叠关系。

Comput Biol Med. 2024 Jun;176:108539. doi: 10.1016/j.compbiomed.2024.108539. Epub 2024 Apr 29.

Application of Entity-BERT model based on neuroscience and brain-like cognition in electronic medical record entity recognition.基于神经科学和类脑认知的实体BERT模型在电子病历实体识别中的应用

Front Neurosci. 2023 Sep 20;17:1259652. doi: 10.3389/fnins.2023.1259652. eCollection 2023.

Improving the Named Entity Recognition of Chinese Electronic Medical Records by Combining Domain Dictionary and Rules.通过结合领域字典和规则来提高中文电子病历的命名实体识别。

Int J Environ Res Public Health. 2020 Apr 14;17(8):2687. doi: 10.3390/ijerph17082687.

A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records.基于词性和自匹配注意力的深度学习模型在中文电子病历命名实体识别中的应用。

BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):65. doi: 10.1186/s12911-019-0762-7.

Entity relation extraction from electronic medical records based on improved annotation rules and BiLSTM-CRF.基于改进标注规则和双向长短期记忆网络-条件随机场的电子病历实体关系抽取

Ann Transl Med. 2021 Sep;9(18):1415. doi: 10.21037/atm-21-3828.

RTJTN: Relational Triplet Joint Tagging Network for Joint Entity and Relation Extraction.RTJTN：关系三元组联合标注网络，用于联合实体和关系抽取。

Comput Intell Neurosci. 2021 Oct 16;2021:3447473. doi: 10.1155/2021/3447473. eCollection 2021.

Multi-head CRF classifier for biomedical multi-class named entity recognition on Spanish clinical notes.基于多头条件随机场分类器的西班牙语临床文档中生物医学多类命名实体识别。

Database (Oxford). 2024 Jul 30;2024. doi: 10.1093/database/baae068.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

临床电子病历中实体关系联合抽取的建模。

Modeling of joint extraction of entity relationships in clinical electronic medical records.

机构信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献