Chinese EMR Named Entity Recognition Using Fused Label Relations Based on Machine Reading Comprehension Framework

Authors

Duan Junwen, Liu Shuyue, Liao Xincheng, Gong Feng, Yue Hailin, Wang Jianxin

Publication information

IEEE/ACM Trans Comput Biol Bioinform. 2024 Sep-Oct;21(5):1143-1153. doi: 10.1109/TCBB.2024.3376591. Epub 2024 Oct 9.

DOI:10.1109/TCBB.2024.3376591

Abstract

Chinese electronic medical records (EMRs) present significant challenges for named entity recognition (NER) due to their specialized nature, unique language features, and diverse expressions. Traditionally, NER is treated as a sequence labeling task in which each token is assigned a label. Recent research has reframed NER within the machine reading comprehension (MRC) framework, extracting entities in a question-answer format and achieving state-of-the-art performance. However, these MRC-based methods have a significant limitation: they extract entities of each type independently, ignoring the interrelations among types. To address this, we introduce the Fusion Label Relations with MRC (FLR-MRC) model, which enhances the MRC model by implicitly capturing dependencies among entity types. FLR-MRC models interrelations between labels using graph attention networks and integrates these with textual data to identify entities. On the benchmark CMeEE and CCKS2017-CNER datasets, FLR-MRC achieves F1-scores of 0.6652 and 0.9101, respectively, outperforming existing clinical NER methods.
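
To make the MRC formulation concrete, the following is a minimal sketch of how NER can be posed as question answering in general. It is not the FLR-MRC implementation: the query wording, the entity type names, the helper names `build_mrc_inputs` and `decode_spans`, and the nearest-end matching rule are all assumptions made for illustration, and the label-relation graph attention component is not shown.

```python
from typing import Dict, List, Tuple

# Hypothetical queries, one per entity type (illustrative, not from the paper).
TYPE_QUERIES: Dict[str, str] = {
    "disease": "Find the diseases mentioned in the text.",
    "symptom": "Find the symptoms mentioned in the text.",
}


def build_mrc_inputs(context: str) -> List[Tuple[str, str, str]]:
    """Pair the same context with one query per entity type.

    Each (type, query, context) triple would be scored separately by the
    reader model, which is why plain MRC-based NER treats types independently.
    """
    return [(etype, query, context) for etype, query in TYPE_QUERIES.items()]


def decode_spans(start_flags: List[int], end_flags: List[int]) -> List[Tuple[int, int]]:
    """Turn per-token start/end indicators into inclusive (start, end) spans.

    Assumed heuristic: each predicted start is matched to the nearest
    predicted end at or after it.
    """
    spans: List[Tuple[int, int]] = []
    for i, is_start in enumerate(start_flags):
        if is_start != 1:
            continue
        for j in range(i, len(end_flags)):
            if end_flags[j] == 1:
                spans.append((i, j))
                break
    return spans


# Toy example with made-up model outputs for the "disease" query.
tokens = ["patient", "has", "type", "2", "diabetes", "and", "fever"]
disease_spans = decode_spans([0, 0, 1, 0, 0, 0, 0], [0, 0, 0, 0, 1, 0, 0])
disease_mentions = [" ".join(tokens[s : e + 1]) for s, e in disease_spans]
```

In this sketch the answer for each entity type is decoded without reference to any other type, which corresponds to the independence limitation the abstract describes.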


