• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CLART:一种用于中文医学命名实体识别的级联格与激进变压器网络。

CLART: A cascaded lattice-and-radical transformer network for Chinese medical named entity recognition.

作者信息

Xiao Yinlong, Ji Zongcheng, Li Jianqiang, Zhu Qing

机构信息

Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China.

PAII Inc., CA 94087, United States of America.

出版信息

Heliyon. 2023 Oct 10;9(10):e20692. doi: 10.1016/j.heliyon.2023.e20692. eCollection 2023 Oct.

DOI:10.1016/j.heliyon.2023.e20692
PMID:37876457
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10590790/
Abstract

Chinese medical named entity recognition (NER) is a fundamental task in Chinese medical natural language processing, aiming to recognize Chinese medical entities within unstructured medical texts. However, it poses significant challenges mainly due to the extensive usage of medical terms in Chinese medical texts. Although previous studies have made attempts to incorporate lexical or radical knowledge in order to improve the comprehension of medical texts, these studies either focus solely on one of these aspects or utilize a basic concatenation operation to combine these features, which fails to fully utilize the potential of lexical and radical knowledge. In this paper, we propose a novel Cascaded LAttice-and-Radical Transformer (CLART) network to exploit both lexical and radical information for Chinese medical NER. Specifically, given a sentence, a medical lexicon, and a radical dictionary, we first construct a flat lattice (, character-word sequence) for the sentence and radical components of each Chinese character through word matching and radical parsing, respectively. We then employ a lattice Transformer module to capture the dense interactions between characters and matched words, facilitating the enhanced utilization of lexical knowledge. Subsequently, we design a radical Transformer module to model the dense interactions between the lattice and radical features, facilitating better fusion of the lexical and radical knowledge. Finally, we feed the updated lattice-and-radical-aware character representations into a Conditional Random Fields (CRF) decoder to obtain the predicted labels. Experimental results conducted on two publicly available Chinese medical NER datasets show the effectiveness of the proposed method.

摘要

中文医学命名实体识别(NER)是中文医学自然语言处理中的一项基础任务,旨在识别非结构化医学文本中的中文医学实体。然而,它带来了重大挑战,主要原因是中文医学文本中医学术语的广泛使用。尽管先前的研究已尝试纳入词汇或部首知识以提高对医学文本的理解,但这些研究要么仅关注其中一个方面,要么使用基本的拼接操作来组合这些特征,这未能充分利用词汇和部首知识的潜力。在本文中,我们提出了一种新颖的级联格与部首变换器(CLART)网络,以利用词汇和部首信息进行中文医学NER。具体而言,给定一个句子、一个医学词汇表和一个部首词典,我们首先分别通过词匹配和部首解析为句子和每个汉字的部首成分构建一个扁平格(字符-词序列)。然后,我们使用格变换器模块来捕捉字符与匹配词之间的密集交互,促进词汇知识的增强利用。随后,我们设计一个部首变换器模块来对格与部首特征之间的密集交互进行建模,促进词汇和部首知识的更好融合。最后,我们将更新后的格与部首感知字符表示输入到条件随机场(CRF)解码器中以获得预测标签。在两个公开可用的中文医学NER数据集上进行的实验结果表明了所提方法的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875f/10590790/7f7688d29f86/gr006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875f/10590790/c801dde02ef2/gr001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875f/10590790/f1f865412804/gr002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875f/10590790/1f784cc32f3d/gr003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875f/10590790/b61d9db6452e/gr004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875f/10590790/8c3586de1a95/gr005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875f/10590790/7f7688d29f86/gr006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875f/10590790/c801dde02ef2/gr001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875f/10590790/f1f865412804/gr002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875f/10590790/1f784cc32f3d/gr003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875f/10590790/b61d9db6452e/gr004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875f/10590790/8c3586de1a95/gr005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/875f/10590790/7f7688d29f86/gr006.jpg

相似文献

1
CLART: A cascaded lattice-and-radical transformer network for Chinese medical named entity recognition.CLART:一种用于中文医学命名实体识别的级联格与激进变压器网络。
Heliyon. 2023 Oct 10;9(10):e20692. doi: 10.1016/j.heliyon.2023.e20692. eCollection 2023 Oct.
2
A multi-layer soft lattice based model for Chinese clinical named entity recognition.基于多层软晶格的中文临床命名实体识别模型。
BMC Med Inform Decis Mak. 2022 Jul 30;22(1):201. doi: 10.1186/s12911-022-01924-4.
3
Kcr-FLAT: A Chinese-Named Entity Recognition Model with Enhanced Semantic Information.Kcr-FLAT:一种具有增强语义信息的中文命名实体识别模型。
Sensors (Basel). 2023 Feb 4;23(4):1771. doi: 10.3390/s23041771.
4
Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.从中文电子病历中提取垂体腺瘤的临床命名实体。
BMC Med Inform Decis Mak. 2022 Mar 23;22(1):72. doi: 10.1186/s12911-022-01810-z.
5
A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records.基于词性和自匹配注意力的深度学习模型在中文电子病历命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):65. doi: 10.1186/s12911-019-0762-7.
6
Chinese medical entity recognition based on the dual-branch TENER model.基于双分支 TENER 模型的中文医疗实体识别。
BMC Med Inform Decis Mak. 2023 Jul 24;23(1):136. doi: 10.1186/s12911-023-02243-y.
7
Chinese Clinical Named Entity Recognition in Electronic Medical Records: Development of a Lattice Long Short-Term Memory Model With Contextualized Character Representations.电子病历中的中文临床命名实体识别:基于上下文特征表示的格长短期记忆模型的开发
JMIR Med Inform. 2020 Sep 4;8(9):e19848. doi: 10.2196/19848.
8
Multi-level semantic fusion network for Chinese medical named entity recognition.用于中文医学命名实体识别的多层次语义融合网络
J Biomed Inform. 2022 Sep;133:104144. doi: 10.1016/j.jbi.2022.104144. Epub 2022 Jul 22.
9
Chinese clinical named entity recognition with radical-level feature and self-attention mechanism.基于词干级特征和自注意力机制的中文临床命名实体识别。
J Biomed Inform. 2019 Oct;98:103289. doi: 10.1016/j.jbi.2019.103289. Epub 2019 Sep 18.
10
Leveraging Multi-source knowledge for Chinese clinical named entity recognition via relational graph convolutional network.基于关系图卷积网络的多源知识融合的中文临床命名实体识别。
J Biomed Inform. 2022 Apr;128:104035. doi: 10.1016/j.jbi.2022.104035. Epub 2022 Feb 23.

本文引用的文献

1
Multi-level semantic fusion network for Chinese medical named entity recognition.用于中文医学命名实体识别的多层次语义融合网络
J Biomed Inform. 2022 Sep;133:104144. doi: 10.1016/j.jbi.2022.104144. Epub 2022 Jul 22.
2
Leveraging Multi-source knowledge for Chinese clinical named entity recognition via relational graph convolutional network.基于关系图卷积网络的多源知识融合的中文临床命名实体识别。
J Biomed Inform. 2022 Apr;128:104035. doi: 10.1016/j.jbi.2022.104035. Epub 2022 Feb 23.
3
A self-attention based neural architecture for Chinese medical named entity recognition.
一种基于自注意力机制的中文医学命名实体识别神经架构。
Math Biosci Eng. 2020 May 9;17(4):3498-3511. doi: 10.3934/mbe.2020197.
4
Chinese clinical named entity recognition with variant neural structures based on BERT methods.基于 BERT 方法的中文临床命名实体识别与变体神经结构。
J Biomed Inform. 2020 Jul;107:103422. doi: 10.1016/j.jbi.2020.103422. Epub 2020 Apr 28.
5
Adversarial training based lattice LSTM for Chinese clinical named entity recognition.基于对抗训练的格 lattice LSTM 进行中文临床命名实体识别。
J Biomed Inform. 2019 Nov;99:103290. doi: 10.1016/j.jbi.2019.103290. Epub 2019 Sep 23.
6
Incorporating dictionaries into deep neural networks for the Chinese clinical named entity recognition.将词典纳入深度神经网络以进行中文临床命名实体识别。
J Biomed Inform. 2019 Apr;92:103133. doi: 10.1016/j.jbi.2019.103133. Epub 2019 Feb 25.
7
Clinical Named Entity Recognition Using Deep Learning Models.使用深度学习模型的临床命名实体识别
AMIA Annu Symp Proc. 2018 Apr 16;2017:1812-1819. eCollection 2017.
8
A hybrid model based on neural networks for biomedical relation extraction.基于神经网络的生物医学关系抽取混合模型。
J Biomed Inform. 2018 May;81:83-92. doi: 10.1016/j.jbi.2018.03.011. Epub 2018 Mar 27.
9
Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition.用于健康领域命名实体识别的具有专用词嵌入的递归神经网络。
J Biomed Inform. 2017 Dec;76:102-109. doi: 10.1016/j.jbi.2017.11.007. Epub 2017 Nov 13.
10
Entity recognition from clinical texts via recurrent neural network.基于循环神经网络的临床文本实体识别。
BMC Med Inform Decis Mak. 2017 Jul 5;17(Suppl 2):67. doi: 10.1186/s12911-017-0468-7.