• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Kcr-FLAT:一种具有增强语义信息的中文命名实体识别模型。

Kcr-FLAT: A Chinese-Named Entity Recognition Model with Enhanced Semantic Information.

机构信息

Guangxi Key Laboratory of Images and Graphics Intelligent Processing, Guilin University of Electronic Technology, Guilin 541004, China.

Nanning Research Institute, Guilin University of Electronic Technology, Guilin 541004, China.

出版信息

Sensors (Basel). 2023 Feb 4;23(4):1771. doi: 10.3390/s23041771.

DOI:10.3390/s23041771
PMID:36850367
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9961421/
Abstract

The performance of Chinese-named entity recognition (NER) has improved via word enhancement or new frameworks that incorporate various types of external data. However, for Chinese NER, syntactic composition (in sentence level) and inner regularity (in character-level) have rarely been studied. Chinese characters are highly sensitive to sentential syntactic data. The same Chinese character sequence can be decomposed into different combinations of words according to how they are used and placed in the context. In addition, the same type of entities usually have the same naming rules due to the specificity of the Chinese language structure. This paper presents a Kcr-FLAT to improve the performance of Chinese NER with enhanced semantic information. Specifically, we first extract different types of syntactic data, functionalize the syntactic information by a key-value memory network (KVMN), and fuse them by attention mechanism. Then the syntactic information and lexical information are integrated by a cross-transformer. Finally, we use an inner regularity perception module to capture the internal regularity of each entity for better entity type prediction. The experimental results show that with F1 scores as the evaluation index, the proposed model obtains 96.51%, 96.81%, and 70.12% accuracy rates on MSRA, resume, and Weibo datasets, respectively.

摘要

通过词增强或包含各种类型外部数据的新框架,中文命名实体识别(NER)的性能得到了提高。然而,对于中文 NER,句子级别的句法构成和字符级别的内在规律很少被研究。汉字对句子级别的句法数据非常敏感。根据在上下文中的使用和位置,相同的汉字序列可以分解成不同的词组合。此外,由于汉语结构的特殊性,同一类型的实体通常具有相同的命名规则。本文提出了一种 Kcr-FLAT 方法,通过增强语义信息来提高中文 NER 的性能。具体来说,我们首先提取不同类型的句法数据,通过键值记忆网络(KVMN)对句法信息进行功能化,并通过注意力机制对其进行融合。然后,句法信息和词汇信息通过交叉变换器进行整合。最后,我们使用内部规律感知模块来捕捉每个实体的内部规律,以更好地进行实体类型预测。实验结果表明,以 F1 分数作为评价指标,该模型在 MSRA、简历和微博数据集上的准确率分别为 96.51%、96.81%和 70.12%。

相似文献

1
Kcr-FLAT: A Chinese-Named Entity Recognition Model with Enhanced Semantic Information.Kcr-FLAT:一种具有增强语义信息的中文命名实体识别模型。
Sensors (Basel). 2023 Feb 4;23(4):1771. doi: 10.3390/s23041771.
2
CLART: A cascaded lattice-and-radical transformer network for Chinese medical named entity recognition.CLART:一种用于中文医学命名实体识别的级联格与激进变压器网络。
Heliyon. 2023 Oct 10;9(10):e20692. doi: 10.1016/j.heliyon.2023.e20692. eCollection 2023 Oct.
3
A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records.基于词性和自匹配注意力的深度学习模型在中文电子病历命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):65. doi: 10.1186/s12911-019-0762-7.
4
Multi-level semantic fusion network for Chinese medical named entity recognition.用于中文医学命名实体识别的多层次语义融合网络
J Biomed Inform. 2022 Sep;133:104144. doi: 10.1016/j.jbi.2022.104144. Epub 2022 Jul 22.
5
A multi-layer soft lattice based model for Chinese clinical named entity recognition.基于多层软晶格的中文临床命名实体识别模型。
BMC Med Inform Decis Mak. 2022 Jul 30;22(1):201. doi: 10.1186/s12911-022-01924-4.
6
ADPG: Biomedical entity recognition based on Automatic Dependency Parsing Graph.ADPG:基于自动依存句法分析图的生物医学实体识别
J Biomed Inform. 2023 Apr;140:104317. doi: 10.1016/j.jbi.2023.104317. Epub 2023 Feb 17.
7
Chinese medical entity recognition based on the dual-branch TENER model.基于双分支 TENER 模型的中文医疗实体识别。
BMC Med Inform Decis Mak. 2023 Jul 24;23(1):136. doi: 10.1186/s12911-023-02243-y.
8
BioByGANS: biomedical named entity recognition by fusing contextual and syntactic features through graph attention network in node classification framework.BioByGANS:通过图注意力网络在节点分类框架中融合上下文和句法特征进行生物医学命名实体识别。
BMC Bioinformatics. 2022 Nov 22;23(1):501. doi: 10.1186/s12859-022-05051-9.
9
Chinese clinical named entity recognition with radical-level feature and self-attention mechanism.基于词干级特征和自注意力机制的中文临床命名实体识别。
J Biomed Inform. 2019 Oct;98:103289. doi: 10.1016/j.jbi.2019.103289. Epub 2019 Sep 18.
10
A multitask bi-directional RNN model for named entity recognition on Chinese electronic medical records.一种用于中文电子病历命名实体识别的多任务双向 RNN 模型。
BMC Bioinformatics. 2018 Dec 28;19(Suppl 17):499. doi: 10.1186/s12859-018-2467-9.

本文引用的文献

1
Dual Sticky Hierarchical Dirichlet Process Hidden Markov Model and Its Application to Natural Language Description of Motions.双粘性分层狄利克雷过程隐马尔可夫模型及其在运动自然语言描述中的应用。
IEEE Trans Pattern Anal Mach Intell. 2018 Oct;40(10):2355-2373. doi: 10.1109/TPAMI.2017.2756039. Epub 2017 Sep 25.