• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种结合循环单元和信息状态递归的中文命名实体识别模型。

A Chinese named entity recognition model incorporating recurrent cell and information state recursion.

作者信息

Han Qingbin, Ma Jialin

机构信息

School of Computer and Software Engineering, Huaiyin Institute of Technology, Huaian, 223003, Jiangsu, China.

出版信息

Sci Rep. 2024 Mar 6;14(1):5564. doi: 10.1038/s41598-024-56166-3.

DOI:10.1038/s41598-024-56166-3
PMID:38448461
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11319810/
Abstract

Chinese is characterized by high syntactic complexity, chaotic annotation granularity, and slow convergence. Joint learning models can effectively improve the accuracy of Chinese Named Entity Recognition (NER), but they focus too much on local feature information and reduce the ability of long sequence feature extraction. To address the limitations of long sequence feature extraction ability, we propose a Chinese NER model called Incorporating Recurrent Cell and Information State Recursion (IRCSR-NER). The model integrates recurrent cells and information state recursion to improve the recognition ability of long entity boundaries. To solve the problem that Chinese and English have different focuses in syntactic analysis. We use the syntactic dependency approach to add lexical relationship information to sentences represented at the word level. The IRCSR-NER is applied to sequence feature extraction to improve the model efficiency and long-text feature extraction ability. The model captures contextual long-distance dependent information while focusing on local feature information. We evaluated our proposed model using four public datasets and compared it with other mainstream models. Experimental results demonstrate that our model outperforms traditional and mainstream models.

摘要

中文具有句法复杂度高、标注粒度混乱和收敛速度慢的特点。联合学习模型可以有效提高中文命名实体识别(NER)的准确率,但它们过于关注局部特征信息,降低了长序列特征提取的能力。为了解决长序列特征提取能力的局限性,我们提出了一种名为“整合循环单元与信息状态递归”(IRCSR-NER)的中文NER模型。该模型整合了循环单元和信息状态递归,以提高长实体边界的识别能力。为了解决中文和英文在句法分析上重点不同的问题,我们使用句法依存方法为词级表示的句子添加词汇关系信息。IRCSR-NER应用于序列特征提取,以提高模型效率和长文本特征提取能力。该模型在关注局部特征信息的同时,捕捉上下文长距离依赖信息。我们使用四个公共数据集对我们提出的模型进行了评估,并将其与其他主流模型进行了比较。实验结果表明,我们的模型优于传统模型和主流模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09ac/11319810/2f27e635f4bb/41598_2024_56166_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09ac/11319810/b589fc1c7c12/41598_2024_56166_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09ac/11319810/1353597f50e1/41598_2024_56166_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09ac/11319810/2f27e635f4bb/41598_2024_56166_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09ac/11319810/b589fc1c7c12/41598_2024_56166_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09ac/11319810/1353597f50e1/41598_2024_56166_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09ac/11319810/2f27e635f4bb/41598_2024_56166_Fig5_HTML.jpg

相似文献

1
A Chinese named entity recognition model incorporating recurrent cell and information state recursion.一种结合循环单元和信息状态递归的中文命名实体识别模型。
Sci Rep. 2024 Mar 6;14(1):5564. doi: 10.1038/s41598-024-56166-3.
2
Chinese medical named entity recognition based on multi-granularity semantic dictionary and multimodal tree.基于多粒度语义词典和多模态树的中文医学命名实体识别。
J Biomed Inform. 2020 Nov;111:103583. doi: 10.1016/j.jbi.2020.103583. Epub 2020 Sep 30.
3
Kcr-FLAT: A Chinese-Named Entity Recognition Model with Enhanced Semantic Information.Kcr-FLAT:一种具有增强语义信息的中文命名实体识别模型。
Sensors (Basel). 2023 Feb 4;23(4):1771. doi: 10.3390/s23041771.
4
MMBERT: a unified framework for biomedical named entity recognition.MMBERT:一个用于生物医学命名实体识别的统一框架。
Med Biol Eng Comput. 2024 Jan;62(1):327-341. doi: 10.1007/s11517-023-02934-8. Epub 2023 Oct 14.
5
A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records.基于词性和自匹配注意力的深度学习模型在中文电子病历命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):65. doi: 10.1186/s12911-019-0762-7.
6
Context-Aware Attentive Multilevel Feature Fusion for Named Entity Recognition.用于命名实体识别的上下文感知注意力多级特征融合
IEEE Trans Neural Netw Learn Syst. 2022 Jun 8;PP. doi: 10.1109/TNNLS.2022.3178522.
7
ADPG: Biomedical entity recognition based on Automatic Dependency Parsing Graph.ADPG:基于自动依存句法分析图的生物医学实体识别
J Biomed Inform. 2023 Apr;140:104317. doi: 10.1016/j.jbi.2023.104317. Epub 2023 Feb 17.
8
Combinatorial feature embedding based on CNN and LSTM for biomedical named entity recognition.基于 CNN 和 LSTM 的组合特征嵌入的生物医学命名实体识别。
J Biomed Inform. 2020 Mar;103:103381. doi: 10.1016/j.jbi.2020.103381. Epub 2020 Jan 28.
9
Lexicon and attention-based named entity recognition for kiwifruit diseases and pests: A Deep learning approach.基于词汇和注意力机制的猕猴桃病虫害命名实体识别:一种深度学习方法。
Front Plant Sci. 2022 Nov 17;13:1053449. doi: 10.3389/fpls.2022.1053449. eCollection 2022.
10
Chinese Clinical Named Entity Recognition From Electronic Medical Records Based on Multisemantic Features by Using Robustly Optimized Bidirectional Encoder Representation From Transformers Pretraining Approach Whole Word Masking and Convolutional Neural Networks: Model Development and Validation.基于多语义特征,利用经过稳健优化的基于变换器预训练方法的全词掩码和卷积神经网络从电子病历中进行中文临床命名实体识别:模型开发与验证
JMIR Med Inform. 2023 May 10;11:e44597. doi: 10.2196/44597.

引用本文的文献

1
Construction and Application of Traditional Chinese Medicine Knowledge Graph Based on Large Language Model.基于大语言模型的中医药知识图谱构建与应用
Interdiscip Sci. 2025 Jul 2. doi: 10.1007/s12539-025-00735-1.
2
A Position- and Similarity-Aware Named Entity Recognition Model for Power Equipment Maintenance Work Orders.一种用于电力设备维护工单的位置和相似度感知命名实体识别模型
Sensors (Basel). 2025 Mar 26;25(7):2062. doi: 10.3390/s25072062.

本文引用的文献

1
Enriching contextualized language model from knowledge graph for biomedical information extraction.从知识图谱中丰富上下文相关的语言模型以进行生物医学信息抽取。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa110.
2
Improving the Named Entity Recognition of Chinese Electronic Medical Records by Combining Domain Dictionary and Rules.通过结合领域字典和规则来提高中文电子病历的命名实体识别。
Int J Environ Res Public Health. 2020 Apr 14;17(8):2687. doi: 10.3390/ijerph17082687.