• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

电子病历中的中文临床命名实体识别:基于上下文特征表示的格长短期记忆模型的开发

Chinese Clinical Named Entity Recognition in Electronic Medical Records: Development of a Lattice Long Short-Term Memory Model With Contextualized Character Representations.

作者信息

Li Yongbin, Wang Xiaohua, Hui Linhu, Zou Liping, Li Hongjin, Xu Luo, Liu Weihai

机构信息

School of Medical Information Engineering, Zunyi Medical University, Zunyi, China.

Radiology Department, Beilun District People's Hospital, Ningbo, China.

出版信息

JMIR Med Inform. 2020 Sep 4;8(9):e19848. doi: 10.2196/19848.

DOI:10.2196/19848
PMID:32885786
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7501578/
Abstract

BACKGROUND

Clinical named entity recognition (CNER), whose goal is to automatically identify clinical entities in electronic medical records (EMRs), is an important research direction of clinical text data mining and information extraction. The promotion of CNER can provide support for clinical decision making and medical knowledge base construction, which could then improve overall medical quality. Compared with English CNER, and due to the complexity of Chinese word segmentation and grammar, Chinese CNER was implemented later and is more challenging.

OBJECTIVE

With the development of distributed representation and deep learning, a series of models have been applied in Chinese CNER. Different from the English version, Chinese CNER is mainly divided into character-based and word-based methods that cannot make comprehensive use of EMR information and cannot solve the problem of ambiguity in word representation.

METHODS

In this paper, we propose a lattice long short-term memory (LSTM) model combined with a variant contextualized character representation and a conditional random field (CRF) layer for Chinese CNER: the Embeddings from Language Models (ELMo)-lattice-LSTM-CRF model. The lattice LSTM model can effectively utilize the information from characters and words in Chinese EMRs; in addition, the variant ELMo model uses Chinese characters as input instead of the character-encoding layer of the ELMo model, so as to learn domain-specific contextualized character embeddings.

RESULTS

We evaluated our method using two Chinese CNER datasets from the China Conference on Knowledge Graph and Semantic Computing (CCKS): the CCKS-2017 CNER dataset and the CCKS-2019 CNER dataset. We obtained F1 scores of 90.13% and 85.02% on the test sets of these two datasets, respectively.

CONCLUSIONS

Our results show that our proposed method is effective in Chinese CNER. In addition, the results of our experiments show that variant contextualized character representations can significantly improve the performance of the model.

摘要

背景

临床命名实体识别(CNER)旨在自动识别电子病历(EMR)中的临床实体,是临床文本数据挖掘和信息提取的重要研究方向。CNER的推广可为临床决策和医学知识库建设提供支持,进而提高整体医疗质量。与英文CNER相比,由于中文分词和语法的复杂性,中文CNER起步较晚且更具挑战性。

目的

随着分布式表示和深度学习的发展,一系列模型已应用于中文CNER。与英文版本不同,中文CNER主要分为基于字符和基于词的方法,这些方法无法全面利用EMR信息,也无法解决词表示中的歧义问题。

方法

在本文中,我们提出了一种结合变体上下文相关字符表示和条件随机场(CRF)层的格长短期记忆(LSTM)模型用于中文CNER:基于语言模型(ELMo)的嵌入 - 格 - LSTM - CRF模型。格LSTM模型可以有效利用中文EMR中字符和词的信息;此外,变体ELMo模型使用汉字作为输入,而不是ELMo模型的字符编码层,从而学习特定领域的上下文相关字符嵌入。

结果

我们使用来自中国知识图谱与语义计算会议(CCKS)的两个中文CNER数据集评估了我们的方法:CCKS - 2017 CNER数据集和CCKS - 2019 CNER数据集。我们在这两个数据集的测试集上分别获得了90.13%和85.02%的F1分数。

结论

我们的结果表明,我们提出的方法在中文CNER中是有效的。此外,我们的实验结果表明,变体上下文相关字符表示可以显著提高模型的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6a7/7501578/56e02e457c05/medinform_v8i9e19848_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6a7/7501578/f25779ea4890/medinform_v8i9e19848_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6a7/7501578/db49c75a638f/medinform_v8i9e19848_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6a7/7501578/56e02e457c05/medinform_v8i9e19848_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6a7/7501578/f25779ea4890/medinform_v8i9e19848_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6a7/7501578/db49c75a638f/medinform_v8i9e19848_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6a7/7501578/56e02e457c05/medinform_v8i9e19848_fig3.jpg

相似文献

1
Chinese Clinical Named Entity Recognition in Electronic Medical Records: Development of a Lattice Long Short-Term Memory Model With Contextualized Character Representations.电子病历中的中文临床命名实体识别:基于上下文特征表示的格长短期记忆模型的开发
JMIR Med Inform. 2020 Sep 4;8(9):e19848. doi: 10.2196/19848.
2
Chinese Clinical Named Entity Recognition From Electronic Medical Records Based on Multisemantic Features by Using Robustly Optimized Bidirectional Encoder Representation From Transformers Pretraining Approach Whole Word Masking and Convolutional Neural Networks: Model Development and Validation.基于多语义特征,利用经过稳健优化的基于变换器预训练方法的全词掩码和卷积神经网络从电子病历中进行中文临床命名实体识别:模型开发与验证
JMIR Med Inform. 2023 May 10;11:e44597. doi: 10.2196/44597.
3
Adversarial training based lattice LSTM for Chinese clinical named entity recognition.基于对抗训练的格 lattice LSTM 进行中文临床命名实体识别。
J Biomed Inform. 2019 Nov;99:103290. doi: 10.1016/j.jbi.2019.103290. Epub 2019 Sep 23.
4
An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records.基于注意力的深度学习模型在中文电子病历临床命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Dec 5;19(Suppl 5):235. doi: 10.1186/s12911-019-0933-6.
5
A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records.基于词性和自匹配注意力的深度学习模型在中文电子病历命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):65. doi: 10.1186/s12911-019-0762-7.
6
Clinical Named Entity Recognition From Chinese Electronic Health Records via Machine Learning Methods.基于机器学习方法的中文电子健康记录临床命名实体识别
JMIR Med Inform. 2018 Dec 17;6(4):e50. doi: 10.2196/medinform.9965.
7
Medical Named Entity Extraction from Chinese Resident Admit Notes Using Character and Word Attention-Enhanced Neural Network.基于字符和词注意力增强神经网络的中文住院病案中医学命名实体抽取
Int J Environ Res Public Health. 2020 Mar 2;17(5):1614. doi: 10.3390/ijerph17051614.
8
Chinese clinical named entity recognition with variant neural structures based on BERT methods.基于 BERT 方法的中文临床命名实体识别与变体神经结构。
J Biomed Inform. 2020 Jul;107:103422. doi: 10.1016/j.jbi.2020.103422. Epub 2020 Apr 28.
9
Multi-Level Representation Learning for Chinese Medical Entity Recognition: Model Development and Validation.用于中文医学实体识别的多层次表示学习:模型开发与验证
JMIR Med Inform. 2020 May 4;8(5):e17637. doi: 10.2196/17637.
10
Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.从中文电子病历中提取垂体腺瘤的临床命名实体。
BMC Med Inform Decis Mak. 2022 Mar 23;22(1):72. doi: 10.1186/s12911-022-01810-z.

引用本文的文献

1
Chinese medical named entity recognition integrating adversarial training and feature enhancement.融合对抗训练与特征增强的中医命名实体识别
Sci Rep. 2025 Apr 28;15(1):14844. doi: 10.1038/s41598-025-98465-3.
2
Chinese Clinical Named Entity Recognition With Segmentation Synonym Sentence Synthesis Mechanism: Algorithm Development and Validation.基于分词、同义词和句子合成机制的中文临床命名实体识别:算法开发与验证
JMIR Med Inform. 2024 Nov 21;12:e60334. doi: 10.2196/60334.
3
MF-MNER: Multi-models Fusion for MNER in Chinese Clinical Electronic Medical Records.

本文引用的文献

1
Adversarial training based lattice LSTM for Chinese clinical named entity recognition.基于对抗训练的格 lattice LSTM 进行中文临床命名实体识别。
J Biomed Inform. 2019 Nov;99:103290. doi: 10.1016/j.jbi.2019.103290. Epub 2019 Sep 23.
2
Incorporating dictionaries into deep neural networks for the Chinese clinical named entity recognition.将词典纳入深度神经网络以进行中文临床命名实体识别。
J Biomed Inform. 2019 Apr;92:103133. doi: 10.1016/j.jbi.2019.103133. Epub 2019 Feb 25.
3
Clinical Named Entity Recognition From Chinese Electronic Health Records via Machine Learning Methods.
MF-MNER:中文临床电子病历中的多模型融合命名实体识别。
Interdiscip Sci. 2024 Jun;16(2):489-502. doi: 10.1007/s12539-024-00624-z. Epub 2024 Apr 5.
4
A BERT-Span model for Chinese named entity recognition in rehabilitation medicine.一种用于康复医学中文命名实体识别的BERT跨度模型。
PeerJ Comput Sci. 2023 Aug 21;9:e1535. doi: 10.7717/peerj-cs.1535. eCollection 2023.
5
Chinese medical entity recognition based on the dual-branch TENER model.基于双分支 TENER 模型的中文医疗实体识别。
BMC Med Inform Decis Mak. 2023 Jul 24;23(1):136. doi: 10.1186/s12911-023-02243-y.
6
Chinese Clinical Named Entity Recognition From Electronic Medical Records Based on Multisemantic Features by Using Robustly Optimized Bidirectional Encoder Representation From Transformers Pretraining Approach Whole Word Masking and Convolutional Neural Networks: Model Development and Validation.基于多语义特征,利用经过稳健优化的基于变换器预训练方法的全词掩码和卷积神经网络从电子病历中进行中文临床命名实体识别:模型开发与验证
JMIR Med Inform. 2023 May 10;11:e44597. doi: 10.2196/44597.
7
An Alternative Application of Natural Language Processing to Express a Characteristic Feature of Diseases in Japanese Medical Records.自然语言处理在日本医疗记录中表达疾病特征的一种新应用。
Methods Inf Med. 2023 Sep;62(3-04):110-118. doi: 10.1055/a-2039-3773. Epub 2023 Feb 21.
8
An imConvNet-based deep learning model for Chinese medical named entity recognition.基于 imConvNet 的深度学习模型在中文医疗命名实体识别中的应用。
BMC Med Inform Decis Mak. 2022 Nov 21;22(1):303. doi: 10.1186/s12911-022-02049-4.
9
An Ensemble Semantic Textual Similarity Measure Based on Multiple Evidences for Biomedical Documents.基于多种证据的生物医学文档集成语义文本相似度度量。
Comput Math Methods Med. 2022 Aug 27;2022:8238432. doi: 10.1155/2022/8238432. eCollection 2022.
10
Medical Specialty Recommendations by an Artificial Intelligence Chatbot on a Smartphone: Development and Deployment.智能手机人工智能聊天机器人的医学专业推荐:开发与部署。
J Med Internet Res. 2021 May 6;23(5):e27460. doi: 10.2196/27460.
基于机器学习方法的中文电子健康记录临床命名实体识别
JMIR Med Inform. 2018 Dec 17;6(4):e50. doi: 10.2196/medinform.9965.
4
Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition.用于健康领域命名实体识别的具有专用词嵌入的递归神经网络。
J Biomed Inform. 2017 Dec;76:102-109. doi: 10.1016/j.jbi.2017.11.007. Epub 2017 Nov 13.
5
Deep learning with word embeddings improves biomedical named entity recognition.使用词嵌入的深度学习可改善生物医学命名实体识别。
Bioinformatics. 2017 Jul 15;33(14):i37-i48. doi: 10.1093/bioinformatics/btx228.
6
Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts.用于药物警戒的深度学习:用于标记推特帖子中药物不良反应的循环神经网络架构
J Am Med Inform Assoc. 2017 Jul 1;24(4):813-821. doi: 10.1093/jamia/ocw180.
7
2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.2010 i2b2/VA 挑战赛:临床文本中的概念、断言和关系
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6. doi: 10.1136/amiajnl-2011-000203. Epub 2011 Jun 16.
8
Electronic medical records (EMRs), epidemiology, and epistemology: reflections on EMRs and future pediatric clinical research.电子病历 (EMR)、流行病学和认识论:对 EMR 与未来儿科临床研究的反思。
Acad Pediatr. 2011 Jul-Aug;11(4):280-7. doi: 10.1016/j.acap.2011.02.007. Epub 2011 May 31.
9
Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.梅奥临床文本分析和知识提取系统(cTAKES):架构、组件评估和应用。
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):507-13. doi: 10.1136/jamia.2009.001560.
10
What can natural language processing do for clinical decision support?自然语言处理能为临床决策支持做些什么?
J Biomed Inform. 2009 Oct;42(5):760-72. doi: 10.1016/j.jbi.2009.08.007. Epub 2009 Aug 13.