• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于字符和词注意力增强神经网络的中文住院病案中医学命名实体抽取

Medical Named Entity Extraction from Chinese Resident Admit Notes Using Character and Word Attention-Enhanced Neural Network.

机构信息

School of Automation, Central South University, Changsha 410083, China.

PTY LTD., Changsha 410083, China.

出版信息

Int J Environ Res Public Health. 2020 Mar 2;17(5):1614. doi: 10.3390/ijerph17051614.

DOI:10.3390/ijerph17051614
PMID:32131522
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7084381/
Abstract

The resident admit notes (RANs) in electronic medical records (EMRs) is first-hand information to study the patient's condition. Medical entity extraction of RANs is an important task to get disease information for medical decision-making. For Chinese electronic medical records, each medical entity contains not only word information but also rich character information. Effective combination of words and characters is very important for medical entity extraction. We propose a medical entity recognition model based on a character and word attention-enhanced (CWAE) neural network for Chinese RANs. In our model, word embeddings and character-based embeddings are obtained through character-enhanced word embedding (CWE) model and Convolutional Neural Network (CNN) model. Then attention mechanism combines the character-based embeddings and word embeddings together, which significantly improves the expression ability of words. The new word embeddings obtained by the attention mechanism are taken as the input to bidirectional long short-term memory (BI-LSTM) and conditional random field (CRF) to extract entities. We extracted nine types of key medical entities from Chinese RANs and evaluated our model. The proposed method was compared with two traditional machine learning methods CRF, support vector machine (SVM), and the related deep learning models. The result shows that our model has better performance, and the result of our model reaches 94.44% in the F1-score.

摘要

住院医师入院记录(RANs)是电子病历(EMRs)中用于研究患者病情的第一手资料。从 RANs 中提取医学实体是获取疾病信息以用于医疗决策的重要任务。对于中文电子病历,每个医学实体不仅包含单词信息,还包含丰富的字符信息。有效结合单词和字符对于医学实体提取非常重要。我们提出了一种基于字符和单词注意力增强(CWAE)神经网络的中文 RANs 医学实体识别模型。在我们的模型中,通过字符增强的单词嵌入(CWE)模型和卷积神经网络(CNN)模型获得单词嵌入和基于字符的嵌入。然后,注意力机制将基于字符的嵌入和单词嵌入结合在一起,这显著提高了单词的表达能力。通过注意力机制获得的新单词嵌入被用作双向长短期记忆(BI-LSTM)和条件随机场(CRF)的输入,以提取实体。我们从中文 RANs 中提取了九种关键医学实体,并对我们的模型进行了评估。将提出的方法与两种传统机器学习方法 CRF、支持向量机(SVM)和相关的深度学习模型进行了比较。结果表明,我们的模型具有更好的性能,F1 得分达到了 94.44%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/42df277fe277/ijerph-17-01614-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/1739a08f57d2/ijerph-17-01614-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/47e9cdfb18fb/ijerph-17-01614-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/cb4ddd7748e1/ijerph-17-01614-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/df48abce2375/ijerph-17-01614-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/7fb944aa0435/ijerph-17-01614-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/412aee56e870/ijerph-17-01614-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/935983c4b659/ijerph-17-01614-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/4226842542c9/ijerph-17-01614-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/42df277fe277/ijerph-17-01614-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/1739a08f57d2/ijerph-17-01614-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/47e9cdfb18fb/ijerph-17-01614-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/cb4ddd7748e1/ijerph-17-01614-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/df48abce2375/ijerph-17-01614-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/7fb944aa0435/ijerph-17-01614-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/412aee56e870/ijerph-17-01614-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/935983c4b659/ijerph-17-01614-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/4226842542c9/ijerph-17-01614-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ebf/7084381/42df277fe277/ijerph-17-01614-g009.jpg

相似文献

1
Medical Named Entity Extraction from Chinese Resident Admit Notes Using Character and Word Attention-Enhanced Neural Network.基于字符和词注意力增强神经网络的中文住院病案中医学命名实体抽取
Int J Environ Res Public Health. 2020 Mar 2;17(5):1614. doi: 10.3390/ijerph17051614.
2
Constructing a Chinese electronic medical record corpus for named entity recognition on resident admit notes.构建用于住院记录中命名实体识别的中文电子病历语料库。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):56. doi: 10.1186/s12911-019-0759-2.
3
A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records.基于词性和自匹配注意力的深度学习模型在中文电子病历命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):65. doi: 10.1186/s12911-019-0762-7.
4
Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.从中文电子病历中提取垂体腺瘤的临床命名实体。
BMC Med Inform Decis Mak. 2022 Mar 23;22(1):72. doi: 10.1186/s12911-022-01810-z.
5
Chinese Clinical Named Entity Recognition From Electronic Medical Records Based on Multisemantic Features by Using Robustly Optimized Bidirectional Encoder Representation From Transformers Pretraining Approach Whole Word Masking and Convolutional Neural Networks: Model Development and Validation.基于多语义特征,利用经过稳健优化的基于变换器预训练方法的全词掩码和卷积神经网络从电子病历中进行中文临床命名实体识别:模型开发与验证
JMIR Med Inform. 2023 May 10;11:e44597. doi: 10.2196/44597.
6
Chinese Clinical Named Entity Recognition in Electronic Medical Records: Development of a Lattice Long Short-Term Memory Model With Contextualized Character Representations.电子病历中的中文临床命名实体识别:基于上下文特征表示的格长短期记忆模型的开发
JMIR Med Inform. 2020 Sep 4;8(9):e19848. doi: 10.2196/19848.
7
Temporal indexing of medical entity in Chinese clinical notes.中文临床记录中医疗实体的时间索引。
BMC Med Inform Decis Mak. 2019 Jan 31;19(Suppl 1):17. doi: 10.1186/s12911-019-0735-x.
8
Improving the Named Entity Recognition of Chinese Electronic Medical Records by Combining Domain Dictionary and Rules.通过结合领域字典和规则来提高中文电子病历的命名实体识别。
Int J Environ Res Public Health. 2020 Apr 14;17(8):2687. doi: 10.3390/ijerph17082687.
9
Chinese clinical named entity recognition with radical-level feature and self-attention mechanism.基于词干级特征和自注意力机制的中文临床命名实体识别。
J Biomed Inform. 2019 Oct;98:103289. doi: 10.1016/j.jbi.2019.103289. Epub 2019 Sep 18.
10
Clinical Named Entity Recognition From Chinese Electronic Health Records via Machine Learning Methods.基于机器学习方法的中文电子健康记录临床命名实体识别
JMIR Med Inform. 2018 Dec 17;6(4):e50. doi: 10.2196/medinform.9965.

引用本文的文献

1
Automatic knowledge extraction from Chinese electronic medical records and rheumatoid arthritis knowledge graph construction.从中国电子病历中自动提取知识并构建类风湿性关节炎知识图谱。
Quant Imaging Med Surg. 2023 Jun 1;13(6):3873-3890. doi: 10.21037/qims-22-1158. Epub 2023 May 8.
2
An early aortic dissection screening model and applied research based on ensemble learning.基于集成学习的早期主动脉夹层筛查模型及应用研究
Ann Transl Med. 2020 Dec;8(23):1578. doi: 10.21037/atm-20-1475.

本文引用的文献

1
Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) Analysis.深度电子健康记录(EHR):深度学习技术在电子健康记录(EHR)分析中的最新进展综述。
IEEE J Biomed Health Inform. 2018 Sep;22(5):1589-1604. doi: 10.1109/JBHI.2017.2767063. Epub 2017 Oct 27.
2
Deep learning with word embeddings improves biomedical named entity recognition.使用词嵌入的深度学习可改善生物医学命名实体识别。
Bioinformatics. 2017 Jul 15;33(14):i37-i48. doi: 10.1093/bioinformatics/btx228.
3
Deep Learning for Health Informatics.
用于健康信息学的深度学习
IEEE J Biomed Health Inform. 2017 Jan;21(1):4-21. doi: 10.1109/JBHI.2016.2636665. Epub 2016 Dec 29.
4
Electronic medical records (EMRs), epidemiology, and epistemology: reflections on EMRs and future pediatric clinical research.电子病历 (EMR)、流行病学和认识论:对 EMR 与未来儿科临床研究的反思。
Acad Pediatr. 2011 Jul-Aug;11(4):280-7. doi: 10.1016/j.acap.2011.02.007. Epub 2011 May 31.
5
Incorporating rich background knowledge for gene named entity classification and recognition.整合丰富的背景知识用于基因命名实体分类与识别。
BMC Bioinformatics. 2009 Jul 17;10:223. doi: 10.1186/1471-2105-10-223.
6
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.