• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

中文临床文本中医疗实体识别的新方法。

A Novel Approach towards Medical Entity Recognition in Chinese Clinical Text.

机构信息

Second Affiliated Hospital Zhejiang University School of Medicine, Hangzhou, Zhejiang Province 310000, China.

Sir Run Run Shaw Hospital Zhejiang University School of Medicine, Hangzhou, Zhejiang Province 310000, China.

出版信息

J Healthc Eng. 2017;2017:4898963. doi: 10.1155/2017/4898963. Epub 2017 Jul 5.

DOI:10.1155/2017/4898963
PMID:29065612
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5516712/
Abstract

Medical entity recognition, a basic task in the language processing of clinical data, has been extensively studied in analyzing admission notes in alphabetic languages such as English. However, much less work has been done on nonstructural texts that are written in Chinese, or in the setting of differentiation of Chinese drug names between traditional Chinese medicine and Western medicine. Here, we propose a novel cascade-type Chinese medication entity recognition approach that aims at integrating the sentence category classifier from a support vector machine and the conditional random field-based medication entity recognition. We hypothesized that this approach could avoid the side effects of abundant negative samples and improve the performance of the named entity recognition from admission notes written in Chinese. Therefore, we applied this approach to a test set of 324 Chinese-written admission notes with manual annotation by medical experts. Our data demonstrated that this approach had a score of 94.2% in precision, 92.8% in recall, and 93.5% in F-measure for the recognition of traditional Chinese medicine drug names and 91.2% in precision, 92.6% in recall, and 91.7% F-measure for the recognition of Western medicine drug names. The differences in F-measure were significant compared with those in the baseline systems.

摘要

医学实体识别是临床数据语言处理中的一个基本任务,在分析英语等字母语言的入院记录时已经得到了广泛研究。然而,对于用中文书写的非结构化文本,或者在区分中药和西药药物名称方面的研究工作要少得多。在这里,我们提出了一种新的级联式中药药物实体识别方法,旨在整合来自支持向量机的句子类别分类器和基于条件随机场的药物实体识别。我们假设这种方法可以避免大量负样本的副作用,并提高中文入院记录中命名实体识别的性能。因此,我们将这种方法应用于一组由医学专家手动注释的 324 份中文入院记录的测试集。我们的数据表明,这种方法在识别中药药物名称时的精度、召回率和 F1 分数分别为 94.2%、92.8%和 93.5%,在识别西药药物名称时的精度、召回率和 F1 分数分别为 91.2%、92.6%和 91.7%。与基线系统相比,F1 分数的差异具有统计学意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f993/5516712/4214fa2edcac/JHE2017-4898963.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f993/5516712/2e81d433942f/JHE2017-4898963.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f993/5516712/404ef9f1a0df/JHE2017-4898963.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f993/5516712/38d2f77b387f/JHE2017-4898963.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f993/5516712/4214fa2edcac/JHE2017-4898963.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f993/5516712/2e81d433942f/JHE2017-4898963.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f993/5516712/404ef9f1a0df/JHE2017-4898963.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f993/5516712/38d2f77b387f/JHE2017-4898963.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f993/5516712/4214fa2edcac/JHE2017-4898963.004.jpg

相似文献

1
A Novel Approach towards Medical Entity Recognition in Chinese Clinical Text.中文临床文本中医疗实体识别的新方法。
J Healthc Eng. 2017;2017:4898963. doi: 10.1155/2017/4898963. Epub 2017 Jul 5.
2
A comprehensive study of named entity recognition in Chinese clinical text.中文临床文本命名实体识别的综合研究。
J Am Med Inform Assoc. 2014 Sep-Oct;21(5):808-14. doi: 10.1136/amiajnl-2013-002381. Epub 2013 Dec 17.
3
Automatic recognition of disorders, findings, pharmaceuticals and body structures from clinical text: an annotation and machine learning study.从临床文本中自动识别疾病、检查结果、药物和身体结构:一项注释与机器学习研究。
J Biomed Inform. 2014 Jun;49:148-58. doi: 10.1016/j.jbi.2014.01.012. Epub 2014 Feb 4.
4
Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.从中文电子病历中提取垂体腺瘤的临床命名实体。
BMC Med Inform Decis Mak. 2022 Mar 23;22(1):72. doi: 10.1186/s12911-022-01810-z.
5
Clinical Named Entity Recognition from Chinese Electronic Medical Records Based on Deep Learning Pretraining.基于深度学习预训练的中文电子病历临床命名实体识别。
J Healthc Eng. 2020 Nov 24;2020:8829219. doi: 10.1155/2020/8829219. eCollection 2020.
6
An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records.基于注意力的深度学习模型在中文电子病历临床命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Dec 5;19(Suppl 5):235. doi: 10.1186/s12911-019-0933-6.
7
A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries.基于机器学习的方法从出院小结中提取临床实体及其断言的研究。
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):601-6. doi: 10.1136/amiajnl-2011-000163. Epub 2011 Apr 20.
8
A multitask bi-directional RNN model for named entity recognition on Chinese electronic medical records.一种用于中文电子病历命名实体识别的多任务双向 RNN 模型。
BMC Bioinformatics. 2018 Dec 28;19(Suppl 17):499. doi: 10.1186/s12859-018-2467-9.
9
A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records.基于词性和自匹配注意力的深度学习模型在中文电子病历命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):65. doi: 10.1186/s12911-019-0762-7.
10
Extracting important information from Chinese Operation Notes with natural language processing methods.运用自然语言处理方法从中文手术记录中提取重要信息。
J Biomed Inform. 2014 Apr;48:130-6. doi: 10.1016/j.jbi.2013.12.017. Epub 2014 Jan 31.

引用本文的文献

1
Improved XLNet modeling for Chinese named entity recognition of edible fungus.用于食用菌中文命名实体识别的改进型XLNet模型
Front Plant Sci. 2024 Jun 25;15:1368847. doi: 10.3389/fpls.2024.1368847. eCollection 2024.
2
Surgical procedure long terms recognition from Chinese literature incorporating structural feature.结合结构特征从中国文献中获得手术程序的长期认知。
Heliyon. 2022 Oct 29;8(11):e11291. doi: 10.1016/j.heliyon.2022.e11291. eCollection 2022 Nov.
3
Information Extraction from the Text Data on Traditional Chinese Medicine: A Review on Tasks, Challenges, and Methods from 2010 to 2021.

本文引用的文献

1
Detecting negation and scope in Chinese clinical notes using character and word embedding.使用字符和词嵌入检测中文临床记录中的否定和范围
Comput Methods Programs Biomed. 2017 Mar;140:53-59. doi: 10.1016/j.cmpb.2016.11.009. Epub 2016 Nov 23.
2
Natural language processing in biomedicine: a unified system architecture overview.生物医学中的自然语言处理:统一系统架构概述。
Methods Mol Biol. 2014;1168:275-94. doi: 10.1007/978-1-4939-0847-9_16.
3
A comprehensive study of named entity recognition in Chinese clinical text.中文临床文本命名实体识别的综合研究。
从中医文本数据中提取信息:2010年至2021年任务、挑战及方法综述
Evid Based Complement Alternat Med. 2022 May 13;2022:1679589. doi: 10.1155/2022/1679589. eCollection 2022.
4
A Core Drug Discovery Framework from Large-Scale Literature for Cold Pathogenic Disease Treatment in Traditional Chinese Medicine.基于大规模文献的寒病治疗中药核心药物发现框架
J Healthc Eng. 2021 Aug 4;2021:9930543. doi: 10.1155/2021/9930543. eCollection 2021.
5
Minimalistic Approach to Coreference Resolution in Lithuanian Medical Records.立陶宛语病历中指代消解的极简方法。
Comput Math Methods Med. 2019 Mar 20;2019:9079840. doi: 10.1155/2019/9079840. eCollection 2019.
6
Clinical Named Entity Recognition From Chinese Electronic Health Records via Machine Learning Methods.基于机器学习方法的中文电子健康记录临床命名实体识别
JMIR Med Inform. 2018 Dec 17;6(4):e50. doi: 10.2196/medinform.9965.
7
Unsupervised Medical Entity Recognition and Linking in Chinese Online Medical Text.中文在线医疗文本中的无监督医学实体识别与链接
J Healthc Eng. 2018 Apr 18;2018:2548537. doi: 10.1155/2018/2548537. eCollection 2018.
J Am Med Inform Assoc. 2014 Sep-Oct;21(5):808-14. doi: 10.1136/amiajnl-2013-002381. Epub 2013 Dec 17.
4
Introduction to machine learning.机器学习导论
Methods Mol Biol. 2014;1107:105-28. doi: 10.1007/978-1-62703-748-8_7.
5
Supervised methods for symptom name recognition in free-text clinical records of traditional Chinese medicine: an empirical study.中医自由文本临床记录中症状名称识别的监督方法:一项实证研究。
J Biomed Inform. 2014 Feb;47:91-104. doi: 10.1016/j.jbi.2013.09.008. Epub 2013 Sep 23.
6
Combining an expert-based medical entity recognizer to a machine-learning system: methods and a case study.将基于专家的医学实体识别器与机器学习系统相结合:方法与案例研究。
Biomed Inform Insights. 2013 Aug 1;6(Suppl 1):51-62. doi: 10.4137/BII.S11770. eCollection 2013.
7
A comparison of electronic health records at two major Peking University Hospitals in China to United States meaningful use objectives.中国两所主要北京大学附属医院的电子健康记录与美国有意义使用目标的比较。
BMC Med Inform Decis Mak. 2013 Aug 28;13:96. doi: 10.1186/1472-6947-13-96.
8
Joint segmentation and named entity recognition using dual decomposition in Chinese discharge summaries.使用中文出院小结中的对偶分解进行联合分割和命名实体识别。
J Am Med Inform Assoc. 2014 Feb;21(e1):e84-92. doi: 10.1136/amiajnl-2013-001806. Epub 2013 Aug 9.
9
Building large collections of Chinese and English medical terms from semi-structured and encyclopedia websites.从半结构化和百科网站构建大型中文和英文医学术语集。
PLoS One. 2013 Jul 9;8(7):e67526. doi: 10.1371/journal.pone.0067526. Print 2013.
10
Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features.使用带有词表示特征的结构支持向量机识别医院出院小结中的临床实体。
BMC Med Inform Decis Mak. 2013;13 Suppl 1(Suppl 1):S1. doi: 10.1186/1472-6947-13-S1-S1. Epub 2013 Apr 5.