• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用基于机器学习的方法识别临床文本中的不相交临床概念。

Recognizing Disjoint Clinical Concepts in Clinical Text Using Machine Learning-based Methods.

作者信息

Tang Buzhou, Chen Qingcai, Wang Xiaolong, Wu Yonghui, Zhang Yaoyun, Jiang Min, Wang Jingqi, Xu Hua

机构信息

Key Laboratory of Network Oriented Intelligent Computation, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong, China; School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas, USA.

Key Laboratory of Network Oriented Intelligent Computation, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong, China.

出版信息

AMIA Annu Symp Proc. 2015 Nov 5;2015:1184-93. eCollection 2015.

PMID:26958258
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4765674/
Abstract

Clinical concept recognition (CCR) is a fundamental task in clinical natural language processing (NLP) field. Almost all current machine learning-based CCR systems can only recognize clinical concepts of consecutive words (called consecutive clinical concepts, CCCs), but can do nothing about clinical concepts of disjoint words (called disjoint clinical concepts, DCCs), which widely exist in clinical text. In this paper, we proposed two novel types of representations for disjoint clinical concepts, and applied two state-of-the-art machine learning methods to recognizing consecutive and disjoint concepts. Experiments conducted on the 2013 ShARe/CLEF challenge corpus showed that our best system achieved a "strict" F-measure of 0.803 for CCCs, a "strict" F-measure of 0.477 for DCCs, and a "strict" F-measure of 0.783 for all clinical concepts, significantly higher than the baseline systems by 4.2% and 4.1% respectively.

摘要

临床概念识别(CCR)是临床自然语言处理(NLP)领域的一项基础任务。几乎所有当前基于机器学习的CCR系统都只能识别连续单词的临床概念(称为连续临床概念,CCCs),但对于临床文本中广泛存在的不连续单词的临床概念(称为不连续临床概念,DCCs)却无能为力。在本文中,我们提出了两种用于不连续临床概念的新型表示方法,并应用两种最先进的机器学习方法来识别连续和不连续概念。在2013年ShARe/CLEF挑战语料库上进行的实验表明,我们最好的系统在CCCs上的“严格”F值为0.803,在DCCs上的“严格”F值为0.477,在所有临床概念上的“严格”F值为0.783,分别比基线系统显著高出4.2%和4.1%。

相似文献

1
Recognizing Disjoint Clinical Concepts in Clinical Text Using Machine Learning-based Methods.使用基于机器学习的方法识别临床文本中的不相交临床概念。
AMIA Annu Symp Proc. 2015 Nov 5;2015:1184-93. eCollection 2015.
2
Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features.使用带有词表示特征的结构支持向量机识别医院出院小结中的临床实体。
BMC Med Inform Decis Mak. 2013;13 Suppl 1(Suppl 1):S1. doi: 10.1186/1472-6947-13-S1-S1. Epub 2013 Apr 5.
3
Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives.开发和评估 RapTAT:一种用于从医学叙述中映射短语概念的机器学习系统。
J Biomed Inform. 2014 Apr;48:54-65. doi: 10.1016/j.jbi.2013.11.008. Epub 2013 Dec 4.
4
Drug-Drug Interaction Extraction via Convolutional Neural Networks.通过卷积神经网络进行药物-药物相互作用提取
Comput Math Methods Med. 2016;2016:6918381. doi: 10.1155/2016/6918381. Epub 2016 Jan 31.
5
A study of active learning methods for named entity recognition in clinical text.临床文本中命名实体识别的主动学习方法研究
J Biomed Inform. 2015 Dec;58:11-18. doi: 10.1016/j.jbi.2015.09.010. Epub 2015 Sep 15.
6
Machine learning-based coreference resolution of concepts in clinical documents.基于机器学习的临床文档中概念的共指消解。
J Am Med Inform Assoc. 2012 Sep-Oct;19(5):883-7. doi: 10.1136/amiajnl-2011-000774. Epub 2012 May 12.
7
Recognizing Questions and Answers in EMR Templates Using Natural Language Processing.使用自然语言处理识别电子病历模板中的问题与答案
Stud Health Technol Inform. 2014;202:149-52.
8
Gene ontology concept recognition using named concept: understanding the various presentations of the gene functions in biomedical literature.使用命名概念进行基因本体论概念识别:理解生物医学文献中基因功能的各种表现形式。
Database (Oxford). 2018 Jan 1;2018:bay115. doi: 10.1093/database/bay115.
9
Clinical concept normalization with a hybrid natural language processing system combining multilevel matching and machine learning ranking.临床概念规范化的混合自然语言处理系统,结合多层次匹配和机器学习排序。
J Am Med Inform Assoc. 2020 Oct 1;27(10):1576-1584. doi: 10.1093/jamia/ocaa155.
10
Patient representation learning and interpretable evaluation using clinical notes.利用临床记录进行患者表示学习和可解释评估。
J Biomed Inform. 2018 Aug;84:103-113. doi: 10.1016/j.jbi.2018.06.016. Epub 2018 Jul 3.

引用本文的文献

1
A comparative study of pre-trained language models for named entity recognition in clinical trial eligibility criteria from multiple corpora.基于多语料库的临床试验资格标准中命名实体识别的预训练语言模型的比较研究。
BMC Med Inform Decis Mak. 2022 Sep 6;22(Suppl 3):235. doi: 10.1186/s12911-022-01967-7.
2
Clinical Text Data in Machine Learning: Systematic Review.机器学习中的临床文本数据:系统综述
JMIR Med Inform. 2020 Mar 31;8(3):e17984. doi: 10.2196/17984.
3
Extracting medications and associated adverse drug events using a natural language processing system combining knowledge base and deep learning.利用结合知识库和深度学习的自然语言处理系统提取药物和相关药物不良事件。
J Am Med Inform Assoc. 2020 Jan 1;27(1):56-64. doi: 10.1093/jamia/ocz141.
4
Enhancing clinical concept extraction with contextual embeddings.利用上下文嵌入增强临床概念提取。
J Am Med Inform Assoc. 2019 Nov 1;26(11):1297-1304. doi: 10.1093/jamia/ocz096.
5
Entity recognition in Chinese clinical text using attention-based CNN-LSTM-CRF.基于注意力机制的卷积神经网络-长短时记忆网络-条件随机场在中文临床文本中的实体识别。
BMC Med Inform Decis Mak. 2019 Apr 4;19(Suppl 3):74. doi: 10.1186/s12911-019-0787-y.
6
Extraction of Information Related to Adverse Drug Events from Electronic Health Record Notes: Design of an End-to-End Model Based on Deep Learning.从电子健康记录笔记中提取与药物不良事件相关的信息:基于深度学习的端到端模型设计
JMIR Med Inform. 2018 Nov 26;6(4):e12159. doi: 10.2196/12159.
7
Disorder recognition in clinical texts using multi-label structured SVM.使用多标签结构化支持向量机识别临床文本中的病症
BMC Bioinformatics. 2017 Jan 31;18(1):75. doi: 10.1186/s12859-017-1476-4.

本文引用的文献

1
A hybrid system for temporal information extraction from clinical text.一种从临床文本中提取时间信息的混合系统。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):828-35. doi: 10.1136/amiajnl-2013-001635. Epub 2013 Apr 9.
2
Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features.使用带有词表示特征的结构支持向量机识别医院出院小结中的临床实体。
BMC Med Inform Decis Mak. 2013;13 Suppl 1(Suppl 1):S1. doi: 10.1186/1472-6947-13-S1-S1. Epub 2013 Apr 5.
3
Clinical decision support with automated text processing for cervical cancer screening.基于自动化文本处理的宫颈癌筛查临床决策支持。
J Am Med Inform Assoc. 2012 Sep-Oct;19(5):833-9. doi: 10.1136/amiajnl-2012-000820. Epub 2012 Apr 29.
4
2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.2010 i2b2/VA 挑战赛:临床文本中的概念、断言和关系
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6. doi: 10.1136/amiajnl-2011-000203. Epub 2011 Jun 16.
5
A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries.基于机器学习的方法从出院小结中提取临床实体及其断言的研究。
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):601-6. doi: 10.1136/amiajnl-2011-000163. Epub 2011 Apr 20.
6
Extracting medication information from clinical text.从临床文本中提取药物信息。
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):514-8. doi: 10.1136/jamia.2010.003947.
7
Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.梅奥临床文本分析和知识提取系统(cTAKES):架构、组件评估和应用。
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):507-13. doi: 10.1136/jamia.2009.001560.
8
An overview of MetaMap: historical perspective and recent advances.MetaMap 概述:历史视角与最新进展。
J Am Med Inform Assoc. 2010 May-Jun;17(3):229-36. doi: 10.1136/jamia.2009.002733.
9
Clinical research informatics: challenges, opportunities and definition for an emerging domain.临床研究信息学:一个新兴领域的挑战、机遇与定义
J Am Med Inform Assoc. 2009 May-Jun;16(3):316-27. doi: 10.1197/jamia.M3005. Epub 2009 Mar 4.
10
Extracting information from textual documents in the electronic health record: a review of recent research.从电子健康记录中的文本文件提取信息:近期研究综述
Yearb Med Inform. 2008:128-44.