• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从电子病历中推导断言的灵活框架。

A flexible framework for deriving assertions from electronic medical records.

机构信息

Human Language Technology Research Institute, University of Texas at Dallas, Richardson, Texas 75080-0688, USA.

出版信息

J Am Med Inform Assoc. 2011 Sep-Oct;18(5):568-73. doi: 10.1136/amiajnl-2011-000152. Epub 2011 Jul 1.

DOI:10.1136/amiajnl-2011-000152
PMID:21724741
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3168311/
Abstract

OBJECTIVE

This paper describes natural-language-processing techniques for two tasks: identification of medical concepts in clinical text, and classification of assertions, which indicate the existence, absence, or uncertainty of a medical problem. Because so many resources are available for processing clinical texts, there is interest in developing a framework in which features derived from these resources can be optimally selected for the two tasks of interest.

MATERIALS AND METHODS

The authors used two machine-learning (ML) classifiers: support vector machines (SVMs) and conditional random fields (CRFs). Because SVMs and CRFs can operate on a large set of features extracted from both clinical texts and external resources, the authors address the following research question: Which features need to be selected for obtaining optimal results? To this end, the authors devise feature-selection techniques which greatly reduce the amount of manual experimentation and improve performance.

RESULTS

The authors evaluated their approaches on the 2010 i2b2/VA challenge data. Concept extraction achieves 79.59 micro F-measure. Assertion classification achieves 93.94 micro F-measure.

DISCUSSION

Approaching medical concept extraction and assertion classification through ML-based techniques has the advantage of easily adapting to new data sets and new medical informatics tasks. However, ML-based techniques perform best when optimal features are selected. By devising promising feature-selection techniques, the authors obtain results that outperform the current state of the art.

CONCLUSION

This paper presents two ML-based approaches for processing language in the clinical texts evaluated in the 2010 i2b2/VA challenge. By using novel feature-selection methods, the techniques presented in this paper are unique among the i2b2 participants.

摘要

目的

本文描述了两种自然语言处理技术任务:在临床文本中识别医学概念,以及对断言进行分类,这些断言表明存在、不存在或不确定医学问题。由于有如此多的资源可用于处理临床文本,因此人们有兴趣开发一种框架,在该框架中,可以针对两个感兴趣的任务最优地选择从这些资源中得出的特征。

材料和方法

作者使用了两种机器学习(ML)分类器:支持向量机(SVM)和条件随机场(CRF)。由于 SVM 和 CRF 可以对从临床文本和外部资源中提取的大量特征进行操作,因此作者提出了以下研究问题:需要选择哪些特征才能获得最佳结果?为此,作者设计了特征选择技术,这些技术大大减少了手动实验的次数并提高了性能。

结果

作者在 2010 年 i2b2/VA 挑战赛的数据上评估了他们的方法。概念提取的微 F1 值达到 79.59。断言分类的微 F1 值达到 93.94。

讨论

通过基于 ML 的技术来处理医学概念提取和断言分类具有易于适应新数据集和新医学信息学任务的优势。但是,只有在选择最佳特征时,基于 ML 的技术才能发挥最佳性能。通过设计有前途的特征选择技术,作者获得了优于当前最先进水平的结果。

结论

本文提出了两种基于 ML 的方法来处理 2010 年 i2b2/VA 挑战赛中评估的临床文本中的语言。通过使用新颖的特征选择方法,本文提出的技术在 i2b2 参与者中是独一无二的。

相似文献

1
A flexible framework for deriving assertions from electronic medical records.从电子病历中推导断言的灵活框架。
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):568-73. doi: 10.1136/amiajnl-2011-000152. Epub 2011 Jul 1.
2
2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.2010 i2b2/VA 挑战赛:临床文本中的概念、断言和关系
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6. doi: 10.1136/amiajnl-2011-000203. Epub 2011 Jun 16.
3
A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries.基于机器学习的方法从出院小结中提取临床实体及其断言的研究。
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):601-6. doi: 10.1136/amiajnl-2011-000163. Epub 2011 Apr 20.
4
MITRE system for clinical assertion status classification.MITRE 临床断言状态分类系统。
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):563-7. doi: 10.1136/amiajnl-2011-000164. Epub 2011 Apr 22.
5
Hybrid methods for improving information access in clinical documents: concept, assertion, and relation identification.混合方法提高临床文档信息获取:概念、断言和关系识别。
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):588-93. doi: 10.1136/amiajnl-2011-000154. Epub 2011 May 19.
6
Automatic extraction of relations between medical concepts in clinical texts.临床文本中医用概念间关系的自动提取。
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):594-600. doi: 10.1136/amiajnl-2011-000153.
7
A knowledge discovery and reuse pipeline for information extraction in clinical notes.临床笔记中信息抽取的知识发现和重用管道。
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):574-9. doi: 10.1136/amiajnl-2011-000302. Epub 2011 Jul 7.
8
Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010.基于机器学习的临床信息抽取三阶段解决方案:i2b2 2010 年的研究现状。
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):557-62. doi: 10.1136/amiajnl-2011-000150. Epub 2011 May 12.
9
Automated concept-level information extraction to reduce the need for custom software and rules development.自动化概念级信息提取,以减少对定制软件和规则开发的需求。
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):607-13. doi: 10.1136/amiajnl-2011-000183. Epub 2011 Jun 22.
10
Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features.使用带有词表示特征的结构支持向量机识别医院出院小结中的临床实体。
BMC Med Inform Decis Mak. 2013;13 Suppl 1(Suppl 1):S1. doi: 10.1186/1472-6947-13-S1-S1. Epub 2013 Apr 5.

引用本文的文献

1
Clinical Decision Support and Natural Language Processing in Medicine: Systematic Literature Review.临床决策支持与医学自然语言处理:系统文献回顾。
J Med Internet Res. 2024 Sep 30;26:e55315. doi: 10.2196/55315.
2
Trustworthy assertion classification through prompting.通过提示进行可信断言分类。
J Biomed Inform. 2022 Aug;132:104139. doi: 10.1016/j.jbi.2022.104139. Epub 2022 Jul 8.
3
Named Entity Recognition of Medical Text Based on the Deep Neural Network.基于深度神经网络的医学文本命名实体识别
J Healthc Eng. 2022 Mar 7;2022:3990563. doi: 10.1155/2022/3990563. eCollection 2022.
4
Natural language processing algorithms for mapping clinical text fragments onto ontology concepts: a systematic review and recommendations for future studies.自然语言处理算法在将临床文本片段映射到本体概念上的应用:系统评价及对未来研究的建议。
J Biomed Semantics. 2020 Nov 16;11(1):14. doi: 10.1186/s13326-020-00231-z.
5
Classifying and Summarizing Information from Microblogs During Epidemics.疫情期间微博信息的分类与总结
Inf Syst Front. 2018;20(5):933-948. doi: 10.1007/s10796-018-9844-9. Epub 2018 Mar 20.
6
Extracting medications and associated adverse drug events using a natural language processing system combining knowledge base and deep learning.利用结合知识库和深度学习的自然语言处理系统提取药物和相关药物不良事件。
J Am Med Inform Assoc. 2020 Jan 1;27(1):56-64. doi: 10.1093/jamia/ocz141.
7
Detecting conversation topics in primary care office visits from transcripts of patient-provider interactions.从医患互动的转录本中检测初级保健就诊中的对话主题。
J Am Med Inform Assoc. 2019 Dec 1;26(12):1493-1504. doi: 10.1093/jamia/ocz140.
8
Learning relevance models for patient cohort retrieval.学习用于患者队列检索的相关性模型。
JAMIA Open. 2018 Oct;1(2):265-275. doi: 10.1093/jamiaopen/ooy010. Epub 2018 Sep 28.
9
A Novel Approach towards Medical Entity Recognition in Chinese Clinical Text.中文临床文本中医疗实体识别的新方法。
J Healthc Eng. 2017;2017:4898963. doi: 10.1155/2017/4898963. Epub 2017 Jul 5.
10
Medical Question Answering for Clinical Decision Support.用于临床决策支持的医学问答
Proc ACM Int Conf Inf Knowl Manag. 2016 Oct;2016:297-306. doi: 10.1145/2983323.2983819.

本文引用的文献

1
2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.2010 i2b2/VA 挑战赛:临床文本中的概念、断言和关系
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):552-6. doi: 10.1136/amiajnl-2011-000203. Epub 2011 Jun 16.
2
Toward an ontological treatment of disease and diagnosis.迈向疾病与诊断的本体论治疗。
Summit Transl Bioinform. 2009 Mar 1;2009:116-20.
3
High accuracy information extraction of medication information from clinical notes: 2009 i2b2 medication extraction challenge.从临床记录中提取药物信息的高精度信息提取:2009 i2b2 药物提取挑战赛。
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):524-7. doi: 10.1136/jamia.2010.003939.
4
Exploiting semantic relations for literature-based discovery.利用语义关系进行基于文献的发现。
AMIA Annu Symp Proc. 2006;2006:349-53.
5
A statistical methodology for analyzing co-occurrence data from a large sample.一种用于分析来自大样本的共现数据的统计方法。
J Biomed Inform. 2007 Jun;40(3):343-52. doi: 10.1016/j.jbi.2006.11.003. Epub 2006 Dec 1.
6
PhenoGO: assigning phenotypic context to gene ontology annotations with natural language processing.PhenoGO:通过自然语言处理为基因本体注释赋予表型背景。
Pac Symp Biocomput. 2006:64-75.
7
Extracting drug-drug interaction articles from MEDLINE to improve the content of drug databases.从医学文献数据库(MEDLINE)中提取药物相互作用文章以改善药物数据库的内容。
AMIA Annu Symp Proc. 2005;2005:216-20.
8
Recent advances in natural language processing for biomedical applications.生物医学应用中自然语言处理的最新进展。
Int J Med Inform. 2006 Jun;75(6):413-7. doi: 10.1016/j.ijmedinf.2005.06.008. Epub 2005 Aug 31.
9
An ontology for cell types.一种细胞类型本体。
Genome Biol. 2005;6(2):R21. doi: 10.1186/gb-2005-6-2-r21. Epub 2005 Jan 14.
10
The Mammalian Phenotype Ontology as a tool for annotating, analyzing and comparing phenotypic information.哺乳动物表型本体论作为一种注释、分析和比较表型信息的工具。
Genome Biol. 2005;6(1):R7. doi: 10.1186/gb-2004-6-1-r7. Epub 2004 Dec 15.