• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于集成方法的临床文本中药物及相关信息的提取。

Ensemble method-based extraction of medication and related information from clinical texts.

机构信息

Biomedical Informatics Center, Medical University of South Carolina, Charleston, South Carolina, USA.

出版信息

J Am Med Inform Assoc. 2020 Jan 1;27(1):31-38. doi: 10.1093/jamia/ocz100.

DOI:10.1093/jamia/ocz100
PMID:31282932
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7489099/
Abstract

OBJECTIVE

Accurate and complete information about medications and related information is crucial for effective clinical decision support and precise health care. Recognition and reduction of adverse drug events is also central to effective patient care. The goal of this research is the development of a natural language processing (NLP) system to automatically extract medication and adverse drug event information from electronic health records. This effort was part of the 2018 n2c2 shared task on adverse drug events and medication extraction.

MATERIALS AND METHODS

The new NLP system implements a stacked generalization based on a search-based structured prediction algorithm for concept extraction. We trained 4 sequential classifiers using a variety of structured learning algorithms. To enhance accuracy, we created a stacked ensemble consisting of these concept extraction models trained on the shared task training data. We implemented a support vector machine model to identify related concepts.

RESULTS

Experiments with the official test set showed that our stacked ensemble achieved an F1 score of 92.66%. The relation extraction model with given concepts reached a 93.59% F1 score. Our end-to-end system yielded overall micro-averaged recall, precision, and F1 score of 92.52%, 81.88% and 86.88%, respectively. Our NLP system for adverse drug events and medication extraction ranked within the top 5 of teams participating in the challenge.

CONCLUSION

This study demonstrated that a stacked ensemble with a search-based structured prediction algorithm achieved good performance by effectively integrating the output of individual classifiers and could provide a valid solution for other clinical concept extraction tasks.

摘要

目的

准确、完整的药物及相关信息对于有效的临床决策支持和精准的医疗保健至关重要。识别和减少药物不良事件也是有效患者护理的核心。本研究的目标是开发一种自然语言处理(NLP)系统,以自动从电子健康记录中提取药物和药物不良事件信息。这项工作是 2018 年 n2c2 药物不良事件和药物提取共享任务的一部分。

材料与方法

新的 NLP 系统实现了基于搜索的结构化预测算法的堆叠泛化,用于概念提取。我们使用各种结构化学习算法训练了 4 个顺序分类器。为了提高准确性,我们创建了一个堆叠集成,由在共享任务训练数据上训练的这些概念提取模型组成。我们实现了一个支持向量机模型来识别相关概念。

结果

在官方测试集上的实验表明,我们的堆叠集成达到了 92.66%的 F1 得分。给定概念的关系提取模型达到了 93.59%的 F1 得分。我们的端到端系统的总体微平均召回率、精度和 F1 得分为 92.52%、81.88%和 86.88%。我们的药物不良事件和药物提取 NLP 系统在参与挑战的团队中排名前 5。

结论

这项研究表明,基于搜索的结构化预测算法的堆叠集成通过有效整合各个分类器的输出,取得了良好的性能,可以为其他临床概念提取任务提供有效的解决方案。

相似文献

1
Ensemble method-based extraction of medication and related information from clinical texts.基于集成方法的临床文本中药物及相关信息的提取。
J Am Med Inform Assoc. 2020 Jan 1;27(1):31-38. doi: 10.1093/jamia/ocz100.
2
A study of deep learning approaches for medication and adverse drug event extraction from clinical text.深度学习方法在从临床文本中提取药物和药物不良事件的研究。
J Am Med Inform Assoc. 2020 Jan 1;27(1):13-21. doi: 10.1093/jamia/ocz063.
3
Adverse drug events and medication relation extraction in electronic health records with ensemble deep learning methods.基于集成深度学习方法的电子健康记录中的药物不良反应和药物关系提取。
J Am Med Inform Assoc. 2020 Jan 1;27(1):39-46. doi: 10.1093/jamia/ocz101.
4
Extracting medications and associated adverse drug events using a natural language processing system combining knowledge base and deep learning.利用结合知识库和深度学习的自然语言处理系统提取药物和相关药物不良事件。
J Am Med Inform Assoc. 2020 Jan 1;27(1):56-64. doi: 10.1093/jamia/ocz141.
5
2018 n2c2 shared task on adverse drug events and medication extraction in electronic health records.2018n2c2 电子健康记录中药物不良反应和药物提取共享任务。
J Am Med Inform Assoc. 2020 Jan 1;27(1):3-12. doi: 10.1093/jamia/ocz166.
6
Identifying relations of medications with adverse drug events using recurrent convolutional neural networks and gradient boosting.利用递归卷积神经网络和梯度提升来识别药物与药物不良事件之间的关系。
J Am Med Inform Assoc. 2020 Jan 1;27(1):65-72. doi: 10.1093/jamia/ocz144.
7
An ensemble of neural models for nested adverse drug events and medication extraction with subwords.基于子词的嵌套不良药物事件和药物提取的神经模型集合。
J Am Med Inform Assoc. 2020 Jan 1;27(1):22-30. doi: 10.1093/jamia/ocz075.
8
Adverse drug event and medication extraction in electronic health records via a cascading architecture with different sequence labeling models and word embeddings.基于具有不同序列标注模型和词向量的级联架构,从电子健康记录中提取药物不良事件和药物信息。
J Am Med Inform Assoc. 2020 Jan 1;27(1):47-55. doi: 10.1093/jamia/ocz120.
9
A Hybrid Model for Family History Information Identification and Relation Extraction: Development and Evaluation of an End-to-End Information Extraction System.一种用于家族病史信息识别与关系抽取的混合模型:一个端到端信息抽取系统的开发与评估
JMIR Med Inform. 2021 Apr 22;9(4):e22797. doi: 10.2196/22797.
10
Recognition of medication information from discharge summaries using ensembles of classifiers.使用分类器集成识别出院小结中的药物信息。
BMC Med Inform Decis Mak. 2012 May 7;12:36. doi: 10.1186/1472-6947-12-36.

引用本文的文献

1
Do LLMs Surpass Encoders for Biomedical NER?大型语言模型在生物医学命名实体识别方面是否超越了编码器?
Proc (IEEE Int Conf Healthc Inform). 2025 Jun;2025:352-358. doi: 10.1109/ICHI64645.2025.00048. Epub 2025 Jul 22.
2
Clinical applications of large language models in medicine and surgery: A scoping review.大型语言模型在医学与外科中的临床应用:一项范围综述
J Int Med Res. 2025 Jul;53(7):3000605251347556. doi: 10.1177/03000605251347556. Epub 2025 Jul 4.
3
Grammar-constrained decoding for structured information extraction with fine-tuned generative models applied to clinical trial abstracts.用于结构化信息提取的语法约束解码,将微调生成模型应用于临床试验摘要。
Front Artif Intell. 2025 Jan 7;7:1406857. doi: 10.3389/frai.2024.1406857. eCollection 2024.
4
Comparing generative and extractive approaches to information extraction from abstracts describing randomized clinical trials.比较从描述随机临床试验的摘要中提取信息的生成式方法和抽取式方法。
J Biomed Semantics. 2024 Apr 23;15(1):3. doi: 10.1186/s13326-024-00305-2.
5
Extracting social determinants of health from clinical note text with classification and sequence-to-sequence approaches.使用分类和序列到序列方法从临床记录文本中提取健康的社会决定因素。
J Am Med Inform Assoc. 2023 Jul 19;30(8):1448-1455. doi: 10.1093/jamia/ocad071.
6
Piloting an automated clinical trial eligibility surveillance and provider alert system based on artificial intelligence and standard data models.基于人工智能和标准数据模型,试点自动化临床试验资格监测和提供方提醒系统。
BMC Med Res Methodol. 2023 Apr 11;23(1):88. doi: 10.1186/s12874-023-01916-6.
7
Adverse drug event detection using natural language processing: A scoping review of supervised learning methods.基于自然语言处理的药物不良反应检测:监督学习方法的范围综述。
PLoS One. 2023 Jan 3;18(1):e0279842. doi: 10.1371/journal.pone.0279842. eCollection 2023.
8
Machine learning approaches for electronic health records phenotyping: a methodical review.基于机器学习的电子健康记录表型分析方法:系统评价
J Am Med Inform Assoc. 2023 Jan 18;30(2):367-381. doi: 10.1093/jamia/ocac216.
9
Predicting Adverse Drug Reactions from Social Media Posts: Data Balance, Feature Selection and Deep Learning.从社交媒体帖子预测药物不良反应:数据平衡、特征选择与深度学习
Healthcare (Basel). 2022 Mar 25;10(4):618. doi: 10.3390/healthcare10040618.
10
Recent Developments in Privacy-Preserving Mining of Clinical Data.临床数据隐私保护挖掘的最新进展
ACM IMS Trans Data Sci. 2021 Nov;2(4). doi: 10.1145/3447774.

本文引用的文献

1
Exploiting Unlabeled Texts with Clustering-based Instance Selection for Medical Relation Classification.基于聚类的实例选择利用未标记文本进行医学关系分类
AMIA Annu Symp Proc. 2018 Apr 16;2017:1060-1069. eCollection 2017.
2
De-identification of psychiatric intake records: Overview of 2016 CEGS N-GRID shared tasks Track 1.去识别精神科入院记录:2016 年 CEGS N-GRID 共享任务跟踪 1 概述。
J Biomed Inform. 2017 Nov;75S:S4-S18. doi: 10.1016/j.jbi.2017.06.011. Epub 2017 Jun 11.
3
Detection of drug-drug interactions through data mining studies using clinical sources, scientific literature and social media.通过临床来源、科学文献和社交媒体的数据挖掘研究来检测药物-药物相互作用。
Brief Bioinform. 2018 Sep 28;19(5):863-877. doi: 10.1093/bib/bbx010.
4
MIMIC-III, a freely accessible critical care database.MIMIC-III,一个免费获取的重症监护数据库。
Sci Data. 2016 May 24;3:160035. doi: 10.1038/sdata.2016.35.
5
A Study of Concept Extraction Across Different Types of Clinical Notes.不同类型临床记录中的概念提取研究。
AMIA Annu Symp Proc. 2015 Nov 5;2015:737-46. eCollection 2015.
6
Annotating longitudinal clinical narratives for de-identification: The 2014 i2b2/UTHealth corpus.用于去识别化的纵向临床记录标注:2014年i2b2/德克萨斯大学健康科学中心语料库
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S20-S29. doi: 10.1016/j.jbi.2015.07.020. Epub 2015 Aug 28.
7
Extracting and standardizing medication information in clinical text - the MedEx-UIMA system.在临床文本中提取和规范用药信息——MedEx-UIMA系统
AMIA Jt Summits Transl Sci Proc. 2014 Apr 7;2014:37-42. eCollection 2014.
8
Cadec: A corpus of adverse drug event annotations.Cadec:一个药物不良事件注释语料库。
J Biomed Inform. 2015 Jun;55:73-81. doi: 10.1016/j.jbi.2015.03.010. Epub 2015 Mar 27.
9
Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features.使用带有词表示特征的结构支持向量机识别医院出院小结中的临床实体。
BMC Med Inform Decis Mak. 2013;13 Suppl 1(Suppl 1):S1. doi: 10.1186/1472-6947-13-S1-S1. Epub 2013 Apr 5.
10
Evaluating temporal relations in clinical text: 2012 i2b2 Challenge.评估临床文本中的时间关系:2012 i2b2 挑战赛。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):806-13. doi: 10.1136/amiajnl-2013-001628. Epub 2013 Apr 5.