快速训练的分类模型检测药物不良事件。

Detecting Adverse Drug Events with Rapidly Trained Classification Models.

机构信息

Health Fidelity, San Mateo, CA, USA.

VA Salt Lake City Health Care System, University of Utah, Salt Lake City, UT, USA.

出版信息

Drug Saf. 2019 Jan;42(1):147-156. doi: 10.1007/s40264-018-0763-y.

DOI:10.1007/s40264-018-0763-y

PMID:30649737

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6373386/

Abstract

INTRODUCTION

Identifying occurrences of medication side effects and adverse drug events (ADEs) is an important and challenging task because they are frequently only mentioned in clinical narrative and are not formally reported.

METHODS

We developed a natural language processing (NLP) system that aims to identify mentions of symptoms and drugs in clinical notes and label the relationship between the mentions as indications or ADEs. The system leverages an existing word embeddings model with induced word clusters for dimensionality reduction. It employs a conditional random field (CRF) model for named entity recognition (NER) and a random forest model for relation extraction (RE).

RESULTS

Final performance of each model was evaluated separately and then combined on a manually annotated evaluation set. The micro-averaged F1 score was 80.9% for NER, 88.1% for RE, and 61.2% for the integrated systems. Outputs from our systems were submitted to the NLP Challenges for Detecting Medication and Adverse Drug Events from Electronic Health Records (MADE 1.0) competition (Yu et al. in http://bio-nlp.org/index.php/projects/39-nlp-challenges , 2018). System performance was evaluated in three tasks (NER, RE, and complete system) with multiple teams submitting output from their systems for each task. Our RE system placed first in Task 2 of the challenge and our integrated system achieved third place in Task 3.

CONCLUSION

Adding to the growing number of publications that utilize NLP to detect occurrences of ADEs, our study illustrates the benefits of employing innovative feature engineering.

摘要

简介

识别药物副作用和药物不良事件（ADE）的发生是一项重要且具有挑战性的任务，因为它们通常仅在临床叙述中提及，并未正式报告。

方法

我们开发了一种自然语言处理（NLP）系统，旨在识别临床记录中症状和药物的提及，并将提及之间的关系标记为指示或 ADE。该系统利用现有的词嵌入模型和诱导的词聚类进行降维。它采用条件随机场（CRF）模型进行命名实体识别（NER），并采用随机森林模型进行关系提取（RE）。

结果

分别评估每个模型的最终性能，然后在手动标注的评估集上进行组合。NER 的微平均 F1 分数为 80.9%，RE 的微平均 F1 分数为 88.1%，集成系统的微平均 F1 分数为 61.2%。我们系统的输出已提交给从电子健康记录中检测药物和药物不良事件的自然语言处理挑战赛（MADE 1.0）（Yu 等人，http://bio-nlp.org/index.php/projects/39-nlp-challenges, 2018）。系统性能在三个任务（NER、RE 和完整系统）中进行评估，多个团队为每个任务提交其系统的输出。我们的 RE 系统在挑战的任务 2 中排名第一，我们的集成系统在任务 3 中排名第三。

结论

除了越来越多的利用 NLP 检测 ADE 发生的出版物外，我们的研究还说明了采用创新特征工程的好处。

相似文献

Detecting Adverse Drug Events with Rapidly Trained Classification Models.快速训练的分类模型检测药物不良事件。

Drug Saf. 2019 Jan;42(1):147-156. doi: 10.1007/s40264-018-0763-y.

Overview of the First Natural Language Processing Challenge for Extracting Medication, Indication, and Adverse Drug Events from Electronic Health Record Notes (MADE 1.0).从电子健康记录中提取药物、适应症和药物不良事件的自然语言处理挑战赛概述（MADE 1.0）。

Drug Saf. 2019 Jan;42(1):99-111. doi: 10.1007/s40264-018-0762-z.

MADEx: A System for Detecting Medications, Adverse Drug Events, and Their Relations from Clinical Notes.MADEx：从临床记录中检测药物、药物不良事件及其关系的系统。

Drug Saf. 2019 Jan;42(1):123-133. doi: 10.1007/s40264-018-0761-0.

A systematic review of natural language processing for classification tasks in the field of incident reporting and adverse event analysis.自然语言处理在事件报告和不良事件分析领域分类任务中的系统评价

Int J Med Inform. 2019 Dec;132:103971. doi: 10.1016/j.ijmedinf.2019.103971. Epub 2019 Oct 5.

Adverse Drug Event Detection from Electronic Health Records Using Hierarchical Recurrent Neural Networks with Dual-Level Embedding.基于具有双层嵌入的层次递归神经网络从电子健康记录中检测药物不良反应。

Drug Saf. 2019 Jan;42(1):113-122. doi: 10.1007/s40264-018-0765-9.

A study of deep learning approaches for medication and adverse drug event extraction from clinical text.深度学习方法在从临床文本中提取药物和药物不良事件的研究。

J Am Med Inform Assoc. 2020 Jan 1;27(1):13-21. doi: 10.1093/jamia/ocz063.

Leveraging Natural Language Processing and Machine Learning Methods for Adverse Drug Event Detection in Electronic Health/Medical Records: A Scoping Review.利用自然语言处理和机器学习方法在电子健康/医疗记录中进行药物不良事件检测：一项范围综述

Drug Saf. 2025 Apr;48(4):321-337. doi: 10.1007/s40264-024-01505-6. Epub 2025 Jan 9.

Automated System to Capture Patient Symptoms From Multitype Japanese Clinical Texts: Retrospective Study.从多类型日本临床文本中自动捕获患者症状的系统：回顾性研究。

JMIR Med Inform. 2024 Sep 24;12:e58977. doi: 10.2196/58977.

Extracting adverse drug events from clinical Notes: A systematic review of approaches used.从临床记录中提取药物不良事件：对所用方法的系统评价

J Biomed Inform. 2024 Mar;151:104603. doi: 10.1016/j.jbi.2024.104603. Epub 2024 Feb 6.

Deep learning approaches for extracting adverse events and indications of dietary supplements from clinical text.深度学习方法从临床文本中提取膳食补充剂的不良事件和适应证。

J Am Med Inform Assoc. 2021 Mar 1;28(3):569-577. doi: 10.1093/jamia/ocaa218.

引用本文的文献

A refined set of RxNorm drug names for enhancing unstructured data analysis in drug safety surveillance.一组经过优化的RxNorm药物名称，用于加强药物安全监测中的非结构化数据分析。

Exp Biol Med (Maywood). 2025 May 2;250:10374. doi: 10.3389/ebm.2025.10374. eCollection 2025.

Effectiveness of Transformer-Based Large Language Models in Identifying Adverse Drug Reaction Relations from Unstructured Discharge Summaries in Singapore.基于Transformer的大语言模型在识别新加坡非结构化出院小结中的药物不良反应关系方面的有效性。

Drug Saf. 2025 Jun;48(6):667-677. doi: 10.1007/s40264-025-01525-w. Epub 2025 Feb 21.

Artificial intelligence-enabled safety monitoring in Alzheimer's disease clinical trials.阿尔茨海默病临床试验中基于人工智能的安全监测

J Prev Alzheimers Dis. 2025 Jan;12(1):100002. doi: 10.1016/j.tjpad.2024.100002. Epub 2025 Jan 1.

Detection of Patient-Level Immunotherapy-Related Adverse Events (irAEs) from Clinical Narratives of Electronic Health Records: A High-Sensitivity Artificial Intelligence Model.从电子健康记录的临床叙述中检测患者层面的免疫治疗相关不良事件（irAEs）：一种高灵敏度人工智能模型。

Pragmat Obs Res. 2024 Dec 20;15:243-252. doi: 10.2147/POR.S468253. eCollection 2024.

An innovative method to strengthen evidence for potential drug safety signals using Electronic Health Records.利用电子健康记录加强潜在药物安全信号证据的创新方法。

J Med Syst. 2024 May 16;48(1):51. doi: 10.1007/s10916-024-02070-2.

Natural language processing with machine learning methods to analyze unstructured patient-reported outcomes derived from electronic health records: A systematic review.使用机器学习方法进行自然语言处理，以分析来自电子健康记录的非结构化患者报告结局：系统评价。

Artif Intell Med. 2023 Dec;146:102701. doi: 10.1016/j.artmed.2023.102701. Epub 2023 Nov 1.

Artificial intelligence in the field of pharmacy practice: A literature review.药学实践领域中的人工智能：一篇文献综述。

Explor Res Clin Soc Pharm. 2023 Oct 21;12:100346. doi: 10.1016/j.rcsop.2023.100346. eCollection 2023 Dec.

A cross-institutional evaluation on breast cancer phenotyping NLP algorithms on electronic health records.一项关于电子健康记录中乳腺癌表型自然语言处理算法的跨机构评估。

Comput Struct Biotechnol J. 2023 Aug 22;22:32-40. doi: 10.1016/j.csbj.2023.08.018. eCollection 2023.

Artificial Intelligence in Pharmaceutical Technology and Drug Delivery Design.制药技术与药物递送设计中的人工智能

Pharmaceutics. 2023 Jul 10;15(7):1916. doi: 10.3390/pharmaceutics15071916.

Generalizability of machine learning methods in detecting adverse drug events from clinical narratives in electronic medical records.机器学习方法在从电子病历中的临床叙述中检测药物不良事件方面的可推广性。

Front Pharmacol. 2023 Jul 12;14:1218679. doi: 10.3389/fphar.2023.1218679. eCollection 2023.

本文引用的文献

Drug Saf. 2019 Jan;42(1):99-111. doi: 10.1007/s40264-018-0762-z.

Clinical Named Entity Recognition Using Deep Learning Models.使用深度学习模型的临床命名实体识别

AMIA Annu Symp Proc. 2018 Apr 16;2017:1812-1819. eCollection 2017.

Opportunities and obstacles for deep learning in biology and medicine.深度学习在生物学和医学中的机遇与挑战。

J R Soc Interface. 2018 Apr;15(141). doi: 10.1098/rsif.2017.0387.

Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition.用于健康领域命名实体识别的具有专用词嵌入的递归神经网络。

J Biomed Inform. 2017 Dec;76:102-109. doi: 10.1016/j.jbi.2017.11.007. Epub 2017 Nov 13.

SSEL-ADE: A semi-supervised ensemble learning framework for extracting adverse drug events from social media.SSEL-ADE：一种从社交媒体中提取不良药物事件的半监督集成学习框架。

Artif Intell Med. 2018 Jan;84:34-49. doi: 10.1016/j.artmed.2017.10.003. Epub 2017 Oct 27.

Natural Language Processing for EHR-Based Pharmacovigilance: A Structured Review.基于电子健康记录的药物警戒中的自然语言处理：系统综述。

Drug Saf. 2017 Nov;40(11):1075-1089. doi: 10.1007/s40264-017-0558-6.

Empirical estimation of under-reporting in the U.S. Food and Drug Administration Adverse Event Reporting System (FAERS).美国食品药品监督管理局不良事件报告系统（FAERS）中漏报情况的实证估计。

Expert Opin Drug Saf. 2017 Jul;16(7):761-767. doi: 10.1080/14740338.2017.1323867. Epub 2017 May 9.

Challenges in adapting existing clinical natural language processing systems to multiple, diverse health care settings.将现有的临床自然语言处理系统应用于多种不同医疗环境时所面临的挑战。

J Am Med Inform Assoc. 2017 Sep 1;24(5):986-991. doi: 10.1093/jamia/ocx039.

Structured prediction models for RNN based sequence labeling in clinical text.用于临床文本中基于循环神经网络的序列标注的结构化预测模型。

Proc Conf Empir Methods Nat Lang Process. 2016 Nov;2016:856-865. doi: 10.18653/v1/d16-1082.

Bidirectional RNN for Medical Event Detection in Electronic Health Records.用于电子健康记录中医疗事件检测的双向循环神经网络

Proc Conf. 2016 Jun;2016:473-482. doi: 10.18653/v1/n16-1056.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验