• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

临床试验中不良事件的文本挖掘:深度学习方法。

Text Mining of Adverse Events in Clinical Trials: Deep Learning Approach.

作者信息

Chopard Daphne, Treder Matthias S, Corcoran Padraig, Ahmed Nagheen, Johnson Claire, Busse Monica, Spasic Irena

机构信息

School of Computer Science & Informatics, Cardiff University, Cardiff, United Kingdom.

Centre for Trials Research, Cardiff University, Cardiff, United Kingdom.

出版信息

JMIR Med Inform. 2021 Dec 24;9(12):e28632. doi: 10.2196/28632.

DOI:10.2196/28632
PMID:34951601
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8742206/
Abstract

BACKGROUND

Pharmacovigilance and safety reporting, which involve processes for monitoring the use of medicines in clinical trials, play a critical role in the identification of previously unrecognized adverse events or changes in the patterns of adverse events.

OBJECTIVE

This study aims to demonstrate the feasibility of automating the coding of adverse events described in the narrative section of the serious adverse event report forms to enable statistical analysis of the aforementioned patterns.

METHODS

We used the Unified Medical Language System (UMLS) as the coding scheme, which integrates 217 source vocabularies, thus enabling coding against other relevant terminologies such as the International Classification of Diseases-10th Revision, Medical Dictionary for Regulatory Activities, and Systematized Nomenclature of Medicine). We used MetaMap, a highly configurable dictionary lookup software, to identify the mentions of the UMLS concepts. We trained a binary classifier using Bidirectional Encoder Representations from Transformers (BERT), a transformer-based language model that captures contextual relationships, to differentiate between mentions of the UMLS concepts that represented adverse events and those that did not.

RESULTS

The model achieved a high F1 score of 0.8080, despite the class imbalance. This is 10.15 percent points lower than human-like performance but also 17.45 percent points higher than that of the baseline approach.

CONCLUSIONS

These results confirmed that automated coding of adverse events described in the narrative section of serious adverse event reports is feasible. Once coded, adverse events can be statistically analyzed so that any correlations with the trialed medicines can be estimated in a timely fashion.

摘要

背景

药物警戒和安全性报告涉及在临床试验中监测药物使用的过程,在识别先前未被认识的不良事件或不良事件模式变化方面发挥着关键作用。

目的

本研究旨在证明对严重不良事件报告表叙述部分中描述的不良事件进行编码自动化的可行性,以便对上述模式进行统计分析。

方法

我们使用统一医学语言系统(UMLS)作为编码方案,该系统整合了217种源词汇表,从而能够对照其他相关术语进行编码,如《国际疾病分类-第十次修订本》、《药物监管活动医学词典》和《医学系统命名法》。我们使用MetaMap,一种高度可配置的词典查找软件,来识别UMLS概念的提及。我们使用基于变换器的双向编码器表示(BERT)训练了一个二元分类器,BERT是一种捕获上下文关系的基于变换器的语言模型,用于区分代表不良事件的UMLS概念提及和不代表不良事件的提及。

结果

尽管存在类别不平衡,该模型仍获得了0.8080的高F1分数。这比类似人类的表现低10.15个百分点,但也比基线方法高17.45个百分点。

结论

这些结果证实了对严重不良事件报告叙述部分中描述的不良事件进行编码自动化是可行的。一旦编码,不良事件就可以进行统计分析,以便及时估计与受试药物的任何相关性。

相似文献

1
Text Mining of Adverse Events in Clinical Trials: Deep Learning Approach.临床试验中不良事件的文本挖掘:深度学习方法。
JMIR Med Inform. 2021 Dec 24;9(12):e28632. doi: 10.2196/28632.
2
Unified Medical Language System resources improve sieve-based generation and Bidirectional Encoder Representations from Transformers (BERT)-based ranking for concept normalization.统一医学语言系统资源提高了基于筛子的生成和基于双向编码器表示的转换器(BERT)的排名,以实现概念归一化。
J Am Med Inform Assoc. 2020 Oct 1;27(10):1510-1519. doi: 10.1093/jamia/ocaa080.
3
A systematic approach for developing a corpus of patient reported adverse drug events: A case study for SSRI and SNRI medications.构建患者报告的药物不良事件语料库的系统方法:以SSRI和SNRI药物为例的案例研究。
J Biomed Inform. 2019 Feb;90:103091. doi: 10.1016/j.jbi.2018.12.005. Epub 2019 Jan 4.
4
Training a Deep Contextualized Language Model for International Classification of Diseases, 10th Revision Classification via Federated Learning: Model Development and Validation Study.通过联邦学习训练用于国际疾病分类第10次修订版分类的深度情境化语言模型:模型开发与验证研究
JMIR Med Inform. 2022 Nov 10;10(11):e41342. doi: 10.2196/41342.
5
Autonomous International Classification of Diseases Coding Using Pretrained Language Models and Advanced Prompt Learning Techniques: Evaluation of an Automated Analysis System Using Medical Text.使用预训练语言模型和先进提示学习技术的自主国际疾病分类编码:对一个使用医学文本的自动分析系统的评估
JMIR Med Inform. 2025 Jan 6;13:e63020. doi: 10.2196/63020.
6
A Machine Learning Approach with Human-AI Collaboration for Automated Classification of Patient Safety Event Reports: Algorithm Development and Validation Study.一种人机协作的机器学习方法用于患者安全事件报告的自动分类:算法开发与验证研究
JMIR Hum Factors. 2024 Jan 25;11:e53378. doi: 10.2196/53378.
7
Fine-Tuning Bidirectional Encoder Representations From Transformers (BERT)-Based Models on Large-Scale Electronic Health Record Notes: An Empirical Study.基于大规模电子健康记录笔记对基于变换器的双向编码器表征(BERT)模型进行微调:一项实证研究。
JMIR Med Inform. 2019 Sep 12;7(3):e14830. doi: 10.2196/14830.
8
Improving entity recognition using ensembles of deep learning and fine-tuned large language models: A case study on adverse event extraction from VAERS and social media.使用深度学习集成和微调大语言模型改进实体识别:以从VAERS和社交媒体中提取不良事件为例
J Biomed Inform. 2025 Mar;163:104789. doi: 10.1016/j.jbi.2025.104789. Epub 2025 Feb 7.
9
Multifaceted Natural Language Processing Task-Based Evaluation of Bidirectional Encoder Representations From Transformers Models for Bilingual (Korean and English) Clinical Notes: Algorithm Development and Validation.基于转换器模型的双向编码器表示的多方面自然语言处理任务评估在双语(韩语和英语)临床笔记中的应用:算法开发和验证。
JMIR Med Inform. 2024 Oct 30;12:e52897. doi: 10.2196/52897.
10
Algorithmic Identification of Treatment-Emergent Adverse Events From Clinical Notes Using Large Language Models: A Pilot Study in Inflammatory Bowel Disease.利用大型语言模型从临床记录中算法识别治疗相关不良事件:炎症性肠病的初步研究。
Clin Pharmacol Ther. 2024 Jun;115(6):1391-1399. doi: 10.1002/cpt.3226. Epub 2024 Mar 8.

引用本文的文献

1
Introduction to Large Language Models (LLMs) for dementia care and research.用于痴呆症护理和研究的大语言模型介绍
Front Dement. 2024 May 14;3:1385303. doi: 10.3389/frdem.2024.1385303. eCollection 2024.
2
Word sense disambiguation of acronyms in clinical narratives.临床叙述中首字母缩略词的词义消歧
Front Digit Health. 2024 Feb 28;6:1282043. doi: 10.3389/fdgth.2024.1282043. eCollection 2024.
3
A Machine Learning Approach with Human-AI Collaboration for Automated Classification of Patient Safety Event Reports: Algorithm Development and Validation Study.

本文引用的文献

1
Extracting postmarketing adverse events from safety reports in the vaccine adverse event reporting system (VAERS) using deep learning.使用深度学习从疫苗不良事件报告系统(VAERS)中的安全报告中提取上市后不良事件。
J Am Med Inform Assoc. 2021 Jul 14;28(7):1393-1400. doi: 10.1093/jamia/ocab014.
2
The Impact of Pretrained Language Models on Negation and Speculation Detection in Cross-Lingual Medical Text: Comparative Study.预训练语言模型对跨语言医学文本中否定和推测检测的影响:比较研究
JMIR Med Inform. 2020 Dec 3;8(12):e18953. doi: 10.2196/18953.
3
Deep learning approaches for extracting adverse events and indications of dietary supplements from clinical text.
一种人机协作的机器学习方法用于患者安全事件报告的自动分类:算法开发与验证研究
JMIR Hum Factors. 2024 Jan 25;11:e53378. doi: 10.2196/53378.
4
Automatic Extraction of Comprehensive Drug Safety Information from Adverse Drug Event Narratives in the Korea Adverse Event Reporting System Using Natural Language Processing Techniques.利用自然语言处理技术从韩国不良事件报告系统的不良药物事件叙述中自动提取全面的药物安全信息。
Drug Saf. 2023 Aug;46(8):781-795. doi: 10.1007/s40264-023-01323-2. Epub 2023 Jun 17.
深度学习方法从临床文本中提取膳食补充剂的不良事件和适应证。
J Am Med Inform Assoc. 2021 Mar 1;28(3):569-577. doi: 10.1093/jamia/ocaa218.
4
Clinical Text Data in Machine Learning: Systematic Review.机器学习中的临床文本数据:系统综述
JMIR Med Inform. 2020 Mar 31;8(3):e17984. doi: 10.2196/17984.
5
Does BERT need domain adaptation for clinical negation detection?BERT 是否需要进行领域适应来进行临床否定检测?
J Am Med Inform Assoc. 2020 Apr 1;27(4):584-591. doi: 10.1093/jamia/ocaa001.
6
Cohort Selection for Clinical Trials From Longitudinal Patient Records: Text Mining Approach.基于纵向患者记录的临床试验队列选择:文本挖掘方法
JMIR Med Inform. 2019 Oct 31;7(4):e15980. doi: 10.2196/15980.
7
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
8
Detecting Potential Adverse Drug Reactions Using a Deep Neural Network Model.使用深度神经网络模型检测潜在药物不良反应
J Med Internet Res. 2019 Feb 6;21(2):e11016. doi: 10.2196/11016.
9
Normalizing Spontaneous Reports Into MedDRA: Some Experiments With MagiCoder.将自发报告标准化为 MedDRA:MagiCoder 的一些实验。
IEEE J Biomed Health Inform. 2019 Jan;23(1):95-102. doi: 10.1109/JBHI.2018.2861213. Epub 2018 Jul 30.
10
Natural Language Processing and Its Implications for the Future of Medication Safety: A Narrative Review of Recent Advances and Challenges.自然语言处理及其对药物安全未来的影响:对近期进展和挑战的叙述性综述。
Pharmacotherapy. 2018 Aug;38(8):822-841. doi: 10.1002/phar.2151. Epub 2018 Jul 22.