• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

开发大型语言模型以检测X上帖子中的药物不良事件。

Developing large language models to detect adverse drug events in posts on x.

作者信息

Deng Yu, Xing Yunzhao, Quach Jason, Chen Xiaotian, Wu Xiaoqiang, Zhang Yafei, Moureaud Charlotte, Yu Mengjia, Zhao Yujie, Wang Li, Zhong Sheng

机构信息

Data & Statistical Sciences, AbbVie Inc, North Chicago, Illinois, USA.

Computer Science & Engineering, University of California San Diego, La Jolla, California, USA.

出版信息

J Biopharm Stat. 2024 Sep 20:1-12. doi: 10.1080/10543406.2024.2403442.

DOI:10.1080/10543406.2024.2403442
PMID:39300965
Abstract

Adverse drug events (ADEs) are one of the major causes of hospital admissions and are associated with increased morbidity and mortality. Post-marketing ADE identification is one of the most important phases of drug safety surveillance. Traditionally, data sources for post-marketing surveillance mainly come from spontaneous reporting system such as the Food and Drug Administration Adverse Event Reporting System (FAERS). Social media data such as posts on X (formerly Twitter) contain rich patient and medication information and could potentially accelerate drug surveillance research. However, ADE information in social media data is usually locked in the text, making it difficult to be employed by traditional statistical approaches. In recent years, large language models (LLMs) have shown promise in many natural language processing tasks. In this study, we developed several LLMs to perform ADE classification on X data. We fine-tuned various LLMs including BERT-base, Bio_ClinicalBERT, RoBERTa, and RoBERTa-large. We also experimented ChatGPT few-shot prompting and ChatGPT fine-tuned on the whole training data. We then evaluated the model performance based on sensitivity, specificity, negative predictive value, positive predictive value, accuracy, F1-measure, and area under the ROC curve. Our results showed that RoBERTa-large achieved the best F1-measure (0.8) among all models followed by ChatGPT fine-tuned model with F1-measure of 0.75. Our feature importance analysis based on 1200 random samples and RoBERTa-Large showed the most important features are as follows: "withdrawals"/"withdrawal", "dry", "dealing", "mouth", and "paralysis". The good model performance and clinically relevant features show the potential of LLMs in augmenting ADE detection for post-marketing drug safety surveillance.

摘要

药物不良事件(ADEs)是导致住院的主要原因之一,且与发病率和死亡率的增加相关。上市后ADE的识别是药物安全监测最重要的阶段之一。传统上,上市后监测的数据来源主要来自自发报告系统,如美国食品药品监督管理局不良事件报告系统(FAERS)。社交媒体数据,如X(前身为Twitter)上的帖子,包含丰富的患者和用药信息,可能会加速药物监测研究。然而,社交媒体数据中的ADE信息通常隐藏在文本中,传统统计方法难以利用。近年来,大语言模型(LLMs)在许多自然语言处理任务中显示出前景。在本研究中,我们开发了几个大语言模型来对X数据进行ADE分类。我们对包括BERT-base、Bio_ClinicalBERT、RoBERTa和RoBERTa-large在内的各种大语言模型进行了微调。我们还试验了ChatGPT的少样本提示以及在整个训练数据上进行微调的ChatGPT。然后,我们基于敏感性、特异性、阴性预测值、阳性预测值、准确性、F1分数和ROC曲线下面积评估了模型性能。我们的结果表明,RoBERTa-large在所有模型中实现了最佳的F1分数(0.8),其次是微调后的ChatGPT模型,F1分数为0.75。我们基于1200个随机样本和RoBERTa-Large的特征重要性分析表明,最重要的特征如下:“停药”/“撤药”、“干燥”、“处理”、“口腔”和“麻痹”。良好的模型性能和临床相关特征表明大语言模型在加强上市后药物安全监测的ADE检测方面具有潜力。

相似文献

1
Developing large language models to detect adverse drug events in posts on x.开发大型语言模型以检测X上帖子中的药物不良事件。
J Biopharm Stat. 2024 Sep 20:1-12. doi: 10.1080/10543406.2024.2403442.
2
Evaluating large language models for health-related text classification tasks with public social media data.利用公共社交媒体数据评估用于健康相关文本分类任务的大型语言模型。
J Am Med Inform Assoc. 2024 Oct 1;31(10):2181-2189. doi: 10.1093/jamia/ocae210.
3
Role of Natural Language Processing in Automatic Detection of Unexpected Findings in Radiology Reports: A Comparative Study of RoBERTa, CNN, and ChatGPT.自然语言处理在放射学报告中自动检测意外发现的作用:RoBERTa、CNN 和 ChatGPT 的比较研究。
Acad Radiol. 2024 Dec;31(12):4833-4842. doi: 10.1016/j.acra.2024.07.057. Epub 2024 Aug 9.
4
Momentary Depressive Feeling Detection Using X (Formerly Twitter) Data: Contextual Language Approach.使用X(原推特)数据检测瞬间抑郁情绪:上下文语言方法。
JMIR AI. 2023 Nov 27;2:e49531. doi: 10.2196/49531.
5
Model tuning or prompt Tuning? a study of large language models for clinical concept and relation extraction.模型调优还是提示调优?大型语言模型在临床概念和关系抽取中的应用研究。
J Biomed Inform. 2024 May;153:104630. doi: 10.1016/j.jbi.2024.104630. Epub 2024 Mar 26.
6
Sample Size Considerations for Fine-Tuning Large Language Models for Named Entity Recognition Tasks: Methodological Study.用于命名实体识别任务的大语言模型微调的样本量考量:方法学研究
JMIR AI. 2024 May 16;3:e52095. doi: 10.2196/52095.
7
Evaluating the ChatGPT family of models for biomedical reasoning and classification.评估ChatGPT系列模型在生物医学推理和分类方面的表现。
J Am Med Inform Assoc. 2024 Apr 3;31(4):940-948. doi: 10.1093/jamia/ocad256.
8
Text classification models for the automatic detection of nonmedical prescription medication use from social media.社交媒体中非医疗处方药物使用的自动检测的文本分类模型。
BMC Med Inform Decis Mak. 2021 Jan 26;21(1):27. doi: 10.1186/s12911-021-01394-0.
9
DeepADEMiner: a deep learning pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter.DeepADEMiner:一种用于从 Twitter 上提取和规范化药物不良事件提及的深度学习药物警戒管道。
J Am Med Inform Assoc. 2021 Sep 18;28(10):2184-2192. doi: 10.1093/jamia/ocab114.
10
An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing: Algorithm Development and Validation Study.零样本临床自然语言处理中大型语言模型提示策略的实证评估:算法开发与验证研究
JMIR Med Inform. 2024 Apr 8;12:e55318. doi: 10.2196/55318.

引用本文的文献

1
Large Language Models for Adverse Drug Events: A Clinical Perspective.用于药物不良事件的大语言模型:临床视角
J Clin Med. 2025 Aug 4;14(15):5490. doi: 10.3390/jcm14155490.
2
Developing electronic health records as a source of real-world data for veterinary pharmacoepidemiology.开发电子健康记录作为兽医药物流行病学真实世界数据的来源。
Front Vet Sci. 2025 Apr 1;12:1550468. doi: 10.3389/fvets.2025.1550468. eCollection 2025.