• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用大语言模型从处方药标签中提取药物安全信息。

Leveraging Large Language Models in Extracting Drug Safety Information from Prescription Drug Labels.

作者信息

Gisladottir Undina, Zietz Michael, Kivelson Sophia, Tanaka Yutaro, Sirdeshmukh Gaurav, Brown Kathleen LaRow, Tatonetti Nicholas P

机构信息

Department of Biomedical Informatics, Columbia University, New York, NY, USA.

Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA.

出版信息

Drug Saf. 2025 Sep 2. doi: 10.1007/s40264-025-01594-x.

DOI:10.1007/s40264-025-01594-x
PMID:40892374
Abstract

INTRODUCTION

Adverse drug reactions (ADRs), including those resulting from drug interactions, remain a leading cause of morbidity and mortality. Structured product labels (SPLs) serve as a primary source for drug safety information. Having machine-readable product labels, including adverse reactions (ARs) and drug interactions, readily available would allow researchers to streamline medication safety studies. However, extracting this information is complex and requires the use of natural language processing (NLP) methods.

OBJECTIVE

In this study, we explored the application of generative language models in the extraction of drug safety information from SPLs.

METHODS

We compared multiple generative LLMs (GPT, Llama, and Mixtral) to two baseline methods in the task of extracting adverse reactions (ARs) from SPLs. We explored various factors, such as prompting strategies and term complexity, that impact the performance of these models in the extraction of ARs. Finally, we explored the generative models' capacity to extract drug interactions from a separate section of SPLs without additional fine-tuning or training, demonstrating their flexibility and adaptability for information retrieval.

RESULTS

We found that generative language models, specifically GPT-4, are able to match or exceed the performance of previous state-of-the-art models without additional training or fine-tuning. Additionally, we found that the specific SPL section, surrounding context, and complexity of the AR term impacted the extraction performance. Finally, we demonstrated the generalizability of these models by applying them to a separate task of extracting drug names from the drug interaction section where curated training data are not available.

CONCLUSION

Generative language models demonstrate significant potential for automating drug safety information extraction from SPLs, offering a promising avenue for improving post-market surveillance and reducing ADRs. Future work should focus on refining prompting strategies and expanding the models' capabilities to handle increasingly complex and nuanced drug safety information.

摘要

引言

药物不良反应(ADR),包括药物相互作用导致的不良反应,仍然是发病和死亡的主要原因。结构化产品标签(SPL)是药物安全信息的主要来源。拥有机器可读的产品标签,包括不良反应(AR)和药物相互作用信息,将使研究人员能够简化药物安全性研究。然而,提取这些信息很复杂,需要使用自然语言处理(NLP)方法。

目的

在本研究中,我们探索了生成式语言模型在从SPL中提取药物安全信息方面的应用。

方法

我们在从SPL中提取不良反应(AR)的任务中,将多个生成式语言模型(GPT、Llama和Mixtral)与两种基线方法进行了比较。我们探讨了各种因素,如提示策略和术语复杂性,这些因素会影响这些模型在提取AR方面的性能。最后,我们探索了生成式模型在无需额外微调或训练的情况下,从SPL的单独部分提取药物相互作用的能力,展示了它们在信息检索方面的灵活性和适应性。

结果

我们发现,生成式语言模型,特别是GPT-4,能够在无需额外训练或微调的情况下,达到或超过先前最先进模型的性能。此外,我们发现AR术语的特定SPL部分、周围上下文和复杂性会影响提取性能。最后,我们通过将这些模型应用于从药物相互作用部分提取药物名称的单独任务(此处没有经过整理的训练数据),证明了这些模型的通用性。

结论

生成式语言模型在自动从SPL中提取药物安全信息方面显示出巨大潜力,为改善上市后监测和减少ADR提供了一条有前景的途径。未来的工作应集中在完善提示策略和扩展模型处理日益复杂和细微的药物安全信息的能力上。

相似文献

1
Leveraging Large Language Models in Extracting Drug Safety Information from Prescription Drug Labels.利用大语言模型从处方药标签中提取药物安全信息。
Drug Saf. 2025 Sep 2. doi: 10.1007/s40264-025-01594-x.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Aligning Large Language Models for Enhancing Psychiatric Interviews Through Symptom Delineation and Summarization: Pilot Study.通过症状描述和总结调整大型语言模型以增强精神病学访谈:初步研究。
JMIR Form Res. 2024 Oct 24;8:e58418. doi: 10.2196/58418.
4
Automated Extraction of Mortality Information From Publicly Available Sources Using Large Language Models: Development and Evaluation Study.使用大语言模型从公开可用来源自动提取死亡率信息:开发与评估研究
J Med Internet Res. 2025 Aug 18;27:e71113. doi: 10.2196/71113.
5
Use of Large Language Models to Classify Epidemiological Characteristics in Synthetic and Real-World Social Media Posts About Conjunctivitis Outbreaks: Infodemiology Study.利用大语言模型对合成及真实世界社交媒体上有关结膜炎爆发的帖子中的流行病学特征进行分类:信息流行病学研究
J Med Internet Res. 2025 Jul 2;27:e65226. doi: 10.2196/65226.
6
Extracting epilepsy-related information from unstructured clinic letters using large language models.使用大语言模型从非结构化临床信件中提取癫痫相关信息。
Epilepsia. 2025 Jul 10. doi: 10.1111/epi.18475.
7
A dataset and benchmark for hospital course summarization with adapted large language models.一个用于医院病程总结的数据集和基准测试,采用了适配的大语言模型。
J Am Med Inform Assoc. 2025 Mar 1;32(3):470-479. doi: 10.1093/jamia/ocae312.
8
Can open source large language models be used for tumor documentation in Germany?-An evaluation on urological doctors' notes.在德国,开源大语言模型可用于肿瘤记录吗?——对泌尿科医生笔记的评估
BioData Min. 2025 Jul 24;18(1):48. doi: 10.1186/s13040-025-00463-8.
9
Gonadotropin-releasing hormone (GnRH) analogues for premenstrual syndrome (PMS).用于经前综合征(PMS)的促性腺激素释放激素(GnRH)类似物。
Cochrane Database Syst Rev. 2025 Jun 10;6(6):CD011330. doi: 10.1002/14651858.CD011330.pub2.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

本文引用的文献

1
Artificial intelligence for optimizing recruitment and retention in clinical trials: a scoping review.人工智能在临床试验中优化招募和保留的应用:范围综述。
J Am Med Inform Assoc. 2024 Nov 1;31(11):2749-2759. doi: 10.1093/jamia/ocae243.
2
Description and Validation of a Novel AI Tool, LabelComp, for the Identification of Adverse Event Changes in FDA Labeling.描述和验证一种新的人工智能工具 LabelComp,用于识别 FDA 标签中不良事件变化。
Drug Saf. 2024 Dec;47(12):1265-1274. doi: 10.1007/s40264-024-01468-8. Epub 2024 Jul 31.
3
Adapted large language models can outperform medical experts in clinical text summarization.
经过改编的大型语言模型在临床文本总结方面的表现优于医学专家。
Nat Med. 2024 Apr;30(4):1134-1142. doi: 10.1038/s41591-024-02855-5. Epub 2024 Feb 27.
4
PharmBERT: a domain-specific BERT model for drug labels.PharmBERT:一种针对药物标签的特定领域 BERT 模型。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad226.
5
Clinical concept and relation extraction using prompt-based machine reading comprehension.基于提示的机器阅读理解的临床概念和关系抽取。
J Am Med Inform Assoc. 2023 Aug 18;30(9):1486-1493. doi: 10.1093/jamia/ocad107.
6
Development and validation of an automated basal cell carcinoma histopathology information extraction system using natural language processing.使用自然语言处理技术开发并验证一种自动化的基底细胞癌组织病理学信息提取系统。
Front Surg. 2022 Aug 24;9:870494. doi: 10.3389/fsurg.2022.870494. eCollection 2022.
7
A smart hospital-driven approach to precision pharmacovigilance.一种由智能医院驱动的精准药物警戒方法。
Trends Pharmacol Sci. 2022 Jun;43(6):473-481. doi: 10.1016/j.tips.2022.03.009. Epub 2022 Apr 27.
8
Few-shot learning creates predictive models of drug response that translate from high-throughput screens to individual patients.少样本学习创建了药物反应的预测模型,这些模型可以从高通量筛选转化到个体患者身上。
Nat Cancer. 2021 Feb;2(2):233-244. doi: 10.1038/s43018-020-00169-2. Epub 2021 Jan 25.
9
2018 n2c2 shared task on adverse drug events and medication extraction in electronic health records.2018n2c2 电子健康记录中药物不良反应和药物提取共享任务。
J Am Med Inform Assoc. 2020 Jan 1;27(1):3-12. doi: 10.1093/jamia/ocz166.
10
Identifying relations of medications with adverse drug events using recurrent convolutional neural networks and gradient boosting.利用递归卷积神经网络和梯度提升来识别药物与药物不良事件之间的关系。
J Am Med Inform Assoc. 2020 Jan 1;27(1):65-72. doi: 10.1093/jamia/ocz144.