• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于预测药物退出市场的预训练转换器模型。

Pretrained transformer models for predicting the withdrawal of drugs from the market.

机构信息

Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, P.O.B. 653, Beer-Sheva, 8410501, Israel.

出版信息

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad519.

DOI:10.1093/bioinformatics/btad519
PMID:37610328
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10469107/
Abstract

MOTIVATION

The process of drug discovery is notoriously complex, costing an average of 2.6 billion dollars and taking ∼13 years to bring a new drug to the market. The success rate for new drugs is alarmingly low (around 0.0001%), and severe adverse drug reactions (ADRs) frequently occur, some of which may even result in death. Early identification of potential ADRs is critical to improve the efficiency and safety of the drug development process.

RESULTS

In this study, we employed pretrained large language models (LLMs) to predict the likelihood of a drug being withdrawn from the market due to safety concerns. Our method achieved an area under the curve (AUC) of over 0.75 through cross-database validation, outperforming classical machine learning models and graph-based models. Notably, our pretrained LLMs successfully identified over 50% drugs that were subsequently withdrawn, when predictions were made on a subset of drugs with inconsistent labeling between the training and test sets.

AVAILABILITY AND IMPLEMENTATION

The code and datasets are available at https://github.com/eyalmazuz/DrugWithdrawn.

摘要

动机

药物发现的过程非常复杂,平均成本为 26 亿美元,需要大约 13 年的时间才能将一种新药推向市场。新药的成功率低得惊人(约为 0.0001%),而且经常会出现严重的药物不良反应(ADR),其中一些甚至可能导致死亡。早期识别潜在的 ADR 对于提高药物开发过程的效率和安全性至关重要。

结果

在这项研究中,我们使用预训练的大型语言模型(LLMs)来预测药物因安全问题而退出市场的可能性。我们的方法通过跨数据库验证实现了超过 0.75 的曲线下面积(AUC),优于经典的机器学习模型和基于图的模型。值得注意的是,当在训练集和测试集之间标签不一致的药物子集上进行预测时,我们的预训练 LLM 成功识别出超过 50%的随后被撤出的药物。

可用性和实施

代码和数据集可在 https://github.com/eyalmazuz/DrugWithdrawn 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2558/10469107/ec2c75ffd173/btad519f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2558/10469107/ec2c75ffd173/btad519f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2558/10469107/ec2c75ffd173/btad519f1.jpg

相似文献

1
Pretrained transformer models for predicting the withdrawal of drugs from the market.用于预测药物退出市场的预训练转换器模型。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad519.
2
Predicting serious rare adverse reactions of novel chemicals.预测新型化学物质的严重罕见不良反应。
Bioinformatics. 2018 Aug 15;34(16):2835-2842. doi: 10.1093/bioinformatics/bty193.
3
BiMPADR: A Deep Learning Framework for Predicting Adverse Drug Reactions in New Drugs.BiMPADR:一种用于预测新药不良反应的深度学习框架。
Molecules. 2024 Apr 14;29(8):1784. doi: 10.3390/molecules29081784.
4
Data-driven prediction of adverse drug reactions induced by drug-drug interactions.药物相互作用引起的药物不良反应的数据驱动预测。
BMC Pharmacol Toxicol. 2017 Jun 8;18(1):44. doi: 10.1186/s40360-017-0153-6.
5
Model tuning or prompt Tuning? a study of large language models for clinical concept and relation extraction.模型调优还是提示调优?大型语言模型在临床概念和关系抽取中的应用研究。
J Biomed Inform. 2024 May;153:104630. doi: 10.1016/j.jbi.2024.104630. Epub 2024 Mar 26.
6
Towards early detection of adverse drug reactions: combining pre-clinical drug structures and post-market safety reports.迈向药物不良反应的早期检测:结合临床前药物结构和上市后安全报告。
BMC Med Inform Decis Mak. 2019 Dec 18;19(1):279. doi: 10.1186/s12911-019-0999-1.
7
A dataset of 200 structured product labels annotated for adverse drug reactions.一个标注了 200 个结构产品标签的药物不良反应数据集。
Sci Data. 2018 Jan 30;5:180001. doi: 10.1038/sdata.2018.1.
8
Knowledge graph prediction of unknown adverse drug reactions and validation in electronic health records.知识图谱预测未知药物不良反应并在电子健康记录中验证。
Sci Rep. 2017 Nov 27;7(1):16416. doi: 10.1038/s41598-017-16674-x.
9
Predicting adverse drug reactions using publicly available PubChem BioAssay data.利用公开的 PubChem 生物测定数据预测药物不良反应。
Clin Pharmacol Ther. 2011 Jul;90(1):90-9. doi: 10.1038/clpt.2011.81. Epub 2011 May 25.
10
Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugs.利用药物的化学、生物和表型特性进行大规模药物不良反应预测。
J Am Med Inform Assoc. 2012 Jun;19(e1):e28-35. doi: 10.1136/amiajnl-2011-000699.

引用本文的文献

1
Protein-ligand binding affinity prediction using multi-instance learning with docking structures.利用对接结构的多实例学习进行蛋白质-配体结合亲和力预测。
Front Pharmacol. 2025 Jan 3;15:1518875. doi: 10.3389/fphar.2024.1518875. eCollection 2024.
2
Withdrawn 2.0-update on withdrawn drugs with pharmacovigilance data.撤回药品的 2.0 更新版,附有药物警戒数据。
Nucleic Acids Res. 2024 Jan 5;52(D1):D1503-D1507. doi: 10.1093/nar/gkad1017.

本文引用的文献

1
Unified Deep Learning Model for Multitask Reaction Predictions with Explanation.具有解释功能的多任务反应预测统一深度学习模型。
J Chem Inf Model. 2022 Mar 28;62(6):1376-1387. doi: 10.1021/acs.jcim.1c01467. Epub 2022 Mar 10.
2
MolGPT: Molecular Generation Using a Transformer-Decoder Model.MolGPT:基于 Transformer-Decoder 模型的分子生成。
J Chem Inf Model. 2022 May 9;62(9):2064-2076. doi: 10.1021/acs.jcim.1c00600. Epub 2021 Oct 25.
3
Analyzing Learned Molecular Representations for Property Prediction.分析用于性质预测的学习分子表示。
J Chem Inf Model. 2019 Aug 26;59(8):3370-3388. doi: 10.1021/acs.jcim.9b00237. Epub 2019 Aug 13.
4
ChEMBL: towards direct deposition of bioassay data.ChEMBL:致力于直接生成生物测定数据。
Nucleic Acids Res. 2019 Jan 8;47(D1):D930-D940. doi: 10.1093/nar/gky1075.
5
PubChem 2019 update: improved access to chemical data.PubChem 2019 年更新:改善化学数据获取。
Nucleic Acids Res. 2019 Jan 8;47(D1):D1102-D1109. doi: 10.1093/nar/gky1033.
6
Mol2vec: Unsupervised Machine Learning Approach with Chemical Intuition.Mol2vec:具有化学直觉的无监督机器学习方法。
J Chem Inf Model. 2018 Jan 22;58(1):27-35. doi: 10.1021/acs.jcim.7b00616. Epub 2018 Jan 10.
7
DrugBank 5.0: a major update to the DrugBank database for 2018.DrugBank 5.0:2018 年 DrugBank 数据库的重大更新。
Nucleic Acids Res. 2018 Jan 4;46(D1):D1074-D1082. doi: 10.1093/nar/gkx1037.
8
Classification of nervous system withdrawn and approved drugs with ToxPrint features via machine learning strategies.通过机器学习策略对具有ToxPrint特征的神经系统撤市及获批药物进行分类。
Comput Methods Programs Biomed. 2017 Apr;142:9-19. doi: 10.1016/j.cmpb.2017.02.004. Epub 2017 Feb 9.
9
Innovation in the pharmaceutical industry: New estimates of R&D costs.制药行业的创新:研发成本的新估计
J Health Econ. 2016 May;47:20-33. doi: 10.1016/j.jhealeco.2016.01.012. Epub 2016 Feb 12.
10
Post-marketing withdrawal of 462 medicinal products because of adverse drug reactions: a systematic review of the world literature.因药物不良反应导致462种药品上市后撤市:对世界文献的系统评价
BMC Med. 2016 Feb 4;14:10. doi: 10.1186/s12916-016-0553-2.