• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ChemAP:通过利用多模态嵌入空间和知识蒸馏,在临床试验阶段之前利用化学结构预测药物批准。

ChemAP: predicting drug approval with chemical structures before clinical trial phase by leveraging multi-modal embedding space and knowledge distillation.

机构信息

Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 08826, Republic of Korea.

AIGENDRUG Co., Ltd, Seoul, Republic of Korea.

出版信息

Sci Rep. 2024 Oct 3;14(1):23010. doi: 10.1038/s41598-024-72868-0.

DOI:10.1038/s41598-024-72868-0
PMID:39362916
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11449903/
Abstract

Recent studies showed that the likelihood of drug approval can be predicted with clinical data and structure information of drug using computational approaches. Predicting the likelihood of drug approval can be innovative and of high impact. However, models that leverage clinical data are applicable only in clinical stages, which is not very practical. Prioritizing drug candidates and early-stage decision-making in the de novo drug development process is crucial in pharmaceutical research to optimize resource allocation. For early-stage decision-making, we need a computational model that uses only chemical structures. This seemingly impossible task may utilize the predictive power with multi-modal features including clinical data. In this work, we introduce ChemAP (Chemical structure-based drug Approval Predictor), a novel deep learning scheme for drug approval prediction in the early-stage drug discovery phase. ChemAP aims to enhance the possibility of early-stage decision-making by enriching semantic knowledge to fill in the gap between multi-modal and single-modal chemical spaces through knowledge distillation techniques. This approach facilitates the effective construction of chemical space solely from chemical structure data, guided by multi-modal knowledge related to efficacy, such as clinical trials and patents of drugs. In this study, ChemAP achieved state-of-the-art performance, outperforming both traditional machine learning and deep learning models in drug approval prediction, with AUROC and AUPRC scores of 0.782 and 0.842 respectively on the drug approval benchmark dataset. Additionally, we demonstrated its generalizability by outperforming baseline models on a recent external dataset, which included drugs from the 2023 FDA-approved list and the 2024 clinical trial failure drug list, achieving AUROC and AUPRC scores of 0.694 and 0.851. These results demonstrate that ChemAP is an effective method in predicting drug approval only with chemical structure information of drug so that decision-making can be done at the early stages of drug development process. To the best of our knowledge, our work is the first of its kind to show that prediction of drug approval is possible only with structure information of drug by defining the chemical space of approved and unapproved drugs using deep learning technology.

摘要

最近的研究表明,使用计算方法可以通过药物的临床数据和结构信息来预测药物的批准可能性。预测药物的批准可能性具有创新性和高影响力。然而,利用临床数据的模型仅适用于临床阶段,这并不十分实用。在新药开发过程中,对药物候选物进行优先级排序和早期决策对于药物研究至关重要,以优化资源分配。对于早期决策,我们需要一个仅使用化学结构的计算模型。这项看似不可能的任务可能会利用包括临床数据在内的多模态特征的预测能力。在这项工作中,我们引入了 ChemAP(基于化学结构的药物批准预测器),这是一种用于早期药物发现阶段药物批准预测的新型深度学习方案。ChemAP 旨在通过知识蒸馏技术,丰富语义知识,在多模态和单模态化学空间之间架起桥梁,从而增强早期决策的可能性。这种方法通过利用与疗效相关的多模态知识(如临床试验和药物专利),从化学结构数据中有效地构建化学空间。在这项研究中,ChemAP 在药物批准基准数据集上实现了最先进的性能,在药物批准预测方面优于传统机器学习和深度学习模型,AUROC 和 AUPRC 得分分别为 0.782 和 0.842。此外,我们通过在最近的外部数据集上优于基线模型来证明其泛化能力,该数据集包括 2023 年 FDA 批准的药物清单和 2024 年临床试验失败药物清单中的药物,AUROC 和 AUPRC 得分分别为 0.694 和 0.851。这些结果表明,ChemAP 是一种仅使用药物的化学结构信息预测药物批准的有效方法,从而可以在药物开发过程的早期阶段做出决策。据我们所知,我们的工作是第一个表明仅通过使用深度学习技术定义已批准和未批准药物的化学空间,仅使用药物的结构信息就可以预测药物批准的工作。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0b7/11449903/869b1e6e9914/41598_2024_72868_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0b7/11449903/436d400318fc/41598_2024_72868_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0b7/11449903/be1046a4846c/41598_2024_72868_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0b7/11449903/6dd1443d97e5/41598_2024_72868_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0b7/11449903/3aec6a7e84fc/41598_2024_72868_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0b7/11449903/ef9514dbd92a/41598_2024_72868_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0b7/11449903/869b1e6e9914/41598_2024_72868_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0b7/11449903/436d400318fc/41598_2024_72868_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0b7/11449903/be1046a4846c/41598_2024_72868_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0b7/11449903/6dd1443d97e5/41598_2024_72868_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0b7/11449903/3aec6a7e84fc/41598_2024_72868_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0b7/11449903/ef9514dbd92a/41598_2024_72868_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0b7/11449903/869b1e6e9914/41598_2024_72868_Fig6_HTML.jpg

相似文献

1
ChemAP: predicting drug approval with chemical structures before clinical trial phase by leveraging multi-modal embedding space and knowledge distillation.ChemAP:通过利用多模态嵌入空间和知识蒸馏,在临床试验阶段之前利用化学结构预测药物批准。
Sci Rep. 2024 Oct 3;14(1):23010. doi: 10.1038/s41598-024-72868-0.
2
DeepMPF: deep learning framework for predicting drug-target interactions based on multi-modal representation with meta-path semantic analysis.DeepMPF:基于多模态表示和元路径语义分析的深度学习框架,用于预测药物-靶标相互作用。
J Transl Med. 2023 Jan 25;21(1):48. doi: 10.1186/s12967-023-03876-3.
3
MalariaFlow: A comprehensive deep learning platform for multistage phenotypic antimalarial drug discovery.疟疾流程:用于多阶段表型抗疟药物发现的综合深度学习平台。
Eur J Med Chem. 2024 Nov 5;277:116776. doi: 10.1016/j.ejmech.2024.116776. Epub 2024 Aug 16.
4
Predicting drug-drug interactions using multi-modal deep auto-encoders based network embedding and positive-unlabeled learning.基于多模态深度自动编码器的网络嵌入和正无标签学习预测药物-药物相互作用。
Methods. 2020 Jul 1;179:37-46. doi: 10.1016/j.ymeth.2020.05.007. Epub 2020 Jun 1.
5
MDF-DTA: A Multi-Dimensional Fusion Approach for Drug-Target Binding Affinity Prediction.MDF-DTA:一种用于药物-靶标结合亲和力预测的多维融合方法。
J Chem Inf Model. 2024 Jul 8;64(13):4980-4990. doi: 10.1021/acs.jcim.4c00310. Epub 2024 Jun 18.
6
Machine learning-based prediction of drug approvals using molecular, physicochemical, clinical trial, and patent-related features.基于机器学习的药物批准预测,使用分子、物理化学、临床试验和专利相关特征。
Expert Opin Drug Discov. 2022 Dec;17(12):1425-1441. doi: 10.1080/17460441.2023.2153830. Epub 2022 Dec 13.
7
A Vision Transformer-Based Framework for Knowledge Transfer From Multi-Modal to Mono-Modal Lymphoma Subtyping Models.基于视觉Transformer 的多模态到单模态淋巴瘤分型模型知识迁移框架。
IEEE J Biomed Health Inform. 2024 Sep;28(9):5562-5572. doi: 10.1109/JBHI.2024.3407878. Epub 2024 Sep 5.
8
Multimodal fused deep learning for drug property prediction: Integrating chemical language and molecular graph.用于药物性质预测的多模态融合深度学习:整合化学语言和分子图
Comput Struct Biotechnol J. 2024 Apr 12;23:1666-1679. doi: 10.1016/j.csbj.2024.04.030. eCollection 2024 Dec.
9
AMCP Partnership Forum: Enabling the Exchange of Clinical and Economic Information Pre-FDA Approval.AMCP 合作论坛:在 FDA 批准前实现临床和经济信息的交流。
J Manag Care Spec Pharm. 2017 Jan;23(1):105-112. doi: 10.18553/jmcp.2016.16366. Epub 2016 Dec 22.
10
Exploration of chemical space with partial labeled noisy student self-training and self-supervised graph embedding.利用部分标记的噪声学生自训练和自监督图嵌入探索化学空间。
BMC Bioinformatics. 2022 May 2;23(Suppl 3):158. doi: 10.1186/s12859-022-04681-3.

本文引用的文献

1
Dual Representation Learning for Predicting Drug-Side Effect Frequency Using Protein Target Information.利用蛋白质靶点信息预测药物副作用频率的对偶表示学习
IEEE J Biomed Health Inform. 2025 Mar;29(3):1817-1827. doi: 10.1109/JBHI.2024.3350083. Epub 2025 Mar 6.
2
Predicting drug-induced liver injury using graph attention mechanism and molecular fingerprints.利用图注意力机制和分子指纹预测药物性肝损伤。
Methods. 2024 Jan;221:18-26. doi: 10.1016/j.ymeth.2023.11.014. Epub 2023 Nov 30.
3
GeoDILI: A Robust and Interpretable Model for Drug-Induced Liver Injury Prediction Using Graph Neural Network-Based Molecular Geometric Representation.
GeoDILI:基于图神经网络的分子几何表示的药物性肝损伤预测的稳健且可解释模型。
Chem Res Toxicol. 2023 Nov 20;36(11):1717-1730. doi: 10.1021/acs.chemrestox.3c00199. Epub 2023 Oct 15.
4
Drug approval prediction based on the discrepancy in gene perturbation effects between cells and humans.基于细胞和人类基因扰动效应差异的药物批准预测。
EBioMedicine. 2023 Aug;94:104705. doi: 10.1016/j.ebiom.2023.104705. Epub 2023 Jul 13.
5
A single stage knowledge distillation network for brain tumor segmentation on limited MR image modalities.基于有限模态磁共振成像的脑肿瘤分割单阶段知识蒸馏网络。
Comput Methods Programs Biomed. 2023 Oct;240:107644. doi: 10.1016/j.cmpb.2023.107644. Epub 2023 Jun 4.
6
Supervised chemical graph mining improves drug-induced liver injury prediction.监督式化学图谱挖掘可改善药物性肝损伤预测。
iScience. 2022 Dec 26;26(1):105677. doi: 10.1016/j.isci.2022.105677. eCollection 2023 Jan 20.
7
Assessing machine learning approaches for predicting failures of investigational drug candidates during clinical trials.评估机器学习方法在预测临床试验中候选药物失败中的应用。
Comput Biol Med. 2023 Feb;153:106494. doi: 10.1016/j.compbiomed.2022.106494. Epub 2022 Dec 29.
8
Machine learning-based prediction of drug approvals using molecular, physicochemical, clinical trial, and patent-related features.基于机器学习的药物批准预测,使用分子、物理化学、临床试验和专利相关特征。
Expert Opin Drug Discov. 2022 Dec;17(12):1425-1441. doi: 10.1080/17460441.2023.2153830. Epub 2022 Dec 13.
9
PubChem 2023 update.PubChem 2023 更新。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1373-D1380. doi: 10.1093/nar/gkac956.
10
Predicting drug approvals: The Novartis data science and artificial intelligence challenge.预测药物审批:诺华数据科学与人工智能挑战赛
Patterns (N Y). 2021 Jul 21;2(8):100312. doi: 10.1016/j.patter.2021.100312. eCollection 2021 Aug 13.