多任务深度学习对制药行业是否实用？

Is Multitask Deep Learning Practical for Pharma?

作者信息

Ramsundar Bharath, Liu Bowen, Wu Zhenqin, Verras Andreas, Tudor Matthew, Sheridan Robert P, Pande Vijay

机构信息

Department of Computer Science, Stanford University , Stanford, California 94305, United States.

Department of Chemistry, Stanford University , Stanford, California 94305, United States.

出版信息

J Chem Inf Model. 2017 Aug 28;57(8):2068-2076. doi: 10.1021/acs.jcim.7b00146. Epub 2017 Aug 1.

DOI:10.1021/acs.jcim.7b00146

PMID:28692267

Abstract

Multitask deep learning has emerged as a powerful tool for computational drug discovery. However, despite a number of preliminary studies, multitask deep networks have yet to be widely deployed in the pharmaceutical and biotech industries. This lack of acceptance stems from both software difficulties and lack of understanding of the robustness of multitask deep networks. Our work aims to resolve both of these barriers to adoption. We introduce a high-quality open-source implementation of multitask deep networks as part of the DeepChem open-source platform. Our implementation enables simple python scripts to construct, fit, and evaluate sophisticated deep models. We use our implementation to analyze the performance of multitask deep networks and related deep models on four collections of pharmaceutical data (three of which have not previously been analyzed in the literature). We split these data sets into train/valid/test using time and neighbor splits to test multitask deep learning performance under challenging conditions. Our results demonstrate that multitask deep networks are surprisingly robust and can offer strong improvement over random forests. Our analysis and open-source implementation in DeepChem provide an argument that multitask deep networks are ready for widespread use in commercial drug discovery.

摘要

多任务深度学习已成为计算药物发现的强大工具。然而，尽管有一些初步研究，但多任务深度网络尚未在制药和生物技术行业中广泛应用。这种缺乏接受度既源于软件困难，也源于对多任务深度网络稳健性的理解不足。我们的工作旨在解决这两个采用障碍。作为DeepChem开源平台的一部分，我们引入了多任务深度网络的高质量开源实现。我们的实现使简单的Python脚本能够构建、拟合和评估复杂的深度模型。我们使用我们的实现来分析多任务深度网络和相关深度模型在四个药物数据集上的性能（其中三个数据集以前在文献中未被分析过）。我们使用时间和邻居分割将这些数据集拆分为训练/验证/测试集，以在具有挑战性的条件下测试多任务深度学习性能。我们的结果表明，多任务深度网络出奇地稳健，并且可以比随机森林有显著改进。我们在DeepChem中的分析和开源实现表明，多任务深度网络已准备好在商业药物发现中广泛使用。

相似文献

Is Multitask Deep Learning Practical for Pharma?多任务深度学习对制药行业是否实用？

J Chem Inf Model. 2017 Aug 28;57(8):2068-2076. doi: 10.1021/acs.jcim.7b00146. Epub 2017 Aug 1.

Prediction of Human Cytochrome P450 Inhibition Using a Multitask Deep Autoencoder Neural Network.利用多任务深度自动编码器神经网络预测人细胞色素 P450 抑制作用。

Mol Pharm. 2018 Oct 1;15(10):4336-4345. doi: 10.1021/acs.molpharmaceut.8b00110. Epub 2018 May 30.

An Integrated Transfer Learning and Multitask Learning Approach for Pharmacokinetic Parameter Prediction.基于集成迁移学习和多任务学习的药代动力学参数预测方法。

Mol Pharm. 2019 Feb 4;16(2):533-541. doi: 10.1021/acs.molpharmaceut.8b00816. Epub 2019 Jan 4.

Assisting Multitargeted Ligand Affinity Prediction of Receptor Tyrosine Kinases Associated Nonsmall Cell Lung Cancer Treatment with Multitasking Principal Neighborhood Aggregation.多任务主邻域聚合辅助受体酪氨酸激酶相关非小细胞肺癌治疗的多靶向配体亲和力预测。

Molecules. 2022 Feb 11;27(4):1226. doi: 10.3390/molecules27041226.

Improved machine learning models for predicting selective compounds.改进的机器学习模型用于预测选择性化合物。

J Chem Inf Model. 2012 Jan 23;52(1):38-50. doi: 10.1021/ci200346b. Epub 2011 Dec 23.

The Next Era: Deep Learning in Pharmaceutical Research.下一个时代：药物研究中的深度学习。

Pharm Res. 2016 Nov;33(11):2594-603. doi: 10.1007/s11095-016-2029-7. Epub 2016 Sep 6.

Multitask deep networks with grid featurization achieve improved scoring performance for protein-ligand binding.基于网格特征化的多任务深度网络可提高蛋白质-配体结合的评分性能。

Chem Biol Drug Des. 2020 Sep;96(3):973-983. doi: 10.1111/cbdd.13648.

Modeling Physico-Chemical ADMET Endpoints with Multitask Graph Convolutional Networks.运用多任务图卷积网络模拟理化 ADMET 终点。

Molecules. 2019 Dec 21;25(1):44. doi: 10.3390/molecules25010044.

Deep Learning in Drug Discovery.药物研发中的深度学习

Mol Inform. 2016 Jan;35(1):3-14. doi: 10.1002/minf.201501008. Epub 2015 Dec 30.

A Multitask Approach to Learn Molecular Properties.一种学习分子性质的多任务方法。

J Chem Inf Model. 2021 Aug 23;61(8):3824-3834. doi: 10.1021/acs.jcim.1c00646. Epub 2021 Jul 21.

引用本文的文献

Metabolic profiling and antimicrobial activity of D. don by implicated through computational studies.通过计算研究揭示了多刺蚁的代谢谱和抗菌活性。（你提供的原文似乎不太完整或准确，按照字面意思翻译如上，可能需要你进一步确认原文信息。）这里的“D. don”推测可能是某种特定的蚂蚁学名，但不太明确准确所指，如果是“Dorylus donisthorpei” 多刺蚁，这样翻译会更准确些：多刺蚁的代谢谱和抗菌活性通过计算研究得以揭示。你可以根据实际情况进行调整。另外，原句中“by implicated through”表述有误，推测应该是“were implicated through”之类的正确表达。以上是基于现有内容尽可能准确的翻译及说明。

Front Pharmacol. 2025 Jul 1;16:1575727. doi: 10.3389/fphar.2025.1575727. eCollection 2025.

Prediction and Prioritisation of Novel Anthelmintic Candidates from Public Databases Using Deep Learning and Available Bioactivity Data Sets.利用深度学习和现有生物活性数据集从公共数据库中预测新型驱虫候选物并进行优先级排序。

Int J Mol Sci. 2025 Mar 28;26(7):3134. doi: 10.3390/ijms26073134.

Modeling and Interpretability Study of the Structure-Activity Relationship for Multigeneration EGFR Inhibitors.多代表皮生长因子受体（EGFR）抑制剂构效关系的建模与可解释性研究

ACS Omega. 2025 Mar 14;10(11):11176-11187. doi: 10.1021/acsomega.4c10464. eCollection 2025 Mar 25.

Recent Development, Applications, and Patents of Artificial Intelligence in Drug Design and Development.人工智能在药物设计与开发中的最新进展、应用及专利

Curr Drug Discov Technol. 2025 Feb 10. doi: 10.2174/0115701638364199250123062248.

Integrating natural product research laboratory with artificial intelligence: Advancements and breakthroughs in traditional medicine.整合天然产物研究实验室与人工智能：传统医学的进展与突破。

Biomedicine (Taipei). 2024 Dec 1;14(4):1-14. doi: 10.37796/2211-8039.1475. eCollection 2024.

A novel multitask learning algorithm for tasks with distinct chemical space: zebrafish toxicity prediction as an example.一种用于具有不同化学空间任务的新型多任务学习算法：以斑马鱼毒性预测为例。

J Cheminform. 2024 Aug 2;16(1):91. doi: 10.1186/s13321-024-00891-4.

Multimodal fused deep learning for drug property prediction: Integrating chemical language and molecular graph.用于药物性质预测的多模态融合深度学习：整合化学语言和分子图

Comput Struct Biotechnol J. 2024 Apr 12;23:1666-1679. doi: 10.1016/j.csbj.2024.04.030. eCollection 2024 Dec.

Enhancing Opioid Bioactivity Predictions through Integration of Ligand-Based and Structure-Based Drug Discovery Strategies with Transfer and Deep Learning Techniques.通过整合配体和基于结构的药物发现策略与转移和深度学习技术来增强阿片类药物的生物活性预测。

J Phys Chem B. 2023 Dec 21;127(50):10691-10699. doi: 10.1021/acs.jpcb.3c05306. Epub 2023 Dec 11.

Poor Generalization by Current Deep Learning Models for Predicting Binding Affinities of Kinase Inhibitors.当前用于预测激酶抑制剂结合亲和力的深度学习模型泛化能力较差。

bioRxiv. 2023 Sep 6:2023.09.04.556234. doi: 10.1101/2023.09.04.556234.

The Coming of Age of AI/ML in Drug Discovery, Development, Clinical Testing, and Manufacturing: The FDA Perspectives.人工智能/机器学习在药物发现、开发、临床测试和制造中的崭露头角：FDA 的观点。

Drug Des Devel Ther. 2023 Sep 6;17:2691-2725. doi: 10.2147/DDDT.S424991. eCollection 2023.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

多任务深度学习对制药行业是否实用？

Is Multitask Deep Learning Practical for Pharma?

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献