• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

揭开用于定量构效关系的多任务深度神经网络的神秘面纱。

Demystifying Multitask Deep Neural Networks for Quantitative Structure-Activity Relationships.

作者信息

Xu Yuting, Ma Junshui, Liaw Andy, Sheridan Robert P, Svetnik Vladimir

机构信息

Biometrics Research Department, Merck & Co., Inc. , Rahway, New Jersey 07065, United States.

Modeling and Informatics Department, Merck & Co., Inc. , Kenilworth, New Jersey 07033, United States.

出版信息

J Chem Inf Model. 2017 Oct 23;57(10):2490-2504. doi: 10.1021/acs.jcim.7b00087. Epub 2017 Oct 2.

DOI:10.1021/acs.jcim.7b00087
PMID:28872869
Abstract

Deep neural networks (DNNs) are complex computational models that have found great success in many artificial intelligence applications, such as computer vision1,2 and natural language processing.3,4 In the past four years, DNNs have also generated promising results for quantitative structure-activity relationship (QSAR) tasks.5,6 Previous work showed that DNNs can routinely make better predictions than traditional methods, such as random forests, on a diverse collection of QSAR data sets. It was also found that multitask DNN models-those trained on and predicting multiple QSAR properties simultaneously-outperform DNNs trained separately on the individual data sets in many, but not all, tasks. To date there has been no satisfactory explanation of why the QSAR of one task embedded in a multitask DNN can borrow information from other unrelated QSAR tasks. Thus, using multitask DNNs in a way that consistently provides a predictive advantage becomes a challenge. In this work, we explored why multitask DNNs make a difference in predictive performance. Our results show that during prediction a multitask DNN does borrow "signal" from molecules with similar structures in the training sets of the other tasks. However, whether this borrowing leads to better or worse predictive performance depends on whether the activities are correlated. On the basis of this, we have developed a strategy to use multitask DNNs that incorporate prior domain knowledge to select training sets with correlated activities, and we demonstrate its effectiveness on several examples.

摘要

深度神经网络(DNN)是复杂的计算模型,在许多人工智能应用中都取得了巨大成功,如计算机视觉[1,2]和自然语言处理[3,4]。在过去四年中,DNN在定量构效关系(QSAR)任务中也取得了令人鼓舞的成果[5,6]。先前的工作表明,在各种QSAR数据集上,DNN通常能比传统方法(如随机森林)做出更好的预测。研究还发现,多任务DNN模型(即在多个QSAR属性上进行训练并同时预测的模型)在许多(但并非所有)任务中比在单个数据集上单独训练的DNN表现更好。迄今为止,对于多任务DNN中嵌入的一个任务的QSAR为何能从其他不相关的QSAR任务中借用信息,尚无令人满意的解释。因此,以一种始终能提供预测优势的方式使用多任务DNN成为一项挑战。在这项工作中,我们探究了多任务DNN在预测性能上产生差异的原因。我们的结果表明,在预测过程中,多任务DNN确实会从其他任务训练集中具有相似结构的分子中借用“信号”。然而,这种借用导致预测性能变好还是变差取决于活性是否相关。基于此,我们开发了一种使用多任务DNN的策略,该策略结合先验领域知识来选择具有相关活性的训练集,并在几个例子中证明了其有效性。

相似文献

1
Demystifying Multitask Deep Neural Networks for Quantitative Structure-Activity Relationships.揭开用于定量构效关系的多任务深度神经网络的神秘面纱。
J Chem Inf Model. 2017 Oct 23;57(10):2490-2504. doi: 10.1021/acs.jcim.7b00087. Epub 2017 Oct 2.
2
Deep neural nets as a method for quantitative structure-activity relationships.深度神经网络作为一种定量构效关系的方法。
J Chem Inf Model. 2015 Feb 23;55(2):263-74. doi: 10.1021/ci500747n. Epub 2015 Feb 17.
3
Prediction of Human Cytochrome P450 Inhibition Using a Multitask Deep Autoencoder Neural Network.利用多任务深度自动编码器神经网络预测人细胞色素 P450 抑制作用。
Mol Pharm. 2018 Oct 1;15(10):4336-4345. doi: 10.1021/acs.molpharmaceut.8b00110. Epub 2018 May 30.
4
Predictive Multitask Deep Neural Network Models for ADME-Tox Properties: Learning from Large Data Sets.用于 ADME-Tox 性质的预测性多任务深度神经网络模型:从大数据集学习。
J Chem Inf Model. 2019 Mar 25;59(3):1253-1268. doi: 10.1021/acs.jcim.8b00785. Epub 2019 Jan 24.
5
Prediction of Compound Profiling Matrices, Part II: Relative Performance of Multitask Deep Learning and Random Forest Classification on the Basis of Varying Amounts of Training Data.化合物特征矩阵预测,第二部分:基于不同数量训练数据的多任务深度学习和随机森林分类的相对性能
ACS Omega. 2018 Sep 30;3(9):12033-12040. doi: 10.1021/acsomega.8b01682. Epub 2018 Sep 27.
6
Validation Study of QSAR/DNN Models Using the Competition Datasets.QSAR/DNN 模型的竞争数据集验证研究。
Mol Inform. 2020 Jan;39(1-2):e1900154. doi: 10.1002/minf.201900154. Epub 2019 Dec 18.
7
Dissecting Machine-Learning Prediction of Molecular Activity: Is an Applicability Domain Needed for Quantitative Structure-Activity Relationship Models Based on Deep Neural Networks?解析机器学习对分子活性的预测:基于深度神经网络的定量构效关系模型是否需要适用域?
J Chem Inf Model. 2019 Jan 28;59(1):117-126. doi: 10.1021/acs.jcim.8b00348. Epub 2018 Nov 21.
8
Deep learning for predicting toxicity of chemicals: a mini review.用于预测化学物质毒性的深度学习:一篇综述
J Environ Sci Health C Environ Carcinog Ecotoxicol Rev. 2018;36(4):252-271. doi: 10.1080/10590501.2018.1537563. Epub 2019 Mar 1.
9
Quantitative Toxicity Prediction Using Topology Based Multitask Deep Neural Networks.基于拓扑的多任务深度神经网络的定量毒性预测。
J Chem Inf Model. 2018 Feb 26;58(2):520-531. doi: 10.1021/acs.jcim.7b00558. Epub 2018 Jan 31.
10
The role of different sampling methods in improving biological activity prediction using deep belief network.不同采样方法在利用深度信念网络改进生物活性预测中的作用。
J Comput Chem. 2017 Feb 5;38(4):195-203. doi: 10.1002/jcc.24671. Epub 2016 Nov 14.

引用本文的文献

1
Molecular property prediction in the ultra-low data regime.超低数据量情况下的分子性质预测
Commun Chem. 2025 Jul 8;8(1):201. doi: 10.1038/s42004-025-01592-1.
2
Modeling and Interpretability Study of the Structure-Activity Relationship for Multigeneration EGFR Inhibitors.多代表皮生长因子受体(EGFR)抑制剂构效关系的建模与可解释性研究
ACS Omega. 2025 Mar 14;10(11):11176-11187. doi: 10.1021/acsomega.4c10464. eCollection 2025 Mar 25.
3
Applications of Artificial Intelligence in Drug Repurposing.人工智能在药物重新定位中的应用。
Adv Sci (Weinh). 2025 Apr;12(14):e2411325. doi: 10.1002/advs.202411325. Epub 2025 Mar 6.
4
kMoL: an open-source machine and federated learning library for drug discovery.kMoL:一个用于药物发现的开源机器学习与联邦学习库。
J Cheminform. 2025 Feb 25;17(1):22. doi: 10.1186/s13321-025-00967-9.
5
HDAC3_VS_assistant: cheminformatics-driven discovery of histone deacetylase 3 inhibitors.HDAC3与助手:基于化学信息学的组蛋白去乙酰化酶3抑制剂发现
Mol Divers. 2024 Dec 23. doi: 10.1007/s11030-024-11066-6.
6
Rapid prediction of conformationally-dependent DFT-level descriptors using graph neural networks for carboxylic acids and alkyl amines.使用图神经网络对羧酸和烷基胺进行构象依赖性DFT水平描述符的快速预测。
Digit Discov. 2024 Nov 28;4(1):222-233. doi: 10.1039/d4dd00284a. eCollection 2025 Jan 15.
7
Benchmarking Cross-Docking Strategies in Kinase Drug Discovery.激酶药物发现中的交叉对接策略基准测试
J Chem Inf Model. 2024 Dec 9;64(23):8848-8858. doi: 10.1021/acs.jcim.4c00905. Epub 2024 Nov 18.
8
off-target profiling for enhanced drug safety assessment.用于增强药物安全性评估的脱靶分析
Acta Pharm Sin B. 2024 Jul;14(7):2927-2941. doi: 10.1016/j.apsb.2024.03.002. Epub 2024 Mar 6.
9
Development of a Novel In Silico Classification Model to Assess Reactive Metabolite Formation in the Cysteine Trapping Assay and Investigation of Important Substructures.新型计算机分类模型的建立用于评估半胱氨酸捕集试验中反应代谢产物的形成,并研究重要的亚结构。
Biomolecules. 2024 Apr 30;14(5):535. doi: 10.3390/biom14050535.
10
Absorption Distribution Metabolism Excretion and Toxicity Property Prediction Utilizing a Pre-Trained Natural Language Processing Model and Its Applications in Early-Stage Drug Development.利用预训练自然语言处理模型预测吸收、分布、代谢、排泄及毒性特性及其在早期药物研发中的应用
Pharmaceuticals (Basel). 2024 Mar 17;17(3):382. doi: 10.3390/ph17030382.