• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度神经网络作为一种定量构效关系的方法。

Deep neural nets as a method for quantitative structure-activity relationships.

作者信息

Ma Junshui, Sheridan Robert P, Liaw Andy, Dahl George E, Svetnik Vladimir

机构信息

Biometrics Research Department and ‡Structural Chemistry Department, Merck Research Laboratories , Rahway, New Jersey 07065, United States.

出版信息

J Chem Inf Model. 2015 Feb 23;55(2):263-74. doi: 10.1021/ci500747n. Epub 2015 Feb 17.

DOI:10.1021/ci500747n
PMID:25635324
Abstract

Neural networks were widely used for quantitative structure-activity relationships (QSAR) in the 1990s. Because of various practical issues (e.g., slow on large problems, difficult to train, prone to overfitting, etc.), they were superseded by more robust methods like support vector machine (SVM) and random forest (RF), which arose in the early 2000s. The last 10 years has witnessed a revival of neural networks in the machine learning community thanks to new methods for preventing overfitting, more efficient training algorithms, and advancements in computer hardware. In particular, deep neural nets (DNNs), i.e. neural nets with more than one hidden layer, have found great successes in many applications, such as computer vision and natural language processing. Here we show that DNNs can routinely make better prospective predictions than RF on a set of large diverse QSAR data sets that are taken from Merck's drug discovery effort. The number of adjustable parameters needed for DNNs is fairly large, but our results show that it is not necessary to optimize them for individual data sets, and a single set of recommended parameters can achieve better performance than RF for most of the data sets we studied. The usefulness of the parameters is demonstrated on additional data sets not used in the calibration. Although training DNNs is still computationally intensive, using graphical processing units (GPUs) can make this issue manageable.

摘要

神经网络在20世纪90年代被广泛用于定量构效关系(QSAR)研究。由于各种实际问题(例如,处理大问题时速度慢、难以训练、容易过拟合等),它们在21世纪初被支持向量机(SVM)和随机森林(RF)等更强大的方法所取代。在过去十年中,由于防止过拟合的新方法、更高效的训练算法以及计算机硬件的进步,神经网络在机器学习领域得以复兴。特别是深度神经网络(DNN),即具有多个隐藏层的神经网络,在许多应用中都取得了巨大成功,如计算机视觉和自然语言处理。在此我们表明,在从默克公司药物发现工作中获取的一组多样化的大型QSAR数据集上,DNN通常能比RF做出更好的前瞻性预测。DNN所需的可调参数数量相当大,但我们的结果表明,无需针对单个数据集对其进行优化,对于我们研究的大多数数据集,一组推荐参数就能比RF实现更好的性能。这些参数在未用于校准的其他数据集上也证明了其有效性。尽管训练DNN在计算上仍然很密集,但使用图形处理单元(GPU)可以使这个问题得到控制。

相似文献

1
Deep neural nets as a method for quantitative structure-activity relationships.深度神经网络作为一种定量构效关系的方法。
J Chem Inf Model. 2015 Feb 23;55(2):263-74. doi: 10.1021/ci500747n. Epub 2015 Feb 17.
2
Demystifying Multitask Deep Neural Networks for Quantitative Structure-Activity Relationships.揭开用于定量构效关系的多任务深度神经网络的神秘面纱。
J Chem Inf Model. 2017 Oct 23;57(10):2490-2504. doi: 10.1021/acs.jcim.7b00087. Epub 2017 Oct 2.
3
Support vector machine based training of multilayer feedforward neural networks as optimized by particle swarm algorithm: application in QSAR studies of bioactivity of organic compounds.基于支持向量机的多层前馈神经网络训练,经粒子群算法优化:在有机化合物生物活性定量构效关系研究中的应用
J Comput Chem. 2007 Jan 30;28(2):519-27. doi: 10.1002/jcc.20561.
4
Extreme Gradient Boosting as a Method for Quantitative Structure-Activity Relationships.极端梯度提升在定量构效关系中的应用。
J Chem Inf Model. 2016 Dec 27;56(12):2353-2360. doi: 10.1021/acs.jcim.6b00591. Epub 2016 Dec 13.
5
Dissecting Machine-Learning Prediction of Molecular Activity: Is an Applicability Domain Needed for Quantitative Structure-Activity Relationship Models Based on Deep Neural Networks?解析机器学习对分子活性的预测:基于深度神经网络的定量构效关系模型是否需要适用域?
J Chem Inf Model. 2019 Jan 28;59(1):117-126. doi: 10.1021/acs.jcim.8b00348. Epub 2018 Nov 21.
6
Comparing the Influence of Simulated Experimental Errors on 12 Machine Learning Algorithms in Bioactivity Modeling Using 12 Diverse Data Sets.比较模拟实验误差对使用 12 个不同数据集的生物活性建模中 12 种机器学习算法的影响。
J Chem Inf Model. 2015 Jul 27;55(7):1413-25. doi: 10.1021/acs.jcim.5b00101. Epub 2015 Jun 18.
7
GXNOR-Net: Training deep neural networks with ternary weights and activations without full-precision memory under a unified discretization framework.GXNOR-Net:在统一的离散化框架下使用三进制权重和激活函数训练深度神经网络,无需全精度存储。
Neural Netw. 2018 Apr;100:49-58. doi: 10.1016/j.neunet.2018.01.010. Epub 2018 Feb 2.
8
ADMET Evaluation in Drug Discovery. 18. Reliable Prediction of Chemical-Induced Urinary Tract Toxicity by Boosting Machine Learning Approaches.药物发现中的 ADMET 评估。18. 通过机器学习方法的提升实现可靠的化学诱导的泌尿道毒性预测。
Mol Pharm. 2017 Nov 6;14(11):3935-3953. doi: 10.1021/acs.molpharmaceut.7b00631. Epub 2017 Oct 27.
9
Application of support vector machine (SVM) for prediction toxic activity of different data sets.支持向量机(SVM)在不同数据集毒性活性预测中的应用。
Toxicology. 2006 Jan 16;217(2-3):105-19. doi: 10.1016/j.tox.2005.08.019. Epub 2005 Oct 5.
10
Performance of Deep and Shallow Neural Networks, the Universal Approximation Theorem, Activity Cliffs, and QSAR.深度神经网络和浅层神经网络的性能、通用逼近定理、活动悬崖和定量构效关系。
Mol Inform. 2017 Jan;36(1-2). doi: 10.1002/minf.201600118. Epub 2016 Oct 26.

引用本文的文献

1
Pushing the boundaries of few-shot learning for low-data drug discovery with a Bayesian meta-learning hypernetwork framework.利用贝叶斯元学习超网络框架拓展少样本学习在低数据药物发现中的边界。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf408.
2
PTML models of self assembled ligand free nanoparticle catalysts for cross coupling reactions.用于交叉偶联反应的自组装无配体纳米颗粒催化剂的PTML模型。
Sci Rep. 2025 Aug 14;15(1):29895. doi: 10.1038/s41598-025-14080-2.
3
Computer-aided discovery of triazolothiadiazoles as DYRK1A-targeted neuroprotective agents.
计算机辅助发现作为靶向双特异性酪氨酸磷酸化调节激酶1A的神经保护剂的三唑并噻二唑类化合物。
RSC Med Chem. 2025 Jul 14. doi: 10.1039/d5md00289c.
4
DPPPRED-IV: An Ensembled QSAR-Based Web Server for the Prediction of Dipeptidyl Peptidase 4 Inhibitors.DPPPRED-IV:一个基于定量构效关系的集成式网络服务器,用于预测二肽基肽酶4抑制剂。
Int J Mol Sci. 2025 Jun 11;26(12):5579. doi: 10.3390/ijms26125579.
5
Advancing active compound discovery for novel drug targets: insights from AI-driven approaches.推进针对新型药物靶点的活性化合物发现:人工智能驱动方法的见解。
Acta Pharmacol Sin. 2025 Jun 17. doi: 10.1038/s41401-025-01591-x.
6
Establishing predictive machine learning models for drug responses in patient derived cell culture.建立针对患者来源细胞培养中药物反应的预测性机器学习模型。
NPJ Precis Oncol. 2025 Jun 13;9(1):180. doi: 10.1038/s41698-025-00937-2.
7
A Deep Generative Model for the Inverse Design of Transition Metal Ligands and Complexes.用于过渡金属配体和配合物逆向设计的深度生成模型
JACS Au. 2025 Apr 23;5(5):2294-2308. doi: 10.1021/jacsau.5c00242. eCollection 2025 May 26.
8
"Amide - amine + alcohol = carboxylic acid." chemical reactions as linear algebraic analogies in graph neural networks.“酰胺 - 胺 + 醇 = 羧酸。” 作为图神经网络中线性代数类比的化学反应。
Chem Sci. 2025 Apr 23. doi: 10.1039/d4sc05655h.
9
Generalizable, fast, and accurate DeepQSPR with fastprop.具有快速传播的可推广、快速且准确的深度定量构效关系模型。
J Cheminform. 2025 May 13;17(1):73. doi: 10.1186/s13321-025-01013-4.
10
Opportunities and challenges with artificial intelligence in allergy and immunology: a bibliometric study.人工智能在过敏与免疫学领域的机遇与挑战:一项文献计量学研究
Front Med (Lausanne). 2025 Apr 9;12:1523902. doi: 10.3389/fmed.2025.1523902. eCollection 2025.