• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于亲脂性的机器学习模型及其适用范围。

Machine learning models for lipophilicity and their domain of applicability.

作者信息

Schroeter Timon, Schwaighofer Anton, Mika Sebastian, Laak Antonius Ter, Suelzle Detlev, Ganzer Ursula, Heinrich Nikolaus, Müller Klaus-Robert

机构信息

Fraunhofer FIRST, Kekuléstrasse 7, 12489 Berlin, Germany.

出版信息

Mol Pharm. 2007 Jul-Aug;4(4):524-38. doi: 10.1021/mp0700413. Epub 2007 Jul 19.

DOI:10.1021/mp0700413
PMID:17637064
Abstract

Unfavorable lipophilicity and water solubility cause many drug failures; therefore these properties have to be taken into account early on in lead discovery. Commercial tools for predicting lipophilicity usually have been trained on small and neutral molecules, and are thus often unable to accurately predict in-house data. Using a modern Bayesian machine learning algorithm--a Gaussian process model--this study constructs a log D7 model based on 14,556 drug discovery compounds of Bayer Schering Pharma. Performance is compared with support vector machines, decision trees, ridge regression, and four commercial tools. In a blind test on 7013 new measurements from the last months (including compounds from new projects) 81% were predicted correctly within 1 log unit, compared to only 44% achieved by commercial software. Additional evaluations using public data are presented. We consider error bars for each method (model based error bars, ensemble based, and distance based approaches), and investigate how well they quantify the domain of applicability of each model.

摘要

亲脂性和水溶性不佳导致许多药物研发失败;因此,在先导化合物发现的早期阶段就必须考虑这些性质。用于预测亲脂性的商业工具通常是基于小分子和中性分子进行训练的,因此往往无法准确预测内部数据。本研究使用一种现代贝叶斯机器学习算法——高斯过程模型,基于拜耳先灵医药公司的14556种药物研发化合物构建了一个log D7模型。将其性能与支持向量机、决策树、岭回归以及四种商业工具进行了比较。在对过去几个月的7013个新测量值(包括来自新项目化合物)的盲测中,81%的预测值在1个对数单位内正确,而商业软件仅达到44%。还给出了使用公共数据的其他评估结果。我们考虑了每种方法的误差条(基于模型的误差条、基于集成的误差条和基于距离的方法),并研究了它们对每个模型适用范围的量化程度。

相似文献

1
Machine learning models for lipophilicity and their domain of applicability.用于亲脂性的机器学习模型及其适用范围。
Mol Pharm. 2007 Jul-Aug;4(4):524-38. doi: 10.1021/mp0700413. Epub 2007 Jul 19.
2
Estimating the domain of applicability for machine learning QSAR models: a study on aqueous solubility of drug discovery molecules.估计机器学习定量构效关系模型的适用域:关于药物发现分子水溶性的研究
J Comput Aided Mol Des. 2007 Sep;21(9):485-98. doi: 10.1007/s10822-007-9125-z. Epub 2007 Jul 14.
3
A probabilistic approach to classifying metabolic stability.一种用于代谢稳定性分类的概率方法。
J Chem Inf Model. 2008 Apr;48(4):785-96. doi: 10.1021/ci700142c. Epub 2008 Mar 8.
4
Estimating the domain of applicability for machine learning QSAR models: a study on aqueous solubility of drug discovery molecules.估计机器学习定量构效关系模型的适用域:药物发现分子水溶性的研究
J Comput Aided Mol Des. 2007 Dec;21(12):651-64. doi: 10.1007/s10822-007-9160-9. Epub 2007 Dec 1.
5
Accurate solubility prediction with error bars for electrolytes: a machine learning approach.用于电解质的带误差线的准确溶解度预测:一种机器学习方法。
J Chem Inf Model. 2007 Mar-Apr;47(2):407-24. doi: 10.1021/ci600205g. Epub 2007 Jan 23.
6
In silico log P prediction for a large data set with support vector machines, radial basis neural networks and multiple linear regression.使用支持向量机、径向基神经网络和多元线性回归对大数据集进行计算机模拟的log P预测。
Chem Biol Drug Des. 2009 Aug;74(2):142-7. doi: 10.1111/j.1747-0285.2009.00840.x. Epub 2009 Jun 22.
7
Prediction of P-glycoprotein substrates by a support vector machine approach.基于支持向量机方法的P-糖蛋白底物预测
J Chem Inf Comput Sci. 2004 Jul-Aug;44(4):1497-505. doi: 10.1021/ci049971e.
8
Statistical geometry based prediction of nonsynonymous SNP functional effects using random forest and neuro-fuzzy classifiers.基于统计几何学,使用随机森林和神经模糊分类器预测非同义单核苷酸多态性的功能效应
Proteins. 2008 Jun;71(4):1930-9. doi: 10.1002/prot.21838.
9
Prediction of antibacterial compounds by machine learning approaches.通过机器学习方法预测抗菌化合物。
J Comput Chem. 2009 Jun;30(8):1202-11. doi: 10.1002/jcc.21148.
10
Contemporary QSAR classifiers compared.当代定量构效关系分类器比较。
J Chem Inf Model. 2007 Jan-Feb;47(1):219-27. doi: 10.1021/ci600332j.

引用本文的文献

1
Machine learning estimates of natural product conformational energies.天然产物构象能量的机器学习估计
PLoS Comput Biol. 2014 Jan;10(1):e1003400. doi: 10.1371/journal.pcbi.1003400. Epub 2014 Jan 16.
2
Paradigm shift in toxicity testing and modeling.毒性测试和建模中的范式转变。
AAPS J. 2012 Sep;14(3):473-80. doi: 10.1208/s12248-012-9358-1. Epub 2012 Apr 20.
3
Estimation of the applicability domain of kernel-based machine learning models for virtual screening.基于核的机器学习模型在虚拟筛选中适用性域的估计。
J Cheminform. 2010 Mar 11;2(1):2. doi: 10.1186/1758-2946-2-2.
4
Estimating the domain of applicability for machine learning QSAR models: a study on aqueous solubility of drug discovery molecules.估计机器学习定量构效关系模型的适用域:药物发现分子水溶性的研究
J Comput Aided Mol Des. 2007 Dec;21(12):651-64. doi: 10.1007/s10822-007-9160-9. Epub 2007 Dec 1.
5
Estimating the domain of applicability for machine learning QSAR models: a study on aqueous solubility of drug discovery molecules.估计机器学习定量构效关系模型的适用域:关于药物发现分子水溶性的研究
J Comput Aided Mol Des. 2007 Sep;21(9):485-98. doi: 10.1007/s10822-007-9125-z. Epub 2007 Jul 14.