• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于遗传算法的描述符选择方法优化的水溶性和分配系数预测

Prediction of aqueous solubility and partition coefficient optimized by a genetic algorithm based descriptor selection method.

作者信息

Wegner Jörg K, Zell Andreas

机构信息

Zentrum für Bioinformatik Tübingen, Universität Tübingen, Sand 1, D-72076 Tübingen, Germany.

出版信息

J Chem Inf Comput Sci. 2003 May-Jun;43(3):1077-84. doi: 10.1021/ci034006u.

DOI:10.1021/ci034006u
PMID:12767167
Abstract

The paper describes a fast and flexible descriptor selection method using a genetic algorithm variant (GA-SEC). The relevance of the descriptors will be measured using Shannon entropy (SE) and differential Shannon entropy (DSE), which have very sparse memory requirements and allow the processing of huge data sets. A small quantity of the most important descriptors will be used automatically to build a value prediction model. The most important descriptors are not a linear combination of other descriptors, but transparent, pure descriptors. We used an artificial neural network (ANN) model to predict the aqueous solubility logS and the octanol/water partition coefficient logP. The logS data set was divided into a training set of 1016 compounds and a test set of 253 compounds. A correlation coefficient of 0.93 and an empirical standard deviation of 0.54 were achieved. The logP data set was divided into a training set of 1853 compounds and a test set of 138 compounds. A correlation coefficient of 0.92 and an empirical standard deviation of 0.44 were achieved.

摘要

本文描述了一种使用遗传算法变体(GA-SEC)的快速灵活的描述符选择方法。描述符的相关性将使用香农熵(SE)和差分香农熵(DSE)来衡量,它们具有非常稀疏的内存需求,并允许处理海量数据集。少量最重要的描述符将被自动用于构建值预测模型。最重要的描述符不是其他描述符的线性组合,而是透明的、纯粹的描述符。我们使用人工神经网络(ANN)模型来预测水溶性logS和辛醇/水分配系数logP。logS数据集被分为一个包含1016种化合物的训练集和一个包含253种化合物的测试集。相关系数达到0.93,经验标准差为0.54。logP数据集被分为一个包含1853种化合物的训练集和一个包含138种化合物的测试集。相关系数达到0.92,经验标准差为0.44。

相似文献

1
Prediction of aqueous solubility and partition coefficient optimized by a genetic algorithm based descriptor selection method.基于遗传算法的描述符选择方法优化的水溶性和分配系数预测
J Chem Inf Comput Sci. 2003 May-Jun;43(3):1077-84. doi: 10.1021/ci034006u.
2
A neural network based prediction of octanol-water partition coefficients using atomic5 fragmental descriptors.基于神经网络并使用原子5片段描述符预测正辛醇-水分配系数
Bioorg Med Chem Lett. 2004 Feb 23;14(4):851-3. doi: 10.1016/j.bmcl.2003.12.024.
3
Modeling aqueous solubility.模拟水溶性。
J Chem Inf Comput Sci. 2003 May-Jun;43(3):837-41. doi: 10.1021/ci020279y.
4
Prediction of aqueous solubility based on large datasets using several QSPR models utilizing topological structure representation.基于大型数据集,利用多种采用拓扑结构表示法的定量构效关系(QSPR)模型预测水溶解度。
Chem Biodivers. 2004 Nov;1(11):1829-41. doi: 10.1002/cbdv.200490137.
5
Prediction of air-to-blood partition coefficients of volatile organic compounds using genetic algorithm and artificial neural network.利用遗传算法和人工神经网络预测挥发性有机化合物的气-血分配系数
Anal Chim Acta. 2008 Jul 7;619(2):157-64. doi: 10.1016/j.aca.2008.04.065. Epub 2008 May 13.
6
Prediction of octanol-water partition coefficients of organic compounds by multiple linear regression, partial least squares, and artificial neural network.通过多元线性回归、偏最小二乘法和人工神经网络预测有机化合物的正辛醇-水分配系数
J Comput Chem. 2009 Nov 30;30(15):2455-65. doi: 10.1002/jcc.21243.
7
Modeling the octanol-water partition coefficients by an optimized molecular connectivity index.通过优化的分子连接性指数对正辛醇-水分配系数进行建模。
J Chem Inf Model. 2005 Jul-Aug;45(4):930-8. doi: 10.1021/ci050024v.
8
QSPR model of Henry's law constant for a diverse set of organic chemicals based on genetic algorithm-radial basis function network approach.基于遗传算法-径向基函数网络方法的多种有机化学品亨利定律常数的定量构效关系模型
Chemosphere. 2007 Feb;66(11):2067-76. doi: 10.1016/j.chemosphere.2006.09.049. Epub 2006 Nov 20.
9
Computational aqueous solubility prediction for drug-like compounds in congeneric series.同系物系列中类药物化合物的计算水溶解度预测
Eur J Med Chem. 2008 Mar;43(3):501-12. doi: 10.1016/j.ejmech.2007.04.009. Epub 2007 May 6.
10
Computation of octanol-water partition coefficients by guiding an additive model with knowledge.通过知识引导加和模型计算正辛醇-水分配系数。
J Chem Inf Model. 2007 Nov-Dec;47(6):2140-8. doi: 10.1021/ci700257y. Epub 2007 Nov 7.

引用本文的文献

1
Will we ever be able to accurately predict solubility?我们是否能够准确地预测溶解度?
Sci Data. 2024 Mar 18;11(1):303. doi: 10.1038/s41597-024-03105-6.
2
-Terphenylamines, Acting as Selective COX-1 Inhibitors, Block Microglia Inflammatory Response and Exert Neuroprotective Activity.联苯胺类化合物作为选择性 COX-1 抑制剂,能够阻断小胶质细胞炎症反应并发挥神经保护活性。
Molecules. 2023 Jul 13;28(14):5374. doi: 10.3390/molecules28145374.
3
Prediction of aqueous intrinsic solubility of druglike molecules using Random Forest regression trained with Wiki-pS0 database.
使用基于Wiki-pS0数据库训练的随机森林回归预测类药物分子的水相固有溶解度。
ADMET DMPK. 2020 Mar 4;8(1):29-77. doi: 10.5599/admet.766. eCollection 2020.
4
Pruned Machine Learning Models to Predict Aqueous Solubility.用于预测水溶性的剪枝机器学习模型
ACS Omega. 2020 Jul 1;5(27):16562-16567. doi: 10.1021/acsomega.0c01251. eCollection 2020 Jul 14.
5
Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure.基于对数和正则化的化学结构生物活性描述符选择
Int J Mol Sci. 2017 Dec 22;19(1):30. doi: 10.3390/ijms19010030.
6
In Silico Prediction of Physicochemical Properties of Environmental Chemicals Using Molecular Fingerprints and Machine Learning.利用分子指纹和机器学习对环境化学品的物理化学性质进行计算机模拟预测。
J Chem Inf Model. 2017 Jan 23;57(1):36-49. doi: 10.1021/acs.jcim.6b00625. Epub 2017 Jan 9.
7
Extended solvent-contact model approach to blind SAMPL5 prediction challenge for the distribution coefficients of drug-like molecules.扩展溶剂接触模型方法用于药物类分子分配系数的盲SAMPL5预测挑战
J Comput Aided Mol Des. 2016 Nov;30(11):1019-1033. doi: 10.1007/s10822-016-9928-x. Epub 2016 Jul 23.
8
Prediction of pharmacokinetic parameters using a genetic algorithm combined with an artificial neural network for a series of alkaloid drugs.使用遗传算法结合人工神经网络对一系列生物碱药物的药代动力学参数进行预测。
Sci Pharm. 2013 Sep 22;82(1):53-70. doi: 10.3797/scipharm.1306-10. Print 2014 Jan-Mar.
9
Simultaneous feature selection and parameter optimisation using an artificial ant colony: case study of melting point prediction.使用人工蚁群进行同步特征选择和参数优化:熔点预测案例研究
Chem Cent J. 2008 Oct 29;2:21. doi: 10.1186/1752-153X-2-21.
10
Recent progress in the computational prediction of aqueous solubility and absorption.水溶性和吸收的计算预测方面的最新进展。
AAPS J. 2006 Feb 3;8(1):E27-40. doi: 10.1208/aapsj080104.