Du-Cuny Lei, Huwyler Jörg, Wiese Michael, Kansy Manfred
F. Hoffmann-La Roche Ltd., Pharmaceuticals Division, Grenzacherstrasse, CH-4070 Basel, Switzerland.
Eur J Med Chem. 2008 Mar;43(3):501-12. doi: 10.1016/j.ejmech.2007.04.009. Epub 2007 May 6.
It was the aim of the present work to develop a quantitative structure-property relationship (QSPR) model for predicting the aqueous solubility of drug-like compounds in congeneric series. Lipophilicity combined with structural fragment information, fragmental based correction factors and congeneric series indices were used as descriptors for a principal component analysis (PCA) followed by multivariate partial least squares regression statistics (PLS). The derived PLS regression model for the prediction of solubility parameters was based on an in-house data set of 2473 drug-like compounds. The generated PLS model had a coefficient of determination (R(2))=0.844 and a root-mean-square (rms) error of 0.51 log units. It predicted the solubility of the test data set with a high degree of accuracy (R(2)=0.81). In addition, the PLS model was successful in predicting the solubility of new congeneric test sets when solubility values of corresponding scaffolds were accessible.
本研究的目的是开发一种定量构效关系(QSPR)模型,用于预测同系物系列中类药物化合物的水溶性。将亲脂性与结构片段信息、基于片段的校正因子和同系物系列指数用作描述符,进行主成分分析(PCA),随后进行多元偏最小二乘回归统计(PLS)。用于预测溶解度参数的推导PLS回归模型基于2473种类药物化合物的内部数据集。生成的PLS模型的决定系数(R(2))=0.844,均方根(rms)误差为0.51对数单位。它以高度的准确性预测了测试数据集的溶解度(R(2)=0.81)。此外,当相应支架的溶解度值可获取时,PLS模型成功地预测了新的同系物测试集的溶解度。