Ghasemi Jahanbakhsh, Saaidpour Saadi
Chemistry Department, Faculty of Sciences, Razi University, Kermanshah, Iran.
Anal Chim Acta. 2007 Dec 5;604(2):99-106. doi: 10.1016/j.aca.2007.10.004. Epub 2007 Oct 11.
A quantitative structure-property relationship (QSPR) study was performed to develop models those relate the structures of 150 drug organic compounds to their n-octanol-water partition coefficients (logP(o/w)). Molecular descriptors derived solely from 3D structures of the molecular drugs. A genetic algorithm was also applied as a variable selection tool in QSPR analysis. The models were constructed using 110 molecules as training set, and predictive ability tested using 40 compounds. Modeling of logP(o/w) of these compounds as a function of the theoretically derived descriptors was established by multiple linear regression (MLR). Four descriptors for these compounds molecular volume (MV) (geometrical), hydrophilic-lipophilic balance (HLB) (constitutional), hydrogen bond forming ability (HB) (electronic) and polar surface area (PSA) (electrostatic) are taken as inputs for the model. The use of descriptors calculated only from molecular structure eliminates the need for experimental determination of properties for use in the correlation and allows for the estimation of logP(o/w) for molecules not yet synthesized. Application of the developed model to a testing set of 40 drug organic compounds demonstrates that the model is reliable with good predictive accuracy and simple formulation. The prediction results are in good agreement with the experimental value. The root mean square error of prediction (RMSEP) and square correlation coefficient (R2) for MLR model were 0.22 and 0.99 for the prediction set logP(o/w).
开展了一项定量结构-性质关系(QSPR)研究,以建立将150种药物有机化合物的结构与其正辛醇-水分配系数(logP(o/w))相关联的模型。分子描述符仅从分子药物的三维结构推导得出。遗传算法也被用作QSPR分析中的变量选择工具。使用110个分子作为训练集构建模型,并使用40种化合物测试预测能力。通过多元线性回归(MLR)建立了这些化合物的logP(o/w)作为理论推导描述符函数的模型。将这些化合物的四个描述符——分子体积(MV)(几何)、亲水亲油平衡(HLB)(组成)、氢键形成能力(HB)(电子)和极性表面积(PSA)(静电)作为模型的输入。仅从分子结构计算描述符的使用消除了为相关性使用而进行性质实验测定的需要,并允许估计尚未合成分子的logP(o/w)。将所开发的模型应用于40种药物有机化合物的测试集表明,该模型可靠,具有良好的预测准确性和简单的公式。预测结果与实验值吻合良好。对于预测集logP(o/w),MLR模型的预测均方根误差(RMSEP)和平方相关系数(R2)分别为0.22和0.99。