Ahmadinejad Neda, Shafiei Fatemeh
Department of Chemistry, Arak Branch, Islamic Azad University, Arak, Iran.
Comb Chem High Throughput Screen. 2019;22(6):387-399. doi: 10.2174/1386207322666190708112251.
A Quantitative Structure-Activity Relationship (QSAR) has been widely developed to derive a correlation between chemical structures of molecules to their known activities. In the present investigation, QSAR models have been carried out on 76 Camptothecin (CPT) derivatives as anticancer drugs to develop a robust model for the prediction of physicochemical properties.
A training set of 60 structurally diverse CPT derivatives was used to construct QSAR models for the prediction of physiochemical parameters such as Van der Waals surface area (SvdW), Van der Waals Volume (VvdW), Molar Refractivity (MR) and Polarizability (α). The QSAR models were optimized using Multiple Linear Regression (MLR) analysis. A test set of 16 compounds was evaluated using the defined models. The Genetic Algorithm And Multiple Linear Regression Analysis (GA-MLR) were used to select the descriptors derived from the Dragon software to generate the correlation models that relate the structural features to the studied properties.
QSAR models were used to delineate the important descriptors responsible for the properties of the CPT derivatives. The statistically significant QSAR models derived by GA-MLR analysis were validated by Leave-One-Out Cross-Validation (LOOCV) and test set validation methods. The multicollinearity and autocorrelation properties of the descriptors contributed in the models were tested by calculating the Variance Inflation Factor (VIF) and the Durbin-Watson (DW) statistics.
The predictive ability of the models was found to be satisfactory. Thus, QSAR models derived from this study may be helpful for modeling and designing some new CPT derivatives and for predicting their activity.
定量构效关系(QSAR)已得到广泛发展,用于推导分子化学结构与其已知活性之间的相关性。在本研究中,对76种作为抗癌药物的喜树碱(CPT)衍生物进行了QSAR模型研究,以建立一个用于预测物理化学性质的稳健模型。
使用一组包含60种结构多样的CPT衍生物的训练集来构建QSAR模型,以预测诸如范德华表面积(SvdW)、范德华体积(VvdW)、摩尔折射率(MR)和极化率(α)等物理化学参数。通过多元线性回归(MLR)分析对QSAR模型进行优化。使用定义的模型对一组包含16种化合物的测试集进行评估。采用遗传算法和多元线性回归分析(GA-MLR)来选择从Dragon软件导出的描述符,以生成将结构特征与所研究性质相关联的相关模型。
QSAR模型用于确定对CPT衍生物性质起重要作用的描述符。通过留一法交叉验证(LOOCV)和测试集验证方法对由GA-MLR分析得出的具有统计学意义的QSAR模型进行了验证。通过计算方差膨胀因子(VIF)和杜宾-沃森(DW)统计量,对模型中描述符的多重共线性和自相关性质进行了测试。
发现模型的预测能力令人满意。因此,本研究得出的QSAR模型可能有助于对一些新的CPT衍生物进行建模和设计,并预测其活性。