Department of Chemistry, College of Sciences, Shiraz University, Shiraz, Iran.
SAR QSAR Environ Res. 2012 Oct;23(7-8):665-82. doi: 10.1080/1062936X.2012.696552. Epub 2012 Jun 29.
The present work focuses on the development of an interpretable quantitative structure-activity relationship (QSAR) model for predicting the anti-HIV activities of 67 thiazolylthiourea derivatives. This set of molecules has been proposed as potent HIV-1 reverse transcriptase inhibitors (RT-INs). The molecules were encoded to a diverse set of molecular descriptors, spanning different physical and chemical properties. Monte Carlo (MC) sampling and multivariate adaptive regression spline (MARS) techniques were used to select the most important descriptors and to predict the activity of the molecules. The most important descriptor was found to be the aspherisity index. The analysis of variance (ANOVA) and interpretable spline equations showed that the geometrical shape of the molecules has considerable effect on their activities. It seems that the linear molecules are more active than symmetric top compounds. The final MARS model derived displayed a good predictive ability judging from the determination coefficient corresponding to the leave multiple out (LMO) cross-validation technique, i.e. r (2 )= 0.828 (M = 12) and r (2 )= 0.813 (M = 20). The results of this work showed that the developed spline model is robust, has a good predictive power, and can then be used as a reliable tool for designing novel HIV-1 RT-INs.
本工作专注于开发可解释的定量构效关系(QSAR)模型,以预测 67 个噻唑基硫脲衍生物的抗 HIV 活性。这组分子被提议作为有效的 HIV-1 逆转录酶抑制剂(RT-INs)。这些分子被编码为多种分子描述符,涵盖了不同的物理和化学性质。蒙特卡罗(MC)采样和多元自适应回归样条(MARS)技术被用于选择最重要的描述符并预测分子的活性。最重要的描述符被发现是各向异性指数。方差分析(ANOVA)和可解释样条方程的分析表明,分子的几何形状对其活性有相当大的影响。似乎线性分子比对称顶部化合物更活跃。从留一多重交叉验证技术(LMO)对应的决定系数来看,所得到的最终 MARS 模型显示出良好的预测能力,即 r(2)=0.828(M=12)和 r(2)=0.813(M=20)。这项工作的结果表明,所开发的样条模型是稳健的,具有良好的预测能力,因此可以作为设计新型 HIV-1 RT-INs 的可靠工具。