Center of Excellence in Electrochemistry, Faculty of Chemistry, University of Tehran, P O Box 14155-6455, Tehran, Iran.
Mol Divers. 2011 Aug;15(3):645-53. doi: 10.1007/s11030-010-9283-0. Epub 2010 Oct 8.
Multiple linear regressions (MLR) and support vector machine (SVM) were used to develop quantitative structure-activity relationship (QSAR) models of novel Hepatitis C virus (HCV) NS5B polymerase inhibitors. Various kinds of molecular descriptors were calculated to represent the molecular structures of compounds, such as chemical, topological, geometrical, and quantum descriptors. Principal component analysis (PCA) was used to select the training set. A variable selection method utilizing a genetic algorithm (GA) was employed to select from the large pool of calculated descriptors, an optimal subset of descriptors which have significant contribution to the overall inhibitory activity. The models were validated using Leave-One-Out (LOO) and Leave-Group-Out (LGO) crossvalidation, and Y-randomization test. Results demonstrated the SVM model offers powerful prediction capabilities.
采用多元线性回归(MLR)和支持向量机(SVM)方法,为新型丙型肝炎病毒(HCV) NS5B 聚合酶抑制剂构建定量构效关系(QSAR)模型。为了表示化合物的分子结构,计算了各种分子描述符,例如化学、拓扑、几何和量子描述符。主成分分析(PCA)用于选择训练集。利用遗传算法(GA)的变量选择方法,从大量计算出的描述符中选择对整体抑制活性有重要贡献的描述符的最佳子集。使用留一法(LOO)和留组法(LGO)交叉验证以及 Y 随机化检验对模型进行验证。结果表明,SVM 模型具有强大的预测能力。