Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indiana University, Indianapolis, Indiana, United States.
J Chem Inf Model. 2011 Sep 26;51(9):2132-8. doi: 10.1021/ci200078f. Epub 2011 Jul 26.
The community structure-activity resource (CSAR) data sets are used to develop and test a support vector machine-based scoring function in regression mode (SVR). Two scoring functions (SVR-KB and SVR-EP) are derived with the objective of reproducing the trend of the experimental binding affinities provided within the two CSAR data sets. The features used to train SVR-KB are knowledge-based pairwise potentials, while SVR-EP is based on physicochemical properties. SVR-KB and SVR-EP were compared to seven other widely used scoring functions, including Glide, X-score, GoldScore, ChemScore, Vina, Dock, and PMF. Results showed that SVR-KB trained with features obtained from three-dimensional complexes of the PDBbind data set outperformed all other scoring functions, including best performing X-score, by nearly 0.1 using three correlation coefficients, namely Pearson, Spearman, and Kendall. It was interesting that higher performance in rank ordering did not translate into greater enrichment in virtual screening assessed using the 40 targets of the Directory of Useful Decoys (DUD). To remedy this situation, a variant of SVR-KB (SVR-KBD) was developed by following a target-specific tailoring strategy that we had previously employed to derive SVM-SP. SVR-KBD showed a much higher enrichment, outperforming all other scoring functions tested, and was comparable in performance to our previously derived scoring function SVM-SP.
社区结构-活性资源 (CSAR) 数据集用于开发和测试基于支持向量机的回归模式评分函数 (SVR)。 有两个评分函数 (SVR-KB 和 SVR-EP) 是基于重现两个 CSAR 数据集内提供的实验结合亲和力趋势的目标而得出的。 SVR-KB 所使用的特征是基于知识的成对势能,而 SVR-EP 则基于物理化学性质。 SVR-KB 和 SVR-EP 与其他七种广泛使用的评分函数进行了比较,包括 Glide、X-score、GoldScore、ChemScore、Vina、Dock 和 PMF。 结果表明,SVR-KB 经过训练,使用从 PDBbind 数据集的三维复合物中获得的特征,在使用三个相关系数(Pearson、Spearman 和 Kendall)时,其性能优于其他所有评分函数,包括性能最佳的 X-score,接近 0.1。 有趣的是,在排序性能上的提高并没有转化为在虚拟筛选中的更高富集,而虚拟筛选是使用目录中的 40 个目标有用诱饵 (DUD) 进行评估的。 为了弥补这种情况,我们根据我们之前用于推导 SVM-SP 的目标特定定制策略,开发了 SVR-KB 的变体 (SVR-KBD)。 SVR-KBD 表现出更高的富集度,优于所有测试的其他评分函数,并且与我们之前推导的评分函数 SVM-SP 的性能相当。