Laboratory for Molecular Modeling, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, USA.
J Chem Inf Model. 2013 Aug 26;53(8):1915-22. doi: 10.1021/ci400216q. Epub 2013 Jul 17.
We report on the prediction accuracy of ligand-based (2D QSAR) and structure-based (MedusaDock) methods used both independently and in consensus for ranking the congeneric series of ligands binding to three protein targets (UK, ERK2, and CHK1) from the CSAR 2011 benchmark exercise. An ensemble of predictive QSAR models was developed using known binders of these three targets extracted from the publicly available ChEMBL database. Selected models were used to predict the binding affinity of CSAR compounds toward the corresponding targets and rank them accordingly; the overall ranking accuracy evaluated by Spearman correlation was as high as 0.78 for UK, 0.60 for ERK2, and 0.56 for CHK1, placing our predictions in the top 10% among all the participants. In parallel, MedusaDock, designed to predict reliable docking poses, was also used for ranking the CSAR ligands according to their docking scores; the resulting accuracy (Spearman correlation) for UK, ERK2, and CHK1 were 0.76, 0.31, and 0.26, respectively. In addition, performance of several consensus approaches combining MedusaDock- and QSAR-predicted ranks altogether has been explored; the best approach yielded Spearman correlation coefficients for UK, ERK2, and CHK1 of 0.82, 0.50, and 0.45, respectively. This study shows that (i) externally validated 2D QSAR models were capable of ranking CSAR ligands at least as accurately as more computationally intensive structure-based approaches used both by us and by other groups and (ii) ligand-based QSAR models can complement structure-based approaches by boosting the prediction performances when used in consensus.
我们报告了基于配体的(2D QSAR)和基于结构的(MedusaDock)方法的预测准确性,这些方法分别独立使用和共识使用,用于对来自 CSAR 2011 基准测试的三个蛋白质靶标(UK、ERK2 和 CHK1)的同类配体系列进行排序。使用从公开的 ChEMBL 数据库中提取的这三个靶标的已知配体,开发了一组预测性 QSAR 模型。选择的模型用于预测 CSAR 化合物与相应靶标的结合亲和力,并根据预测结果对它们进行排序;通过 Spearman 相关系数评估的整体排序准确性对于 UK 高达 0.78,对于 ERK2 为 0.60,对于 CHK1 为 0.56,我们的预测结果在所有参与者中排名前 10%。同时,设计用于预测可靠对接构象的 MedusaDock 也用于根据对接分数对 CSAR 配体进行排序;对于 UK、ERK2 和 CHK1,得到的准确性(Spearman 相关系数)分别为 0.76、0.31 和 0.26。此外,还探索了几种共识方法的性能,这些方法将 MedusaDock 和 QSAR 预测的排名结合在一起;最佳方法对 UK、ERK2 和 CHK1 的 Spearman 相关系数系数分别为 0.82、0.50 和 0.45。本研究表明:(i)经过外部验证的 2D QSAR 模型至少能够像我们和其他小组使用的更计算密集的基于结构的方法一样准确地对 CSAR 配体进行排序;(ii)基于配体的 QSAR 模型可以通过在共识中使用来提高预测性能,从而补充基于结构的方法。